Will Meta scrape and crawl through all our data now?

LilDumpy@lemmy.world · 3 years ago

Will Meta scrape and crawl through all our data now?

UziBobuzi@kbin.social · 3 years ago

They can do it anyway, without threads being in the mix at all. Unfortunately the only way to be sure no corporation can scrape your data is to not be on the internet at all.

LilDumpy@lemmy.world · 3 years ago

Ahh, very true, but aren’t there legal obligations regarding privacy if data is collected via a site vs the public web?

awderon@lemmy.world · 3 years ago

OpenAI is currently being sued because they used everything they could fin to train their AI models. We will see how that works out.

https://edition.cnn.com/2023/06/28/tech/openai-chatgpt-microsoft-data-sued/index.html

TimeIncarnate@lemmy.world · 3 years ago

Short answer is “no.”

Slightly longer answer is: “all of your public posts on Lemmy or Mastodon or any other federated platform are the Public web. So no, it’s not different.”

SkyNTP@lemmy.ml · 3 years ago

If you post something online, it is public and you have to assume someone or even everyone has already scraped and harvested the data.

It has always been like this.

If you come from an incumbent social media platform, perhaps you never got yo experience this lesson for yourself. But that data has also been harvested. They just gave you a bit of illusion of privacy.

The only thing online that is private is E2E encryption directly with a party you trust, and only if you are the only ones with a copy of the keys.

treadful@lemmy.zip · 3 years ago

The benefit now is that one company can’t get exclusive access to your public data. It’s open to anyone that wants it.

DLSchichtl@lemmy.world · edit-2 2 years ago

Removed by mod

RightHandOfIkaros@lemmy.world · edit-2 3 years ago

Always have been.

Push your local legislation to change the law in favor of consumer data protection and not infinite growing company profits.

Alexmitter@kbin.social · edit-2 3 years ago

Not more or less then they can already do by just using web bots.

escapedgoat@kbin.social · 3 years ago

…now?

sweet summer child.

Jeze3D@kbin.social · 3 years ago

They always could? These are public facing platforms. You’re being scraped by far more than just meta.

Anomander@kbin.social · 3 years ago

Yeah, absolutely nothing was preventing them from doing so already, without launching Threads.

Blocking Meta / Threads instances isn’t going to stop them, either.

KarsicKarl@kbin.social · 3 years ago

What scraping can get is very little public information.

There’s a lot of information that servers keep contained such as IP addresses of where you are when you made a post. Other info such as your email address remains contained within your own instance. Meta cannot get at that information. No other Fediverse server can get at that.

This blog from Gargoron (Eugen Rochko) who essentially created ActivityPub that underpin all these Fediverse systems including Mastodon, Calckey, Pixelfed, kbin, Lemmy etc.

https://blog.joinmastodon.org/2023/07/what-to-know-about-threads/

Ab_intra@lemmy.world · edit-2 3 years ago

What I wonder is how Lemmy handles this. He is writing about how Mastodon do things, not Lemmy.

UziBobuzi@kbin.social · 3 years ago

If you post something public, people can access it. Corporations can access it. It’s one of the reasons I ditched all my social media that identified me directly. They can scrape my stuff, sure; but they won’t be able to link it to my actual name, face or existence in real life.