Ideas to build a federated StackExchange alternative

lemmyreader · 2 months ago

@DaseinPickle@leminal.space · 2 months ago

How could anybody stop the AI robbers from stealing content from the fediverse?

@delirious_owl@discuss.online · 2 months ago

Why does that matter? The content is licensed CC BY-SA. The point here is to prevent AI answers.

@DaseinPickle@leminal.space · 2 months ago

It seems to matter for the users at Stack Overflow. And why should anybody give anything for free to the crooks in Silicon Valley? All they do is create technology designed to extract value out of people and give as little as possible back.

@delirious_owl@discuss.online · 2 months ago

Because that’s the nature of FOSS. The good news is, if they trained on your data that’s licensed CC BY-SA (as all SO content is), then you can request their source code, and they legally must provide it.

This is a good thing.

velox_vulnus · 2 months ago

In the Fediverse, everyone gets access to the data. However, if privacy is what’s bugging you, then you’re free to use a forum. That forum is still going to be archived by someone on the internet, so in a way, the stuff you post on the internet is not going to be private. There’s nothing that can be done about it, except for going under a pseudonym. However, the same cannot be said for Stack Exchange. Will they let you parse their site for free, when Reddit and other private platforms are charging money for the same? They’re using 16 years’ worth of free volunteer work to make lots of bucks.
In their quest for integrating AI, now the new site will vomit verbal diarrhea. Humans don’t do that. These language models are absolutely terrible in their tasks. They can’t replace humans, at least for now, we know it.
Earlier, the site was free, and their means of earning was through some sort of enterprise solution, but now that they’re going to add AI, it is going to be very resource-intensive. Who is paying for all of that? We have to, from our own pockets, for low quality answers, with no respect to the question asked by the user? Yeah, welcome to paywall 2.0!
Their lofty model will use answers from the 2010s as training data, most of which aren’t applicable today. Will you be using X11 configs for Wayland on Linux? Or GTK+ solutions for GTK4?

@DaseinPickle@leminal.space · 2 months ago

It’s not about privacy. It’s about AI companies stealing other people’s work and knowledge and profiting. Like what they did with artists. And I think that’s bothering a lot of people. It’s kind of sad that we cannot exchange information with each other for free, without some Silicon Valley crooks taking advantage and trying to convert other people’s good will into profit.

These LLMs are also polluting the web with AI junk and slop. The web is absolutely tainted with shitty ChatGPT text and images, making it harder and harder to find authentic information. I think a lot of people don’t want to contribute to that.

lemmyreader · 2 months ago

robots.txt may help: https://neil-clarke.com/block-the-bots-that-feed-ai-models-by-scraping-your-website. Blocking by IP address is another option.
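
For reference, here is a minimal sketch of what such a robots.txt could look like, checked with the parser from Python’s standard library. GPTBot and CCBot are the user-agent tokens published by OpenAI and Common Crawl. The example.org URL and the is_allowed helper are made up for illustration. Keep in mind that honoring robots.txt is voluntary on the crawler’s side.

```python
from urllib.robotparser import RobotFileParser

# Example robots.txt: block two known AI-related crawlers, allow everyone else.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Hypothetical helper: would a well-behaved crawler be allowed to fetch url?"""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.org/post/1"  # placeholder URL
    for agent in ("GPTBot", "CCBot", "Mozilla/5.0"):
        print(agent, is_allowed(ROBOTS_TXT, agent, page))
```

This only tells you what a compliant crawler would do. A scraper that ignores robots.txt is not stopped by it, which is where IP blocking would come in.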

velox_vulnus · 2 months ago

No, it can’t. I may be using robots.txt on, say, lemmy.ml, but those posts will still be broadcast to lemmy.world or hexbear.net.
