AI bots on Reddit reaching the front page? I absolutely believe it

ByteOnBikes@slrpnk.net · 14 hours ago

AI bots on Reddit reaching the front page? I absolutely believe it

Thrife@feddit.org · 14 hours ago

Is Reddit still feeding Google's LLM, or was it just a one-time thing? Meaning, will the newest LLM-generated posts feed LLMs to generate posts?

shittydwarf@lemmy.dbzer0.com · edit-2 14 hours ago

The truly valuable data is the stuff that was created prior to LLMs; anything after that is tainted by slop. Any verifiable human data would be worth more, which is why they are simultaneously trying to erode any and all privacy.

gandalf_der_12te@discuss.tchncs.de · 8 hours ago

I’m not sure about that. It implies that only humans are able to produce high-quality output. But that seems wrong to me.

First of all, not everything that humans produce is high quality; rather the opposite.
Second, with the development of AI, I think it may well be possible for AI to generate good-quality output in the future.

morrowind@lemmy.ml · 7 hours ago

Microsoft’s Phi-4 is primarily trained on synthetic data (generated by other AIs). It’s not a future thing; it’s been happening for years.

whotookkarl@lemmy.world · 10 hours ago

These days the LLMs feed the LLMs, so you are modeling models unless you’re excluding any public data from the last decade. You have to assume all user-generated public data is tainted when used for training.