Five@slrpnk.net to Reddit@lemmy.worldEnglish · 6 months agoReddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments With Any (Non-Copyrighted) Text - The Ludditetheluddite.orgexternal-linkmessage-square42fedilinkarrow-up1288arrow-down18cross-posted to: hackernews@lemmy.smeargle.fansreddit@lemmy.worldquitterredditbyereddit@lemmy.worldtechnology@lemmy.world
arrow-up1280arrow-down1external-linkReddit Will License Its Data to Train LLMs, So We Made a Firefox Extension That Lets You Replace Your Comments With Any (Non-Copyrighted) Text - The Ludditetheluddite.orgFive@slrpnk.net to Reddit@lemmy.worldEnglish · 6 months agomessage-square42fedilinkcross-posted to: hackernews@lemmy.smeargle.fansreddit@lemmy.worldquitterredditbyereddit@lemmy.worldtechnology@lemmy.world
minus-squareFaceDeer@fedia.iolinkfedilinkarrow-up4arrow-down2·edit-26 months agoThe place I know about off the top of my head is academictorrents.com where you can find lots of large data sets useful for academic research. The torrent files themselves are small, so I’m sure they can be found in other places too.
The place I know about off the top of my head is academictorrents.com where you can find lots of large data sets useful for academic research. The torrent files themselves are small, so I’m sure they can be found in other places too.