Luu Tuyen@lemmy.world to Technology@lemmy.worldEnglish · 17 hours agoTikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAIfortune.comexternal-linkmessage-square70fedilinkarrow-up1373arrow-down15
arrow-up1368arrow-down1external-linkTikTok’s parent launched a web scraper that’s gobbling up the world’s online data 25-times faster than OpenAIfortune.comLuu Tuyen@lemmy.world to Technology@lemmy.worldEnglish · 17 hours agomessage-square70fedilink
minus-squarejagged_circle@feddit.nllinkfedilinkEnglisharrow-up7·8 hours agoI think a common nginx config is to just redirect malicious bots to some well-cached terrabyte file. I think hetzner hosts one iirc
minus-squareSomething Burger 🍔linkfedilinkEnglisharrow-up7·8 hours agohttps://github.com/iamtraction/ZOD 42kB ZIP file which decompresses into 4.5 PB.
I think a common nginx config is to just redirect malicious bots to some well-cached terrabyte file. I think hetzner hosts one iirc
https://github.com/iamtraction/ZOD
42kB ZIP file which decompresses into 4.5 PB.