nucleative@lemmy.world to Fuck AI@lemmy.world · English · 4 days ago

ClaudeBot crawled within minutes of a new site going live

Today I set up a little project website on a new subdomain. It’s not a www subdomain or a newly registered domain, either of which would be easy to detect. We’re talking about something like:

Randomchars.mydomain.com

Within 20 minutes, Anthropic’s ClaudeBot was on it. I could tell because the nginx access log showed a hit to robots.txt and then to a handful of pages.
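For anyone wanting to check their own logs the same way, here is a minimal sketch. It assumes nginx's default "combined" log format and that the crawler identifies itself with the substring "ClaudeBot" in its User-Agent; the function name `bot_hits` is made up for illustration, and a User-Agent can be spoofed, so treat matches as a hint rather than proof.

```python
import re

# Assumption: nginx's default "combined" log format.
# Adjust the pattern if your log_format directive is customized.
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

def bot_hits(lines, marker="ClaudeBot"):
    """Return (time, path, status) for each request whose User-Agent contains marker."""
    hits = []
    for line in lines:
        m = LOG_LINE.match(line)
        # Case-insensitive substring match on the User-Agent field only.
        if m and marker.lower() in m.group("agent").lower():
            hits.append((m.group("time"), m.group("path"), int(m.group("status"))))
    return hits
```

Feeding it the lines of the access log (for example via `open(...)` on wherever your nginx writes it) should list the robots.txt request first, followed by the pages fetched afterwards.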

First off, how the hell did they find it? Next, is my DNS provider, Amazon Route 53, selling this kind of data now? Or is there some kind of DNS wildcard query that would reveal it?

techconsulnerd@programming.dev · 4 days ago
Perhaps it was crawling a list of IP addresses, and your web server also serves the website when it is accessed by IP address rather than by domain or subdomain. You can configure the web server to return a blank page or a 403 error for requests made by IP address.
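The rule being suggested can be sketched in a few lines of Python. This is only an illustration of the decision logic, not nginx configuration (in nginx itself this is normally handled with a catch-all default server); the helper names `split_host`, `host_is_ip_literal`, and `status_for` are hypothetical.

```python
import ipaddress

def split_host(host_header):
    """Strip an optional port from a Host header and lowercase the result."""
    host = host_header.strip().lower()
    if host.startswith("["):           # bracketed IPv6, e.g. "[2001:db8::1]:443"
        return host[1:].split("]", 1)[0]
    if host.count(":") == 1:           # "name:port" or "1.2.3.4:port"
        return host.rsplit(":", 1)[0]
    return host

def host_is_ip_literal(host_header):
    """True if the Host header names a bare IP address rather than a hostname."""
    try:
        ipaddress.ip_address(split_host(host_header))
        return True
    except ValueError:
        return False

def status_for(host_header, allowed_hosts):
    """Return 403 for IP-literal or unknown hosts, 200 for configured hostnames."""
    if host_is_ip_literal(host_header):
        return 403
    return 200 if split_host(host_header) in allowed_hosts else 403
```

With `allowed_hosts = {"randomchars.mydomain.com"}`, a request addressed to the server's raw IP gets a 403, so a crawler walking IP ranges never sees the site content, while requests using the real hostname are served normally.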
