mastodon.xyz is one of the many independent Mastodon servers you can use to participate in the fediverse.
A Mastodon instance, open to everyone, but mainly English and French speaking.

Administered by:

Server stats:

743
active users

#scrapingbots

0 posts0 participants0 posts today
James Ravenscroft<p>Update on my <a href="https://social.lol/tags/anubis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>anubis</span></a> deployment. I've been hit by about 15k "suspicious" browsers today and 20 of them validated... So 0.13% of my visitors are human (or at least sophisticated enough to fool anubis). Gee wizz... <a href="https://social.lol/tags/forgejo" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>forgejo</span></a> <a href="https://social.lol/tags/scrapingrestrictions" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapingrestrictions</span></a> <a href="https://social.lol/tags/scrapingbots" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scrapingbots</span></a></p>
Liz Probert<p>How crawlers impact the operations of the Wikimedia projects <a href="https://diff.wikimedia.org/2025/04/01/how-crawlers-impact-the-operations-of-the-wikimedia-projects/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">diff.wikimedia.org/2025/04/01/</span><span class="invisible">how-crawlers-impact-the-operations-of-the-wikimedia-projects/</span></a> <a href="https://greennet.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a>, <a href="https://greennet.social/tags/Crawlers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Crawlers</span></a>, <a href="https://greennet.social/tags/Infrastructure" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Infrastructure</span></a>, <a href="https://greennet.social/tags/KnowledgeAsAService" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>KnowledgeAsAService</span></a>, <a href="https://greennet.social/tags/KnowledgeContent" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>KnowledgeContent</span></a>, <a href="https://greennet.social/tags/Operations" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Operations</span></a>, <a href="https://greennet.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a>, <a href="https://greennet.social/tags/ScrapingBots" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ScrapingBots</span></a>, <a href="https://greennet.social/tags/Traffic" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Traffic</span></a>, <a href="https://greennet.social/tags/WikimediaFoundation" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WikimediaFoundation</span></a>, <a href="https://greennet.social/tags/WikimediaProjects" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WikimediaProjects</span></a></p>
Admin Jerry<p>If <a href="https://hear-me.social/tags/Cloudflare" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Cloudflare</span></a> is to be believed, <a href="https://hear-me.social/tags/Lemmy" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Lemmy</span></a> instances have a built-in AI scraping bot operating beneath the covers. Do you think the developers have snuck it in?</p><p>Looking through my logs, these requests have all been blocked by Cloudflare because they are identified as "AI Bots". There are many more requests by Lemmy instances blocked in the logs. This is just a sample. Other Lemmy requests from these servers get through. Only a few are blocked as AI Bots. </p><p>Cloudflare says they use AI to determine if a request is a legitimate request or an AI bot trying to scrape.</p><p>207.204.58.144<br>AS19045 DIRECTCOM<br>United States<br>User agent: Lemmy/0.19.5; +<a href="https://lemmy.cryonex.net" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">lemmy.cryonex.net</span><span class="invisible"></span></a></p><p>23.127.223.238<br>AS7018 ATT-INTERNET4<br>United States<br>User agent: Lemmy/0.19.3; +<a href="https://lemux.minnix.dev" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">lemux.minnix.dev</span><span class="invisible"></span></a></p><p>2a01:cb19:f85:ec00:82fa:5bff:fe51:ed4a<br>AS3215 France Telecom - Orange<br>France<br>User agent: Lemmy/0.19.5; +<a href="https://lemmy.sidh.bzh" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">lemmy.sidh.bzh</span><span class="invisible"></span></a></p><p>50.247.53.42<br>AS7922 COMCAST-7922<br>United States<br>User agent: Lemmy/0.19.5; +<a href="https://toast.ooo" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">toast.ooo</span><span class="invisible"></span></a></p><p>69.42.19.234<br>AS11404 AS-WAVE-1<br>United States<br>User agent: Lemmy/0.19.5; +<a href="https://lemmy.schlunker.com" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">lemmy.schlunker.com</span><span class="invisible"></span></a></p><p>155.138.226.183<br>AS20473 AS-CHOOPA<br>United States<br>User agent: Lemmy/0.19.5; +<a href="https://lemmy.mbl.social" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">lemmy.mbl.social</span><span class="invisible"></span></a></p><p><a href="https://hear-me.social/tags/MastoAdmin" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MastoAdmin</span></a> <a href="https://hear-me.social/tags/AIBots" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIBots</span></a> <a href="https://hear-me.social/tags/Scrapers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scrapers</span></a> <a href="https://hear-me.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://hear-me.social/tags/ScrapingBots" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ScrapingBots</span></a> <a href="https://hear-me.social/tags/privacy" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>privacy</span></a></p>