this post was submitted on 18 Jun 2025
        
      
Just like when mastodon.social condemned Meta for its horrible moderation decisions and its inability to act in the interest of its users, and said the instance would be cutting ties with Threads and no longer federating. Then they kept on federating like nothing happened.
I don't believe anything coming out of mastodon.social unless I can see action being taken with my own two eyes.
Also, blocking scrapers is very easy, and it has nothing to do with a robots.txt (which they ignore).
How is blocking scrapers easy?
This instance sees 500+ IPs with differing user agents all connecting at once, each staying within the rate limits because the load is spread across the bots.
The only way I know it's a scraper is if they do something dumb like using "google.com" as the referrer for every request or by eyeballing the logs and noticing multiple entries from the same /12.
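For anyone who would rather not eyeball the logs, here is a rough Python sketch of those two checks. It assumes you have already parsed your access log into (client IP, referrer) pairs. The function names, the thresholds, and the IPv4-only handling are made up for illustration and are not taken from any particular server or tool.

```python
import ipaddress
from collections import Counter, defaultdict


def group_by_prefix(ips, prefix_len=12):
    """Count distinct IPv4 client addresses per /prefix_len network."""
    counts = Counter()
    for ip in set(ips):
        # strict=False masks off the host bits, e.g. 10.1.2.3/12 -> 10.0.0.0/12
        net = ipaddress.ip_network(f"{ip}/{prefix_len}", strict=False)
        counts[str(net)] += 1
    return counts


def suspicious_networks(ips, prefix_len=12, threshold=3):
    """Networks containing at least `threshold` distinct clients (threshold is arbitrary)."""
    grouped = group_by_prefix(ips, prefix_len)
    return {net: n for net, n in grouped.items() if n >= threshold}


def constant_referrer_clients(entries, referrer="google.com", min_requests=5):
    """Clients that sent the same referrer on every one of at least `min_requests` requests."""
    by_ip = defaultdict(list)
    for ip, ref in entries:
        by_ip[ip].append(ref)
    return {
        ip
        for ip, refs in by_ip.items()
        if len(refs) >= min_requests and all(r == referrer for r in refs)
    }
```

This only catches the dumb ones, which is the point being made: a scraper that rotates referrers and spreads itself across unrelated networks passes both checks.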
Exactly this, you can only stop scrapers that play by the rules.
Each one of those books powering GPT had, like, protection on it already.
The entirety of the internet disagrees.
Can you please show exactly where this was said?