• FishFace@piefed.social
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    1
    ·
    9 hours ago

    It’s hard to host anything public without relying on something like cloudflare.

    But, what makes you say it’s “ai crawlers” rather than conventional botnets and so on? Very few organisations have the resources to train large ai models, but the resources needed to run a botnet or something are much lower.

    • RedBauble@sh.itjust.works
      link
      fedilink
      arrow-up
      1
      ·
      2 hours ago

      Because the 1000 requests/10 minutes on my server are done by AmazonBot, mostly. Followed by ASNs from Huawei, Azure and the like.

      • FishFace@piefed.social
        link
        fedilink
        English
        arrow-up
        1
        ·
        36 minutes ago

        AmazonBot follows robots.txt. I don’t so what Huawei and Azure ASNs have to do with it - that sounds like those requests simply come from inside a Huawei and an Azure network, respectively, but could otherwise be anything.

    • cmnybo@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      11
      ·
      8 hours ago

      The botnets usually try to login to SSH and pages like phpmyadmin & wp-admin looking for something they can infect rather than scraping every single page on a website frequently. Unless you do something to become the target of a DDoS attack or don’t secure your server, they usually aren’t much more than a source of log spam.

    • LainTrain@lemmy.dbzer0.com
      link
      fedilink
      arrow-up
      1
      arrow-down
      12
      ·
      9 hours ago

      He just repeats what he heard elsewhere, he doesn’t really understand what the words mean, just fancy text prediction