I’m not finding much in the way of user agent strings for the new “AI” browsers (Perplexity Comet, GPT Atlas, Opera Neon, etc). If there’s any I’m missing, feel free to add to my list. I’m trying avoid downloading and (shudder) installing any of these just to figure out the UA string they use.

Essentially, I want to deny these browsers use of a few applications I develop (yes, this includes Tesseract).

However, I want to avoid any false positives, so I’m trying to be as specific as I can in the filter.

For now, I’m not bothering with Edge/Chrome despite them now advertising themselves as “AI Browsers”; I’m just denying the ones made directly by the AI companies.

  • Rayquetzalcoatl@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    26 days ago

    I can say that I’ve used a plugin for WordPress called Dark Visitors which seems to track and report on bot crawlers, including LLMs. Not sure how accurate their info is, but our marketing dept seems to believe it so 🤷‍♂️

    Maybe looking into what Dark Visitors does would give you pointers on how you might detect them.