Technology

Do generative AI search engines like Perplexity and Google Overviews respect robots.txt or steal publisher traffic?

🤖 AI reviewed 📅 Jun 1, 2026 👨‍⚕️ Expert reviewed ✍️ TryQuerra Editorial Team

Verdict

Evidence suggests that generative AI search engines like Perplexity have been reported to bypass robots.txt protocols, potentially impacting publisher traffic. While some sources claim Perplexity respects robots.txt, others report instances., but effects can vary across users and contexts.

Generative AI search engines like Perplexity have faced scrutiny for potentially bypassing robots.txt protocols, impacting publisher traffic.

Based on 8 reviewed sources including New Data Shows Just How Badly OpenAI And Perplexity Are Screwing Over Publishers, New data shows just how badly OpenAI and Perplexity are screwing over publishers, Perplexity Is a Bullshit Machine | WIRED.

Trust Score: 83%

8 sources reviewed

Updated Jun 1, 2026

Trust score breakdown ?

Source quality

88%

Source diversity

93%

Consensus strength

84%

Freshness

76%

Expert agreement

86%

Source agreement

90%

Score is an AI-weighted composite using 8 sources. Higher source agreement means fewer meaningful contradictions across reviewed sources. Learn how we calculate trust →

Full answer body

Expanded summary

Generative AI search engines like Perplexity have faced scrutiny for potentially bypassing robots.txt protocols, impacting publisher traffic. While some sources claim that Perplexity respects robots.txt directives, others report instances of it ignoring the protocol and accessing content without permission. For example, Wired confirmed that Perplexity accessed webpages regardless of robots.txt instructions. This raises concerns about the ethical implications of AI bots on publisher traffic and content monetization. The situation is complex, with conflicting reports on whether Perplexity adheres to robots.txt guidelines consistently.

Full analysis

Limitations and Caveats

The extent to which Perplexity bypasses robots.txt protocols is not definitively established.

Conclusion

Evidence highlights

Perplexity has been reported to access webpages regardless of robots.txt instructions.
Some sources claim that major AI platforms, including Perplexity, respect robots.txt directives.
There are conflicting reports on whether Perplexity consistently follows robots.txt guidelines.

Disagreements and caveats