Quick answer: AI engines cite pages that they can crawl (citation bots allowed, content in raw HTML), that are easy to quote (direct answers, statistics, clear structure), and that they can trust (sources, dates, authors, schema). You don't need to rank #1 — you need to be the easiest page to lift an answer from. Here's each lever, with the evidence behind it.
When ChatGPT search, Perplexity or Google AI answers a question, it retrieves a set of candidate pages, extracts the passages that best answer the question, and cites the sources it used. That selection is not classic ranking. The Princeton generative engine optimization study (Aggarwal et al.) tested nine on-page interventions across 10,000 queries and measured which ones increased a source's visibility in AI answers:
The pattern: AI engines reward verifiable, quotable substance, not keyword optimization.
Each vendor runs separate crawlers for training and for search citations. Allow the citation crawlers even if you block training:
| Allow for citations | Optional to block (training only) |
|---|---|
OAI-SearchBot (ChatGPT) | GPTBot |
PerplexityBot | CCBot (Common Crawl) |
Claude-SearchBot | ClaudeBot |
Googlebot | Google-Extended |
Also check for silent blockers: bot-challenge pages ("Checking your browser…") served to crawlers, noindex tags, and client-side rendering. AI crawlers do not execute JavaScript — if your content only exists after JS runs, AI sees an empty page.
<blockquote>) were the strongest single lever in the Princeton data.datePublished/dateModified in Article schema, not just visible text.sameAs links (LinkedIn, Wikidata) lets AI ground your brand as a known entity.Every check above is deterministic — it's either true of your HTML or it isn't. A free SiteBeat scan runs ~50 of them in 20 seconds and shows your AI readiness grade, the exact text AI crawlers extract from your page, a per-engine access matrix, and which trust signals are missing. The full €29 audit adds every fix, including a corrected robots.txt and prefilled JSON-LD you can paste in.
AI readiness grade · crawler matrix · copy-paste fixes
Run the free scan →