Most marketers use AEO checkers as black boxes — enter URL, get score, fix the red items. That's a reasonable starting point, but understanding what's actually happening under the hood lets you interpret results more accurately, prioritize fixes more intelligently, and recognize what the tool can and can't tell you.
This guide covers the technical architecture of AEO checkers: what they test, how they test it, how scores are derived, and where the methodology has real limitations.
What an AEO Checker Tests: The 7 Signal Categories
A complete AEO checker evaluates seven categories of signals that influence AI citation behavior. Most tools cover one or two. The best AEO checkers cover all seven.
Category 1: llms.txt (23% weight in tryansly's model)
llms.txt is a file placed at yourdomain.com/llms.txt that provides AI systems with a structured overview of your site's most important content, brand entities, and key pages.
An AEO checker validates:
- Whether
llms.txtexists at the expected path - Whether the file follows correct syntax (Markdown format,
# Brand Nameheader,## Sectionstructure) - Whether key pages and entities are included
- Whether the content accurately represents your brand's scope
A missing or malformed llms.txt is typically the single largest score improvement available — worth more than any other single fix. For the full implementation guide, see the llms.txt complete guide.
Category 2: Schema Markup (20% weight)
Schema.org structured data tells AI retrieval systems what your content is about and what type of content it represents. An AEO checker validates:
- FAQPage schema: Presence, correct question/answer pair structure, JSON-LD vs. Microdata format
- HowTo schema: Step-by-step content with correctly structured steps
- Article schema: Author, publishedAt, headline fields
- Product schema: For e-commerce and software product pages
- Organization and BreadcrumbList: Site-level schema signals
The checker fetches page HTML, extracts JSON-LD and Microdata, and validates against the Schema.org specification. It flags invalid properties, missing required fields, and incorrectly nested structures.
Category 3: AI Crawler Access (18% weight)
Each major AI platform operates its own web crawler:
- OpenAI →
GPTBot - Anthropic →
ClaudeBot - Perplexity →
PerplexityBot - Google →
Google-Extended(for AI training)
An AEO checker fetches yourdomain.com/robots.txt and parses every User-agent / Disallow rule. It checks whether each AI bot is explicitly allowed, explicitly blocked, or affected by a blanket User-agent: * rule. Any block is flagged as critical — a blocked bot means zero citations from that platform regardless of content quality.
For the full robots.txt configuration guide, see GPTBot and AI crawlers robots.txt audit.
Category 4: Content Extractability (15% weight)
AI retrieval systems need to extract clean, readable text from your pages. An AEO checker evaluates:
- JavaScript rendering: Is content present in server-rendered HTML, or only after JS executes?
- Heading structure: Are H1/H2/H3 headings present and descriptive?
- Content density: Is the content-to-code ratio sufficient for reliable extraction?
- Reading accessibility: Is the content in plain paragraphs, or locked behind accordions, tabs, or modals?
This is typically evaluated using a headless HTTP fetch (no JS execution) followed by text extraction and structural analysis.
Category 5: Entity Authority (12% weight)
AI models are entity-aware — they know about brands, products, people, and organizations as structured entities, not just strings of text. An AEO checker evaluates signals that establish your brand as a recognized entity:
- Whether your brand has a Knowledge Panel in Google Search
- Whether your brand appears in structured databases (Wikidata, Wikipedia)
- Whether entity-defining content (clear brand/product/category definitions) exists on your site
- Whether
OrganizationandProductschema correctly defines your brand entity
For a deep dive on entity authority signals, see entity authority and knowledge graphs for AI search.
Category 6: Citation Probe Performance (8% weight)
This is where static analysis ends and live testing begins. Citation probes submit real queries to AI platforms and record whether your brand URL appears as a cited source.
tryansly.com runs 31 structured probes across Perplexity, Claude, and ChatGPT — covering category discovery queries, comparison queries, problem-solution queries, and buying-intent queries relevant to your domain. For each probe:
- Perplexity response analyzed for your domain in citations
- Claude response analyzed for your domain in citations
- ChatGPT Browse response analyzed for your domain in citations
The result is a citation rate per platform, a total citation count, and identification of which competitor domains appear on probes where you don't.
Category 7: Technical Performance (4% weight)
Core Web Vitals — Largest Contentful Paint (LCP), Cumulative Layout Shift (CLS), Interaction to Next Paint (INP) — and overall PageSpeed score affect crawl depth and retrieval reliability. Slow pages are less thoroughly crawled; high CLS can interfere with text extraction.
This is typically evaluated via PageSpeed Insights API, which provides real-world field data.
Static Analysis vs Live Citation Probes: The Critical Distinction
The most important distinction in AEO checker methodology is between static analysis and live citation probes:
Static analysis is like a pre-flight checklist. It tests whether the conditions that should enable citations are in place — bots are allowed, schema is valid, content is extractable. A clean static analysis means citations are possible.
Live citation probes are the actual test flight. They submit real queries to real AI platforms and observe whether citations actually occur. A strong probe performance means citations are happening now, regardless of what the static analysis shows.
A tool with only static analysis can give you false confidence: "everything looks good" — but you might still have zero actual citations. A tool with only probe testing can give you results without diagnostic guidance: "you have 5% citation rate" — but no indication of why or what to fix.
The best AEO checkers — like tryansly.com — run both and correlate them. When static scores are high but citation rates are low, the issue is likely content quality or third-party corroboration gaps. When static scores are low, the technical fixes are the priority.
How Scoring Works
AEO scores are weighted aggregates of category scores, which are themselves averages of individual check results. In tryansly's model:
| Category | Weight | Example checks |
|---|---|---|
| llms.txt | 23% | File exists, valid syntax, entity completeness |
| Schema markup | 20% | FAQPage on key pages, valid JSON-LD, required fields |
| AI crawler access | 18% | GPTBot, PerplexityBot, ClaudeBot all allowed |
| Content extractability | 15% | Server-rendered content, heading structure, content density |
| Entity authority | 12% | Knowledge Panel, Wikipedia, Organization schema |
| Citation probes | 8% | Citation rate across 31 live probes |
| Technical performance | 4% | LCP, CLS, PageSpeed score |
Each individual check returns pass/fail or a graduated score. Category scores are averaged from their checks. The overall score is the weighted average of category scores.
The score is designed to prioritize: a brand with a broken llms.txt and blocked bots should see those issues dominate the score, because fixing them produces the largest citation rate improvement.
What AEO Checkers Can't Measure
No AEO checker can tell you:
- Whether your brand appears in AI model training data (base knowledge, not retrieval)
- How AI models describe your brand in generated text (sentiment and framing)
- Your citation rate on queries outside the probe set
- Your competitors' exact AEO scores (only your own relative performance)
- Future citation behavior after algorithm updates
These limitations don't reduce the tool's value — they define where to use it. For trained knowledge representation, GEO strategies are more relevant. For current retrieval citation performance, AEO checker results are the signal to act on.
For a comparison of the available AEO checking tools, see the best AEO tools 2026 guide. To run a full audit right now, visit tryansly.com — 47 checks across 7 categories plus 31 live citation probes, no login required.