From Zero to 52K AI Citations in Three Months: A Practical AEO Framework

Most people treat Answer Engine Optimization like traditional SEO with a coat of paint. It isn’t. They’re adjacent disciplines with different variables, and the sites that win in one don’t automatically win in the other.
I spent the first three months of 2026 taking a high-authority health and wellness resource from fewer than 100 AI citations across the entire web to over 52,000 — spread across AI Overviews, ChatGPT, Perplexity, Gemini, and Copilot. No new backlinks. No content pruning exercise. No new articles. The work was entirely structural.
This article is the framework I used. It’s the same framework I’m applying across other properties now, including my own independent research site. If you’re leading organic growth at any non-trivial property in 2026, you’ll need some version of this.
The Core Insight Most People Miss
Google’s ten-blue-links model and the AI-answer-engine model optimize for different things.
Traditional SEO asks: “Is this page the best answer to this query?” The answer determines ranking.
AEO asks: “Is this page one of the three or four sources the answer engine can use to construct an authoritative response?” The answer determines citation.
These are different questions. A site can rank #1 in Google for a query and be completely invisible in AI Overviews for the same query, because Google’s ranking signals and its AI Overview source selection signals are not identical. The same is true for ChatGPT, Perplexity, and Gemini — they each have their own selection logic, biased toward their own retrieval and training patterns.
This matters because AI answer engines are capturing an increasing share of informational queries. For many verticals — especially health, finance, and YMYL content where users want a definitive answer rather than a list of sources — the AI answer is becoming the first and only interaction a user has with your content.
The Framework
I organize AEO work into five pillars. Each pillar maps to a specific way answer engines evaluate whether your content is citable. You can improve each one independently. When you improve them together, the compounding effect is substantial.
Pillar 1: Information Gain
The variable: Does this page contain unique, original, or non-obvious information not available on other pages covering the same topic?
AI answer engines reward sources that contribute net-new information. A page that summarizes what ten other pages already say provides no information gain. A page that includes original research, proprietary data, expert commentary, specific case studies, or a perspective unique to the source contributes information gain.
In the health authority engagement, I worked with the editorial team to move the most unique content to the top of every piece — the clinical perspective, the cited studies, the specific frameworks this resource was known for. Most of that content already existed. It was just buried under generic introductions. Restructuring surfaced it.
What to do: For every important page, ask: “What does this page say that no other page on the open web says as clearly or credibly?” If the answer is nothing, you have a content problem, not an AEO problem. If the answer is something, make sure that something is in the first 150 words, not the last 500.
Pillar 2: Content Extractability
The variable: Can an LLM pull a clean, self-contained 2-3 sentence passage from this page and use it as a direct citation?
LLMs don’t cite entire pages. They cite passages. A page that’s a wall of text — continuous paragraphs with context-dependent references, no clear statement boundaries, and no self-contained quotable sections — is hard to cite cleanly. Even if the information is good, the LLM has to work harder to extract it, and often it simply won’t.
Content designed for extractability has:
- Direct answers within the first 100 words of any section
- Clear definition blocks for key terms
- FAQ sections where each answer stands alone as a 2-3 sentence passage
- Self-contained statements of fact that don’t require surrounding context
- Logical list and table structures where information is genuinely list-like or tabular
This is not “write for LLMs.” It’s “write for clarity.” Clarity is what makes content easy to extract. LLMs are just the current mechanism that rewards it.
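To make the extractability check concrete, here’s a minimal heuristic sketch in Python. It flags sections whose opening passage probably can’t stand alone as a 2-3 sentence citation. The pronoun list and word limit are my illustrative assumptions, not a published standard:

```python
import re

# Words that make an opening sentence depend on surrounding context.
# Illustrative assumption, not an exhaustive list.
CONTEXT_PRONOUNS = {"it", "this", "that", "these", "those", "they"}

def opening_passage(text, sentences=3):
    """Return roughly the first few sentences of a section."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return " ".join(parts[:sentences])

def extractability_issues(heading, text):
    """List reasons a section's opener may not work as a standalone citation."""
    issues = []
    passage = opening_passage(text)
    words = passage.split()
    first_word = words[0].lower().strip(",.") if words else ""
    if first_word in CONTEXT_PRONOUNS:
        issues.append("opens with a context-dependent pronoun")
    if len(words) > 80:
        issues.append("opening passage too long to quote cleanly")
    return issues
```

Run something like this over every H2/H3 section of an important page; any section that comes back with issues is a candidate for restructuring so the direct answer leads.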
Pillar 3: Structured Data Coverage
The variable: Does this page expose its key information as machine-readable structured data?
Structured data is the lowest-cost way for an LLM to extract factual information about a page. It’s pre-parsed, unambiguous, and doesn’t require interpretation. If you mark up an Article with author, datePublished, and dateModified schema, an LLM doesn’t have to guess those values from the HTML — they’re handed over directly.
The schemas that matter most for AEO:
- Article on editorial content (with author, datePublished, dateModified)
- FAQPage on any page with an FAQ section (critical — this is the single highest-ROI schema for LLM extraction)
- Organization and WebSite on the home page (establishes entity identity)
- BreadcrumbList on all nested pages (helps establish topical hierarchy)
- Person schema for authors (feeds E-E-A-T signals)
- HowTo for procedural content
- MedicalWebPage, Product, Review where domain-appropriate
Schema validation should pass on every page. A partially valid schema is often worse than no schema, because it creates ambiguity.
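A sketch of what a systematic rollout can look like in practice: generate Article and FAQPage JSON-LD from your CMS data and check for the fields called out above before publishing. Property names follow schema.org; the required-field lists are this framework’s priorities, not Google’s full validation rules:

```python
import json

# Fields this framework treats as mandatory per type (an editorial choice,
# stricter in spirit than schema.org, which requires almost nothing).
REQUIRED = {
    "Article": {"headline", "author", "datePublished", "dateModified"},
    "FAQPage": {"mainEntity"},
}

def article_schema(headline, author, published, modified):
    return {
        "@context": "https://schema.org",
        "@type": "Article",
        "headline": headline,
        "author": {"@type": "Person", "name": author},
        "datePublished": published,
        "dateModified": modified,
    }

def faq_schema(pairs):
    """pairs: list of (question, standalone 2-3 sentence answer)."""
    return {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": q,
                "acceptedAnswer": {"@type": "Answer", "text": a},
            }
            for q, a in pairs
        ],
    }

def missing_fields(schema):
    """Partially valid schema is worse than none: surface any gaps."""
    required = REQUIRED.get(schema.get("@type"), set())
    return sorted(required - schema.keys())

def jsonld_tag(schema):
    """Serialize for embedding in the page head."""
    return '<script type="application/ld+json">' + json.dumps(schema) + "</script>"
```

Wiring `missing_fields` into the publish pipeline as a blocking check is what makes the rollout systematic rather than aspirational.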
Pillar 4: Semantic HTML and Crawler Rendering Cost
The variable: How expensive is it for an AI crawler to parse this page?
AI crawlers have compute budgets like search engine crawlers. Pages that are cheap to parse get crawled more completely and more frequently. Pages that require executing megabytes of JavaScript, deep DOM traversal, or complex layout calculation get crawled less completely — and sometimes get skipped entirely.
Semantic HTML is the fix. A page that uses <article>, <section>, <nav>, <header>, <footer>, <h1> through <h3>, <p>, <ol>, <ul>, and <time> is essentially pre-labeled. The crawler doesn’t have to infer structure from nested divs. The structure is declared.
A few corollaries follow from this:
- Server-side rendering, not client-side rendering (content should be in the initial HTML response)
- Minimal DOM bloat (the fewer elements the page has, the faster it’s parsed)
- No render-blocking JavaScript for content (content should be visible without JS execution)
- Clean, small HTML pages (total page size should be modest — under 200KB is a good target for content-focused pages)
This is also a fundamental SEO and accessibility best practice. It just happens to be newly important in the AEO era.
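These checks are easy to automate. Here’s a crude stdlib-only sketch that audits a page for the three things above: total size against the 200KB target, the share of the DOM that is undifferentiated divs, and whether the core semantic elements are present at all. The thresholds are the targets named in this article applied as assumptions, not crawler-documented limits:

```python
from html.parser import HTMLParser

# The semantic elements listed above; their presence means structure
# is declared rather than inferred from nested divs.
SEMANTIC = {"article", "section", "nav", "header", "footer",
            "h1", "h2", "h3", "p", "ol", "ul", "time"}

class ParseCostAudit(HTMLParser):
    """Counts elements in the initial HTML response (HTMLParser
    lowercases tag names for us)."""
    def __init__(self):
        super().__init__()
        self.total = 0
        self.divs = 0
        self.semantic_seen = set()

    def handle_starttag(self, tag, attrs):
        self.total += 1
        if tag == "div":
            self.divs += 1
        if tag in SEMANTIC:
            self.semantic_seen.add(tag)

def audit(html_text, size_budget=200_000):
    parser = ParseCostAudit()
    parser.feed(html_text)
    return {
        "bytes_ok": len(html_text.encode()) <= size_budget,
        "div_share": parser.divs / max(parser.total, 1),
        "missing_semantic": sorted({"article", "h1", "p"} - parser.semantic_seen),
    }
```

Because this parses the raw HTML response without executing JavaScript, a client-rendered page fails it for the same reason it is expensive for an AI crawler: the content simply isn’t there.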
Pillar 5: Entity Authority and Off-Site Signals
The variable: Does the web as a whole recognize this source as authoritative on this topic?
AI answer engines don’t just evaluate your page. They evaluate your source. They look at how other authoritative sources reference you, whether you’re cited by the press, whether you show up on platforms LLMs treat as high-signal (Wikipedia, Reddit, specific industry sites), and whether your entity identity is clear and consistent across the web.
This is the slowest-to-move pillar. You can fix schema in a week. Earning citations on Wikipedia and authoritative publications takes months to years.
But it matters. In the health authority engagement, we got to 52K citations in three months because the site was already a recognized authority. The structural work unlocked visibility that the authority signals were already supporting. A newer site doing the same structural work might get to 5,000 citations in three months, not 52,000 — because the underlying authority isn’t yet there.
How I Sequence the Work
When I come into a new engagement, I don’t try to do all five pillars at once. I sequence them:
Week 1-2: Audit and measurement. Establish baseline AI citation counts across every relevant surface (AI Overviews, ChatGPT, Perplexity, Gemini, Copilot). You can’t improve what you don’t measure.
Week 2-4: Technical foundation. Fix the things that are making every page more expensive to parse — header hierarchy, redirect chains, canonical issues, JavaScript rendering problems. This is foundational SEO work that also happens to unblock AEO.
Week 4-8: Structured data rollout. Deploy schema across the site systematically. Start with Article, FAQPage, and Organization. Validate everything.
Week 6-12: Content restructuring. Work with the editorial team to surface information gain and rebuild content for extractability. This is the slowest part because it requires editorial judgment on every piece.
Ongoing: Measurement and iteration. Track citation counts weekly. Identify which page templates are winning the most AI visibility. Double down on those patterns. Kill what isn’t working.
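Even without formal tooling, the ongoing measurement step can be as simple as a spreadsheet or a small script. A hypothetical sketch of the weekly tracking loop — surface names and counts are placeholders, since none of these platforms expose an official citation-count API:

```python
from collections import defaultdict

class CitationTracker:
    """Manually recorded weekly citation counts per answer-engine surface,
    with week-over-week deltas to spot which templates are winning."""
    def __init__(self):
        self.history = defaultdict(list)  # surface -> [(week, count), ...]

    def record(self, week, surface, count):
        self.history[surface].append((week, count))

    def delta(self, surface):
        """Week-over-week change; None until two samples exist."""
        points = self.history[surface]
        if len(points) < 2:
            return None
        return points[-1][1] - points[-2][1]
```

The point is less the code than the discipline: one row per surface per week, so that “double down on what’s winning” is a query, not a guess.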
What I’d Do Differently
If I were starting this work today, I would invest in AEO-specific measurement infrastructure earlier. The tools in this space are immature. When I started the health authority engagement, I was building internal tracking manually — screenshotting AI responses, parsing citation patterns, testing prompts. I got to 52K citations without formal tooling, but I’d have gotten there faster with better instrumentation from day one.
I’d also start the structured data rollout before the content restructuring, not alongside it. Schema is mechanical work that can be deployed systematically. Content restructuring requires editorial judgment and is slower. Running them in parallel creates coordination overhead. Running schema first — as infrastructure — lets content restructuring proceed more cleanly.
What’s Next in AEO
A few predictions for the next 12 months, in rough order of when I expect them to matter:
Passage-level ranking inside AI answers. Rankings will no longer be at the URL level but at the passage level. Multiple passages from the same page will compete for citation independently.
Formal AEO measurement tools. Tools like Profound, Brand Radar, and others are early. The space needs a Semrush-equivalent for AEO. Someone will build it.
llms.txt standardization. Like robots.txt but for LLMs. Some publishers are already experimenting. Adoption is still uneven.
Agentic search workflows. As AI tools move from answering questions to executing tasks, the calculus for citation shifts. Being cited as a source during a purchase decision or a research workflow becomes more valuable than being cited in a one-off answer.
Voice and multimodal AEO. The same citation logic that governs ChatGPT text responses will govern voice assistants, agentic browser workflows, and vertical-specific LLMs. The underlying framework doesn’t change. The surfaces multiply.
The Honest Bottom Line
AEO is a real discipline, and most organizations are underinvested in it. But it’s not magic. It’s careful application of five variables — information gain, extractability, structured data, semantic HTML, and entity authority — sequenced correctly, measured rigorously, and executed patiently.
If you’re a senior SEO leader looking at the AI search shift and wondering where to start, start with the five pillars. Sequence them. Measure what you can. Iterate on what works.
The 52K citations aren’t a one-time result. They’re the output of a framework that keeps compounding.