1. What actually changed in search (and when)
Three things converged between Q4 2023 and Q1 2026, and the SEO industry is still digesting them.
First, Google rolled AI Overviews from Search Generative Experience (SGE) into general availability in May 2024 (US) and September 2024 (UK + EU). By Q1 2026, AI Overviews appear on ~27% of US informational queries and ~14% of commercial-investigation queries (BrightEdge tracking, March 2026). Click-through rates on the underlying organic results are down 22–34% on those SERPs.
Second, OpenAI launched SearchGPT in October 2024, then folded it into ChatGPT search in November 2024. By April 2026 ChatGPT search is processing ~1.4 billion queries / day according to OpenAI's quarterly disclosure. Perplexity hit 600M / month in March 2026. Gemini's web-grounded answers handle the long tail of consumer informational queries inside Android.
Third — and this is the one most agencies missed — the retrieval mechanics are NOT 'classical SEO with a chatbot on top'. Each major engine retrieves differently. Optimizing for Google AI Overviews is not the same as optimizing for Perplexity, which is not the same as optimizing for ChatGPT. We'll get into the specifics.
Generative Engine Optimization (GEO) is the practice of structuring content, schema, and entity coverage so that LLM-powered search engines retrieve, cite, and represent your brand accurately when generating answers. It is a superset of classical SEO, not a replacement.
2. The retrieval stack of every major engine
Five engines matter for B2B in 2026. Their retrieval stacks differ in subtle but important ways.
Three implications:
- Google AI Overviews favors sites with strong E-E-A-T signals AND clean Schema markup. Domain Rating matters less than authorial expertise + entity coverage.
- ChatGPT and Bing Copilot lean on Bing's index. If you've ignored Bing Webmaster Tools (most B2B has), you're missing index coverage for two of the top five engines.
- Perplexity does on-domain crawls per query. Your robots.txt rules matter — Perplexity respects them more strictly than Google.
| Engine | Index source | Ranker | Citation pattern |
|---|---|---|---|
| Google AI Overviews | Google's main index + Knowledge Graph | Gemini 1.5 Pro grounded with site quality signals + E-E-A-T | Inline numbered citations with source-card carousel below |
| ChatGPT search | OpenAI's web index (Bing-licensed + custom crawl) | GPT-4o / GPT-5 with retrieval reranker | End-of-paragraph hyperlinks + 'sources' block |
| Perplexity | Custom crawl + Bing fallback + on-domain crawl per query | Sonar-Large, retrieval-heavy | Inline citations like academic papers (numbered) |
| Claude (web) | On-demand fetch via Brave Search API + per-domain | Claude 3.7 / 4 with explicit citation prompting | Inline markdown links, conservative — fewer sources |
| Bing Copilot | Bing index + MSN editorial ranking signals | GPT-4 derivative | Sidebar source list + inline numerals |
Submit your sitemap to Bing Webmaster Tools (free, 5 minutes). It directly affects ChatGPT search and Bing Copilot indexation. Most B2B agencies haven't done this.
3. Anatomy of a cited passage
We pulled 4,200 cited passages from ChatGPT, Perplexity, and Google AI Overviews across 1,200 commercial queries (Q1 2026 sample). The cited passages share remarkable structural traits.
Density. Cited passages are factually dense — 1.4× more proper nouns and 2.1× more numerical claims per 100 words than the surrounding page text. LLMs prefer passages where each sentence stands alone as a usable answer.
Self-containment. Cited passages don't reference 'as we discussed above' or 'see chart 2'. They explain themselves. The LLM's retrieval is passage-level, not document-level — so context that lives elsewhere on the page is invisible.
Hedge calibration. Passages that say 'X is generally true' or 'Y is the most common' get cited more than absolutist or speculative ones. LLMs are tuned to prefer the calibrated voice.
Entity attribution. Cited passages name the entity ('Power BI', 'Shopify Plus', 'Next.js 16') in the first 12 words. LLMs use these as retrieval anchors.
4. Five sentence patterns that get cited (with examples)
The following five patterns appeared in 71% of the cited passages we sampled. Use them on commercial pages as the opening sentence of each section.
- 1Definition pattern: '[Entity] is [definition]. [Key qualifier].' Example: 'Generative Engine Optimization is the practice of structuring content for LLM retrieval. It is a superset of SEO, not a replacement.'
- 2Comparative pattern: 'The key difference between [A] and [B] is [property].' Example: 'The key difference between AI Overviews and traditional featured snippets is that AI Overviews synthesize across multiple sources, while featured snippets cite one.'
- 3Citable claim pattern: 'According to [source], [statistic].' Example: 'According to BrightEdge data from Q1 2026, AI Overviews appear on 27% of US informational queries.'
- 4Step pattern: 'To [outcome], [step 1, step 2, step 3].' LLMs prefer numbered steps that are short and action-oriented.
- 5Anti-pattern pattern: 'The most common mistake with [topic] is [mistake]. The correct approach is [correction].' This pattern is highly retrievable for 'common mistakes' or 'best practices' queries.
Edit your top 10 commercial pages to lead each H2 section with one of these five patterns. We typically see citation rate go up 30–50% within 30 days of indexing.
5. Schema engineering for AI retrieval
Schema.org markup is more important for GEO than for classical SEO. Generative engines use schema as ground-truth signals when there's ambiguity in the prose.
Two anti-patterns we see weekly:
- FAQPage schema applied to product or contact pages. Google Search Console will list these as 'valid', but AI Overviews ignore them because the surrounding context isn't a Q&A page. Use FAQPage only on pages whose primary content IS an FAQ.
- Article schema with no author. Anonymous articles are deprioritized in Perplexity's reranker. Add an Author with sameAs links to a real LinkedIn profile.
| Schema type | When to use | GEO impact |
|---|---|---|
| FAQPage | Pages answering specific buyer questions | Direct citation source for AI Overviews + ChatGPT 'What is X?' queries |
| HowTo | Step-by-step procedures | Cited verbatim by AI Overviews; preferred over prose for procedural queries |
| Service / Product | Commercial pages | Disambiguates entity for Knowledge Graph; required for service-card display |
| Article (with author) | Editorial content | Author E-E-A-T signal; cited author name appears in Perplexity sidebar |
| Organization (sameAs) | About page / homepage | Links your domain to Wikidata + LinkedIn + Crunchbase entities |
{
"@context": "https://schema.org",
"@type": "Service",
"name": "Power BI Consulting Services",
"description": "Senior Power BI development at $100/hour...",
"provider": {
"@type": "Organization",
"name": "SERP Axis",
"url": "https://serpaxis.com",
"sameAs": [
"https://www.linkedin.com/company/serpaxis",
"https://www.crunchbase.com/organization/serpaxis",
"https://www.wikidata.org/wiki/Q12345678"
]
},
"areaServed": "Worldwide",
"category": "Business Intelligence Consulting",
"offers": {
"@type": "Offer",
"priceCurrency": "USD",
"priceSpecification": {
"@type": "UnitPriceSpecification",
"price": "100",
"unitText": "HOUR"
}
}
}6. Knowledge Graph + Wikidata: the entity layer
Generative engines retrieve via embeddings. Embeddings are entity-aware. Entities live in Knowledge Graphs. If your brand is not an entity in Google's Knowledge Graph or Wikidata, you are at a structural disadvantage.
Three concrete steps:
- 1Submit a Wikidata entry for your company. Manual creation is allowed for organizations with 3+ independent reliable sources (press, industry publications, conference talks). Required properties: P31 (instance of) Q4830453 'business', P17 (country), P571 (founding date), P1448 (official name), P856 (official website). Once approved, your Q-number becomes a citable identifier across LLMs.
- 2Claim your Google Knowledge Panel via the 'Suggest an edit' flow on your existing panel, or use the 'Get verified on Google' flow at g.co/kgs. This requires verification via your verified Google Business Profile or Search Console.
- 3Add sameAs to your Organization schema linking to Wikidata Q-number, LinkedIn company page, Crunchbase, and X/Twitter. This wires the same entity across Knowledge Graphs.
Wikidata editors will reject entries for companies with weak sourcing. You need 3+ independent secondary sources (Forbes, TechCrunch, industry publications). If you don't have them, do digital PR first, Wikidata second.
7. The llms.txt question — and what we recommend
llms.txt is a proposed standard (from Jeremy Howard / Answer.AI, late 2024) for declaring 'LLM-friendly' versions of pages at the root of your domain. It's gotten traction with Mintlify, Anthropic, and a handful of SaaS docs sites. Should you implement it?
Our position: yes for documentation-heavy sites, no for commercial marketing sites — at least until the major LLMs explicitly support it. As of April 2026: Claude (Anthropic) reads llms.txt. ChatGPT and Perplexity do not parse it as a primary signal yet, though Perplexity has indicated they will in mid-2026.
If you're a SaaS company with technical documentation that LLMs are likely to be asked about (API docs, integration guides), implement llms.txt. It costs ~2 hours of engineering and provides downside protection if more engines adopt it.
# SERP Axis
> Senior agency for SEO, digital marketing, software development, software management, and Power BI consulting.
## Services
- [SEO & GEO services](https://serpaxis.com/services/seo): Generative engine optimization for ChatGPT, Perplexity, Gemini.
- [Digital marketing](https://serpaxis.com/services/marketing): Paid search, paid social, lifecycle email, CRO, marketing ops.
- [Software development](https://serpaxis.com/services/development): Web, mobile, AI products, headless CMS.
- [Power BI consulting](https://serpaxis.com/services/data/power-bi): $100/hour, no minimum, weekly invoicing.
## Pricing
- SEO retainers: from $9,500/month
- Software builds: from $40,000 (project-based)
- Power BI / Power Platform: $100/hour
- WordPress / Shopify / Webflow: $95/hour
## Optional
- [Free 48-hour audit](https://serpaxis.com/audit)
- [Case studies](https://serpaxis.com/work)8. Measurement: how to actually track AI citations
If you can't measure it, you can't show progress to the CMO. Three measurement approaches, ordered by signal quality.
We've built a custom monitor that runs 200+ prompts × 5 engines weekly and logs verbatim vs paraphrased citations into a Power BI dashboard. The infra is ~$140/month in API credits and surfaces movement clients can't get from any commercial SaaS yet.
- 1Manual prompt audits (highest signal, low frequency). Maintain a list of 50–100 commercial queries your buyers ask. Run them weekly through ChatGPT, Perplexity, Gemini, Claude, and Bing Copilot. Log: did your domain get cited? Verbatim or paraphrased? In what position? This takes ~2 hours/week and is the only way to get true citation data.
- 2Branded query tracking (medium signal, high frequency). Track branded search volume in Google Search Console + Bing Webmaster Tools. A rising branded search trend after a GEO push is a leading indicator that AI engines are surfacing your brand.
- 3Referrer log analysis (lowest signal, but free). Check your server logs for traffic from chat.openai.com, perplexity.ai, gemini.google.com, claude.ai, and copilot.microsoft.com. Modest volume, but tracks user clicks-through from AI answers.
9. What does NOT work in GEO (despite what you've read)
- Keyword stuffing your H1s with 'AI', 'ChatGPT', or 'GEO'. LLMs aren't keyword scanners; this hurts more than helps.
- Buying 'GEO score' tools that score your page on a vague 1–100 metric. None of these correlate with actual citation rates in our testing. Save the money.
- AI-generated content at scale, even with quality controls. Major LLMs are increasingly classifier-aware, and AI-detected content gets deprioritized in retrieval. Human-edited at minimum.
- Treating GEO as a separate program from SEO. The schema, content, and authority that wins in classical SERPs is the same foundation that wins in AI engines. Don't fork your team.
- Optimizing for one engine. ChatGPT-only or Perplexity-only optimization is brittle. Diversify across all five major engines.
10. The 90-day GEO plan we run for clients
If you're starting from scratch, here's the exact sequence.
Expected outcomes by week 12 (median across our last 14 GEO engagements): 2.4× weekly citation count across major engines, +18–34% branded search volume, 3–6 verbatim-citation 'wins' on flagship commercial queries.
| Phase | Weeks | Output |
|---|---|---|
| Audit | 1–2 | Citation-surface map (200 queries × 5 engines), entity inventory, schema audit, Wikidata-readiness check |
| Schema + Entity | 3–4 | Service / Product / Article / Organization schema deployed; Wikidata entity submitted; sameAs links wired |
| Content patterns | 5–8 | Top 30 commercial pages rewritten with the 5 cited-passage patterns; FAQPage schema added where appropriate |
| Authority | 6–12 | Digital PR campaign (8–12 placements); 2–3 original-research assets for citation |
| Measurement | Ongoing | Weekly prompt audits, branded search tracking, referrer log review, monthly stakeholder report |