1. What actually changed in search (and when)

Three things converged between Q4 2023 and Q1 2026, and the SEO industry is still digesting them.

First, Google rolled AI Overviews from Search Generative Experience (SGE) into general availability in May 2024 (US) and September 2024 (UK + EU). AI Overviews now appear on a large and growing share of US queries — well over a quarter of informational searches by 2026, and a meaningful slice of commercial-investigation queries too. Click-through rates on the underlying organic results decline measurably on those SERPs.

Second, OpenAI launched SearchGPT in October 2024, then folded it into ChatGPT search in November 2024. By April 2026 ChatGPT search is processing ~1.4 billion queries / day according to OpenAI's quarterly disclosure. Perplexity hit 600M / month in March 2026. Gemini's web-grounded answers handle the long tail of consumer informational queries inside Android.

Third — and this is the one most agencies missed — the retrieval mechanics are NOT 'classical SEO with a chatbot on top'. Each major engine retrieves differently. Optimizing for Google AI Overviews is not the same as optimizing for Perplexity, which is not the same as optimizing for ChatGPT. We'll get into the specifics.

Working definition

Generative Engine Optimization (GEO) is the practice of structuring content, schema, and entity coverage so that LLM-powered search engines retrieve, cite, and represent your brand accurately when generating answers. It is a superset of classical SEO, not a replacement.

2. The retrieval stack of every major engine

Five engines matter for B2B in 2026. Their retrieval stacks differ in subtle but important ways.

Three implications:

Google AI Overviews favors sites with strong E-E-A-T signals AND clean Schema markup. Domain Rating matters less than authorial expertise + entity coverage.
ChatGPT and Bing Copilot lean on Bing's index. If you've ignored Bing Webmaster Tools (most B2B has), you're missing index coverage for two of the top five engines.
Perplexity does on-domain crawls per query. Your robots.txt rules matter — Perplexity respects them more strictly than Google.

Engine	Index source	Ranker	Citation pattern
Google AI Overviews	Google's main index + Knowledge Graph	Gemini 1.5 Pro grounded with site quality signals + E-E-A-T	Inline numbered citations with source-card carousel below
ChatGPT search	OpenAI's web index (Bing-licensed + custom crawl)	GPT-4o / GPT-5 with retrieval reranker	End-of-paragraph hyperlinks + 'sources' block
Perplexity	Custom crawl + Bing fallback + on-domain crawl per query	Sonar-Large, retrieval-heavy	Inline citations like academic papers (numbered)
Claude (web)	On-demand fetch via Brave Search API + per-domain	Claude 3.7 / 4 with explicit citation prompting	Inline markdown links, conservative — fewer sources
Bing Copilot	Bing index + MSN editorial ranking signals	GPT-4 derivative	Sidebar source list + inline numerals

Bing Webmaster Tools

Submit your sitemap to Bing Webmaster Tools (free, 5 minutes). It directly affects ChatGPT search and Bing Copilot indexation. Most B2B agencies haven't done this.

3. Anatomy of a cited passage

Look closely at the passages ChatGPT, Perplexity, and Google AI Overviews actually quote, and they share a set of recurring structural traits. These are well-established observations about how generative engines select what to cite.

Density. Cited passages tend to be factually dense — more proper nouns and numerical claims per 100 words than the surrounding page text. LLMs prefer passages where each sentence stands alone as a usable answer.

Self-containment. Cited passages don't reference 'as we discussed above' or 'see chart 2'. They explain themselves. The LLM's retrieval is passage-level, not document-level — so context that lives elsewhere on the page is invisible.

Hedge calibration. Passages that say 'X is generally true' or 'Y is the most common' get cited more than absolutist or speculative ones. LLMs are tuned to prefer the calibrated voice.

Entity attribution. Cited passages name the entity ('Power BI', 'Shopify Plus', 'Next.js 16') in the first 12 words. LLMs use these as retrieval anchors.

4. Five sentence patterns that get cited (with examples)

The following five patterns show up again and again in passages that get cited. Use them on commercial pages as the opening sentence of each section.

1Definition pattern: '[Entity] is [definition]. [Key qualifier].' Example: 'Generative Engine Optimization is the practice of structuring content for LLM retrieval. It is a superset of SEO, not a replacement.'
2Comparative pattern: 'The key difference between [A] and [B] is [property].' Example: 'The key difference between AI Overviews and traditional featured snippets is that AI Overviews synthesize across multiple sources, while featured snippets cite one.'
3Citable claim pattern: 'According to [source], [statistic].' Example: 'According to industry tracking in 2026, AI Overviews appear on well over a quarter of US informational queries.'
4Step pattern: 'To [outcome], [step 1, step 2, step 3].' LLMs prefer numbered steps that are short and action-oriented.
5Anti-pattern pattern: 'The most common mistake with [topic] is [mistake]. The correct approach is [correction].' This pattern is highly retrievable for 'common mistakes' or 'best practices' queries.

Edit your top 10 commercial pages to lead each H2 section with one of these five patterns. Leading with one of these patterns tends to improve how readily engines cite the passage once it's indexed.

5. Schema engineering for AI retrieval

Schema.org markup is more important for GEO than for classical SEO. Generative engines use schema as ground-truth signals when there's ambiguity in the prose.

Two common anti-patterns to avoid:

FAQPage schema applied to product or contact pages. Google Search Console will list these as 'valid', but AI Overviews ignore them because the surrounding context isn't a Q&A page. Use FAQPage only on pages whose primary content IS an FAQ.
Article schema with no author. Anonymous articles are deprioritized in Perplexity's reranker. Add an Author with sameAs links to a real LinkedIn profile.

Schema type	When to use	GEO impact
FAQPage	Pages answering specific buyer questions	Direct citation source for AI Overviews + ChatGPT 'What is X?' queries
HowTo	Step-by-step procedures	Cited verbatim by AI Overviews; preferred over prose for procedural queries
Service / Product	Commercial pages	Disambiguates entity for Knowledge Graph; required for service-card display
Article (with author)	Editorial content	Author E-E-A-T signal; cited author name appears in Perplexity sidebar
Organization (sameAs)	About page / homepage	Links your domain to Wikidata + LinkedIn + Crunchbase entities

Example: Service schema with sameAs entity bindings (Power BI consulting page)

json

{
  "@context": "https://schema.org",
  "@type": "Service",
  "name": "Power BI Consulting Services",
  "description": "Senior Power BI development at $45/hour...",
  "provider": {
    "@type": "Organization",
    "name": "SERP Axis",
    "url": "https://serpaxis.com",
    "sameAs": [
      "https://www.linkedin.com/company/serpaxis",
      "https://www.crunchbase.com/organization/serpaxis",
      "https://www.wikidata.org/wiki/Q12345678"
    ]
  },
  "areaServed": "Worldwide",
  "category": "Business Intelligence Consulting",
  "offers": {
    "@type": "Offer",
    "priceCurrency": "USD",
    "priceSpecification": {
      "@type": "UnitPriceSpecification",
      "price": "45",
      "unitText": "HOUR"
    }
  }
}

6. Knowledge Graph + Wikidata: the entity layer

Generative engines retrieve via embeddings. Embeddings are entity-aware. Entities live in Knowledge Graphs. If your brand is not an entity in Google's Knowledge Graph or Wikidata, you are at a structural disadvantage.

Three concrete steps:

1Submit a Wikidata entry for your company. Manual creation is allowed for organizations with 3+ independent reliable sources (press, industry publications, conference talks). Required properties: P31 (instance of) Q4830453 'business', P17 (country), P571 (founding date), P1448 (official name), P856 (official website). Once approved, your Q-number becomes a citable identifier across LLMs.
2Claim your Google Knowledge Panel via the 'Suggest an edit' flow on your existing panel, or use the 'Get verified on Google' flow at g.co/kgs. This requires verification via your verified Google Business Profile or Search Console.
3Add sameAs to your Organization schema linking to Wikidata Q-number, LinkedIn company page, Crunchbase, and X/Twitter. This wires the same entity across Knowledge Graphs.

Wikidata is curated

Wikidata editors will reject entries for companies with weak sourcing. You need 3+ independent secondary sources (Forbes, TechCrunch, industry publications). If you don't have them, do digital PR first, Wikidata second.

7. The llms.txt question — and what we recommend

llms.txt is a proposed standard (from Jeremy Howard / Answer.AI, late 2024) for declaring 'LLM-friendly' versions of pages at the root of your domain. It's gotten traction with Mintlify, Anthropic, and a handful of SaaS docs sites. Should you implement it?

Our position: yes for documentation-heavy sites, no for commercial marketing sites — at least until the major LLMs explicitly support it. As of April 2026: Claude (Anthropic) reads llms.txt. ChatGPT and Perplexity do not parse it as a primary signal yet, though Perplexity has indicated they will in mid-2026.

If you're a SaaS company with technical documentation that LLMs are likely to be asked about (API docs, integration guides), implement llms.txt. It costs ~2 hours of engineering and provides downside protection if more engines adopt it.

Example llms.txt at the root of your domain

text

# SERP Axis
> Senior agency for SEO, digital marketing, software development, software management, and Power BI consulting.

## Services
- [SEO & GEO services](https://serpaxis.com/services/seo): Generative engine optimization for ChatGPT, Perplexity, Gemini.
- [Digital marketing](https://serpaxis.com/services/marketing): Paid search, paid social, lifecycle email, CRO, marketing ops.
- [Software development](https://serpaxis.com/services/development): Web, mobile, AI products, headless CMS.
- [Power BI consulting](https://serpaxis.com/services/data/power-bi): $45/hour, no minimum, weekly invoicing.

## Pricing
- SEO retainers: from $499/month
- Software builds: from $3,500 (project-based)
- Power BI: $45/hour · Power Platform: $49/hour
- WordPress / Shopify / Webflow: from $45/hour

## Optional
- [Free 48-hour audit](https://serpaxis.com/audit)
- [How we work](https://serpaxis.com/work)

8. Measurement: how to actually track AI citations

If you can't measure it, you can't show progress to the CMO. Three measurement approaches, ordered by signal quality.

If you want to automate this, a custom monitor that runs a couple hundred prompts across all five engines weekly — logging verbatim vs paraphrased citations into a dashboard such as Power BI — is straightforward to build for roughly $100–150/month in API credits, and surfaces movement that off-the-shelf SEO SaaS doesn't yet track.

1Manual prompt audits (highest signal, low frequency). Maintain a list of 50–100 commercial queries your buyers ask. Run them weekly through ChatGPT, Perplexity, Gemini, Claude, and Bing Copilot. Log: did your domain get cited? Verbatim or paraphrased? In what position? This takes ~2 hours/week and is the only way to get true citation data.
2Branded query tracking (medium signal, high frequency). Track branded search volume in Google Search Console + Bing Webmaster Tools. A rising branded search trend after a GEO push is a leading indicator that AI engines are surfacing your brand.
3Referrer log analysis (lowest signal, but free). Check your server logs for traffic from chat.openai.com, perplexity.ai, gemini.google.com, claude.ai, and copilot.microsoft.com. Modest volume, but tracks user clicks-through from AI answers.

9. What does NOT work in GEO (despite what you've read)

Keyword stuffing your H1s with 'AI', 'ChatGPT', or 'GEO'. LLMs aren't keyword scanners; this hurts more than helps.
Buying 'GEO score' tools that score your page on a vague 1–100 metric. There's no evidence these scores correlate with actual citation rates. Save the money.
AI-generated content at scale, even with quality controls. Major LLMs are increasingly classifier-aware, and AI-detected content gets deprioritized in retrieval. Human-edited at minimum.
Treating GEO as a separate program from SEO. The schema, content, and authority that wins in classical SERPs is the same foundation that wins in AI engines. Don't fork your team.
Optimizing for one engine. ChatGPT-only or Perplexity-only optimization is brittle. Diversify across all five major engines.

10. The 90-day GEO plan we run for clients

If you're starting from scratch, here's the exact sequence.

What we instrument from week one: weekly citation count across major engines, branded-search volume, and verbatim-citation wins on flagship commercial queries — so you can see movement as content gets retrieved, rather than guessing.

Phase	Weeks	Output
Audit	1–2	Citation-surface map (200 queries × 5 engines), entity inventory, schema audit, Wikidata-readiness check
Schema + Entity	3–4	Service / Product / Article / Organization schema deployed; Wikidata entity submitted; sameAs links wired
Content patterns	5–8	Top 30 commercial pages rewritten with the 5 cited-passage patterns; FAQPage schema added where appropriate
Authority	6–12	Digital PR campaign (8–12 placements); 2–3 original-research assets for citation
Measurement	Ongoing	Weekly prompt audits, branded search tracking, referrer log review, monthly stakeholder report

Generative Engine Optimization (GEO) in 2026: the complete playbook

1. What actually changed in search (and when)

2. The retrieval stack of every major engine

3. Anatomy of a cited passage

4. Five sentence patterns that get cited (with examples)

5. Schema engineering for AI retrieval

6. Knowledge Graph + Wikidata: the entity layer

7. The llms.txt question — and what we recommend

8. Measurement: how to actually track AI citations

9. What does NOT work in GEO (despite what you've read)

10. The 90-day GEO plan we run for clients

Related deep-dives

Programmatic SEO without spam: the 12-step QA gate that survives Helpful Content Updates

Core Web Vitals in 2026: the INP edition (and what actually moves rankings)

11 agency anti-patterns we refuse to participate in

The cost of waiting
is your competitor.

Generative Engine Optimization (GEO) in 2026: the complete playbook

1. What actually changed in search (and when)

2. The retrieval stack of every major engine

3. Anatomy of a cited passage

4. Five sentence patterns that get cited (with examples)

5. Schema engineering for AI retrieval

6. Knowledge Graph + Wikidata: the entity layer

7. The llms.txt question — and what we recommend

8. Measurement: how to actually track AI citations

9. What does NOT work in GEO (despite what you've read)

10. The 90-day GEO plan we run for clients

Related deep-dives

Programmatic SEO without spam: the 12-step QA gate that survives Helpful Content Updates

Core Web Vitals in 2026: the INP edition (and what actually moves rankings)

11 agency anti-patterns we refuse to participate in

The cost of waiting is your competitor.

The cost of waiting
is your competitor.