ChatGPT SEO is the structured optimization of content and entity signals so that ChatGPT — especially with web search enabled — cites the brand in its answers. Core levers: passage-level citability (200-400 token chunks, self-contained), entity resolution through Wikidata and Schema.org, Bing-index visibility (ChatGPT Search uses Bing as primary retrieval), GPTBot access in robots.txt, and llms.txt as a supplementary signal.

Which index does ChatGPT Search use?

ChatGPT Search primarily uses the Bing index, supplemented by OpenAI's own web crawls (GPTBot). Three consequences: (1) Bing indexation is mandatory — classical Google SEO is not enough, (2) Bing Webmaster Tools must be maintained, (3) GPTBot access in robots.txt determines OpenAI's own data collection for training and on-demand crawling.

What is passage-level citability?

Passage-level citability describes how well a 200-400 token chunk from a document is citable as a standalone answer. Criteria: (1) claim-evidence pairing in the first sentence, (2) no anaphoric pronouns referring to earlier paragraphs, (3) explicit entity mentions instead of implicit references, (4) numerical facts with source attribution, (5) semantic self-coherence. Cross-encoder rerankers reward this structure disproportionately.

How important is robots.txt for ChatGPT?

Two layers: (1) GPTBot (OpenAI's training crawler) can be allowed or blocked via robots.txt — blocking prevents inclusion in training, allowing strengthens presence in model weights; (2) ChatGPT-User (the on-demand crawler for live queries) is separate — blocking it removes the brand from ChatGPT answers with web search. Most B2B brands should allow both.

Does llms.txt help for ChatGPT?

llms.txt is not yet an official OpenAI standard, but is increasingly being read. The file delivers Markdown-based summaries of a domain's authoritative pages and is especially useful for complex technology products and documentation. Implementation cost is low, upside moderate — a sensible part of the GEO stack.

How long until ChatGPT cites a new brand?

For pure live retrieval (ChatGPT with web search), citations appear within days to weeks after Bing indexation and a clean passage structure. For training-based citations (ChatGPT without web search), it takes a full training window — typically 6 to 18 months after crawl. Combined visibility (web plus training) builds over 9 to 15 months.

ChatGPT SEO: How Brands Get Cited Inside ChatGPT Answers

If your brand is not cited inside ChatGPT, it does not exist in a growing share of informational search reality. By OpenAI's own numbers, ChatGPT handles more than 800 million queries per day by late 2026, a meaningful portion with web search enabled. Most brands still treat ChatGPT as a fringe channel — and SEO teams rarely have a dedicated playbook for it.

This guide closes that gap. It describes how ChatGPT technically constructs answers, which signals drive citation probability, and the eight measures that reliably lift citation rate within three to six months in advisory practice.

The retrieval mechanics: how ChatGPT builds an answer

ChatGPT answers with web search are produced in a four-step pipeline. (1) Query interpretation by the language model, generating one or more sub-queries. (2) Retrieval against the Bing index (primary source), supplemented by OpenAI's own web crawls. (3) Re-ranking of the top-N results through a cross-encoder for semantic relevance to the full query. (4) Answer synthesis with explicit or implicit citation of the highest-scoring chunks.

For SEO, this is a structural shift: document ranking no longer decides — chunk quality does. A page in position one is not necessarily cited; a structurally clean passage from position eight may well be. The optimization layer moves from the page to the paragraph.

~80%

of ChatGPT retrieval runs through the Bing index — classical Google SEO is not enough

200-400

tokens per chunk — the sweet spot for maximum citation probability

6 layers

of signals that ChatGPT combines when choosing sources — all of them must be sufficient

The six visibility layers for ChatGPT

Layer 1 — Bing indexation. Mandatory, not optional. No retrieval without it. Set up Bing Webmaster Tools, submit an XML sitemap, activate the IndexNow protocol. Many brands that are excellently indexed in Google have substantial gaps in Bing — often in the range of 15 to 40 percent of URLs.

Layer 2 — GPTBot access. Declare it explicitly in robots.txt. Three bots matter: GPTBot (training crawler), ChatGPT-User (on-demand for live queries) and OAI-SearchBot (for the ChatGPT Search index). Blocking all three isolates the brand entirely from ChatGPT. The default recommendation for B2B brands: allow all three. For premium content publishers: selectively block the training crawler while keeping the live crawlers.

Layer 3 — Passage-level citability. Chunks of 200-400 tokens with a claim-evidence structure in the first sentence. No anaphoric pronouns pointing back to earlier paragraphs. Explicit entity mentions instead of implicit references. Concrete numbers with sources instead of vague phrasing. In our benchmarks, these chunk properties correlate most strongly with cross-encoder reranker scores.

Layer 4 — Entity resolution. ChatGPT must be able to interpret the brand as an unambiguous entity — not as a name-collision candidate. That requires a Wikidata item with referenced properties, a Schema.org graph with @id coherence, a sameAs cluster across authoritative third-party profiles, and consistent attributes (role, industry, location, founding year). Without that resolution, the brand is either not cited at all or confused with a foreign entity.

Layer 5 — Schema.org JSON-LD. For ChatGPT, structured data is not a rich-result signal but a semantic substrate. Article schema with author-@id pointing to a Person schema, Product schema with brand-@id pointing to Organization, FAQPage with explicit claim-answer structure. Schema implementation provides the foundation.

Layer 6 — llms.txt and brand cohesion. Complementary: an llms.txt file with a structured Markdown summary of the most important pages. Not yet an official standard, but increasingly read. Plus consistent brand maintenance on LinkedIn, Crunchbase, GitHub and YouTube — all sources that ChatGPT pulls for context.

ChatGPT bot matrix — robots.txt strategy at a glance
Bot	Function	Recommendation	Consequence if blocked
GPTBot	OpenAI training crawler	Allow	Brand absent from future model training
ChatGPT-User	On-demand live-query crawler	Allow	No presence in ChatGPT answers with web search
OAI-SearchBot	ChatGPT Search index crawler	Allow	Invisible inside the ChatGPT Search interface
Bingbot	Microsoft index (ChatGPT retrieval source)	Mandatory allow	Complete exclusion from ChatGPT web retrieval

Mid-read · Strategy call

Is your bot configuration correct?

A 30-minute live check of your robots.txt and Bing indexation against the four critical crawlers — clear go/no-go list instead of guesswork.

Request the check →

Passage engineering: a concrete example

Weakly citable text (typical for B2B content): "Our platform offers a comprehensive solution for customer service. It combines various features that help teams support their customers better. Many companies use it to optimize their processes."

Rewritten for strong citability: "Zendesk is a customer service platform that integrates ticket management, chat, AI agents and analytics in a single interface. In 2026 it is used by roughly 170,000 companies, including Uber, Siemens and Tesco. Its market share in the customer support SaaS segment is approximately 12 percent (Gartner, Q1/2026)."

The difference is structural: the second text is self-contained (it works without context from other paragraphs), names the entity explicitly, delivers concrete numbers with sources and carries a claim-evidence pairing in its first two sentences. ChatGPT rerankers reward exactly these patterns.

The 90-day protocol for ChatGPT visibility

Month 1 — audit and foundation. Bing-indexation check across all relevant URLs, GPTBot-access audit in robots.txt, entity-maturity check (Wikidata status, schema-graph coherence) and a baseline measurement of current ChatGPT citation across 1,500 prompts (brand, category, long tail).

Month 2 — passage rewrites and schema. Walk through the top 30 URLs with the highest citation potential, refactor chunks to 200-400 tokens, introduce a claim-evidence structure, make them self-contained. In parallel, roll out a Schema.org graph with @id coherence, run the Rich Results Test and deploy llms.txt.

Month 3 — corroboration and entity reinforcement. Maintain the Wikidata item (references from independent secondary sources), secure three to five trade-media features with consistent fact fingerprints, set up author entities for writers. In parallel, run a second citation-rate measurement — the typical uplift in weeks ten to twelve is 30 to 80 percent over baseline.

What classical SEO no longer delivers

Three misconceptions that persist stubbornly in practice. First: "We rank position one in Google, so ChatGPT will cite us." Wrong — ChatGPT uses the Bing index, not Google. Second: "We have strong backlinks, that is enough for LLMs too." Backlinks are entity signals, but without structured data and passage quality they remain unused. Third: "A solid FAQ is enough." FAQs are one signal, but without schema and without passage engineering on the main content they stand alone.

ChatGPT SEO is a discipline in its own right within the GEO stack. Classical SEO remains the foundation — but it is no longer sufficient. Ignore the channel and you lose informational demand. Address it systematically and you build a channel where competitive density is still three to five years behind the classical Google SERP.

What measurement looks like: KPIs for ChatGPT SEO

Four primary metrics from LLM citation monitoring practice. (1) Citation rate: share of prompts in the tracking matrix in which the brand is cited. (2) Position in answer: position of the citation inside the ChatGPT response (top, middle, end). (3) Source-origin breakdown: which URL was cited — own domain, third party, Wikipedia. (4) Competitor share of voice: share of citations against competitors for the same prompts.

Secondary KPIs: entity-resolution rate (is the brand correctly recognised as a unique entity?), hallucination rate (are false attributes assigned?), source freshness (how current are the cited sources?). These secondary metrics explain the primary movements — when citation rate falls, the cause is usually entity-resolution breakage or content aging.

Bottom line: ChatGPT SEO is not an add-on

Treating ChatGPT visibility as a side task of the SEO team systematically underestimates the channel. The retrieval mechanics are different, the KPIs are different, the training and inference cycle is different. Done right, you build a channel that classical SEO does not cover — and where competitive density will stay three to five years lower than in the classical Google SERP.

The structural lever is not more content but better-structured content — plus a clean entity architecture plus Bing-indexation hygiene. The 90-day protocol above provides the sequence. Continuous citation monitoring makes the effect visible.

ChatGPT SEO — how brands appear systematically inside ChatGPT answers.

The retrieval mechanics: how ChatGPT builds an answer

The six visibility layers for ChatGPT

Is your bot configuration correct?

Passage engineering: a concrete example

The 90-day protocol for ChatGPT visibility

What classical SEO no longer delivers

What measurement looks like: KPIs for ChatGPT SEO

Bottom line: ChatGPT SEO is not an add-on

Murat Ulusoy

How often does ChatGPT cite your brand?

The retrieval mechanics: how ChatGPT builds an answer

The six visibility layers for ChatGPT

Is your bot configuration correct?

Passage engineering: a concrete example

The 90-day protocol for ChatGPT visibility

What classical SEO no longer delivers

What measurement looks like: KPIs for ChatGPT SEO

Bottom line: ChatGPT SEO is not an add-on

Murat Ulusoy

How often does ChatGPT cite your brand?

Related insights

Perplexity SEO: how brands appear inside Perplexity answers.

Optimize Google AI Overviews — the 2026 guide.

Claude citation optimization: visibility inside Anthropic's model.