Quick Answer: Vector 1 is the diagnostic stage of AI search optimization. The audit tests how ChatGPT, Perplexity, Gemini, and Google AI Overviews describe your brand across 30+ queries, identifies hallucinations, and sets a baseline. Only 38% of AI Overview citations come from top-10 pages, so rank does not predict AI visibility.
In This Cornerstone
Reading time: 13 minutes.
This Is Vector 1: Diagnose
Every Formative Digital engagement begins here. The 12 Vectors are an engineering sequence, not a menu, and Diagnose is the work that has to land before Anchor, Resonate, Embed, or anything else makes sense. You cannot move metrics you have not measured. The article you are reading is the cornerstone reference for this stage of the framework.
Why Diagnostic Comes Before Optimization
The AI search market has produced a strange new pattern. Agencies sell "GEO services" as a single line item, propose a six-month engagement, and start writing content in week one without ever measuring what the engines currently say about the brand they are paid to optimize. The agency is in motion. The diagnostic is skipped. Six months later the client cannot tell whether anything changed because there was no baseline to change from.
That is the pattern Vector 1 exists to refuse. The diagnostic is not a sales artifact, and it is not a checklist tacked onto a proposal. It is a forensic measurement of present state across the four AI surfaces that now mediate buying decisions: ChatGPT, Perplexity, Google AI Overviews, and Gemini. Until that measurement is on the page, optimization is faith.
Two facts shape the urgency. First, AI Overviews now trigger on roughly half of all Google queries, with healthcare and education running far higher and entertainment running lower. Second, and more important for diagnostic strategy, only 38% of AI Overview citations come from pages that rank in the organic top 10, down from 76% as recently as 2024 according to industry tracking by ALM Corp and others. A brand can rank third in classic Google and be invisible to AI search. The two surfaces have decoupled.
The 2026 Numbers Vector 1 Has To Read
AI Overviews appear on approximately 48% of Google queries as of March 2026 (Search Engine Land tracking). 88% of AI Overviews cite three or more sources; 1% cite a single source. AI Overviews under 600 characters typically cite five sources; AI Overviews over 6,600 characters cite around 28. Cited brands earn 35% more organic clicks and 91% more paid clicks than equivalent uncited brands. Source diversity, not rank dominance, is the new visibility currency.
Read those figures together and the diagnostic becomes mandatory. If half of Google's traffic is being mediated by an answer engine that draws sources outside the top 10, every brand has visibility it does not see and invisibility it does not see either. Vector 1 is the work that pulls both into the open.
Stage 1: Build the 30-Prompt Test Set
The audit begins with a representative prompt set. Thirty prompts is the working floor. Below thirty, the sample is too small to separate signal from variance. Above one hundred, the data is richer but the labour cost climbs faster than the marginal insight.
The prompts split into three categories of roughly equal weight:
The Three Prompt Categories
- Brand-direct (10 prompts): "What is [brand]?", "Tell me about [brand]", "Is [brand] reputable?", "What do customers say about [brand]?", "Where is [brand] located?". These reveal the engine's existing entity model and surface hallucinated facts about the business.
- Category-level (10 prompts): "Best [category] in [region]", "Top [category] for [use case]", "[Category] near me", "How do I choose a [category]?". These reveal whether the brand competes for non-branded discovery and which competitors the engines surface in its place.
- Scenario (10 prompts): The actual buying question a real prospect asks. "I have a king bed and a partner with back pain, what mattress should I buy?" The intent that lives upstream of "best mattresses Brantford" but where the actual decision is made.
The category prompts are the ones most agencies skip and the ones that produce the most uncomfortable reading for the brand. Brand-direct queries usually return something. Scenario queries reveal whether the brand is competing where the buying decision happens.
Localize where local intent applies. A Brantford mattress retailer needs Brantford, Hamilton, Cambridge, and Kitchener-Waterloo variants of the same scenario prompt because AI Overviews and AI assistants resolve geography differently than classic Google does. A national B2B SaaS does not need that variance, but it does need persona-shifted prompts ("I run a 20-person sales team," "I am an enterprise procurement officer").
Stage 2: Run Across the Four Engines
The four AI surfaces that matter for almost every Vector 1 audit, in order of audit priority:
The Four Surfaces and What Each Reveals
- Google AI Overviews. The largest surface by reach because half of all Google traffic now passes through it. Source attribution is on the result page itself. Run prompts logged out, in incognito, with location set to the relevant city. The cited links are the surface most directly answerable to schema, structured data, and crawl signals.
- Perplexity. The most transparent of the four. Every response carries a sidebar of cited sources, which makes the audit fast and the remediation map clear. Use Focus: All. Perplexity is also the closest stand-in for the "what would a research-mode AI agent surface?" question that increasingly drives B2B procurement.
- ChatGPT. The most popular consumer surface and the hardest to audit because the citation behaviour varies by mode. With the search tool active, ChatGPT cites; without it, the engine answers from training data and may produce confident hallucinations. Run both modes. The hallucination findings are usually the most urgent items in the report.
- Gemini. Closer to AI Overviews in surface design and citation pattern. Worth running because Google's two surfaces draw on overlapping but not identical source sets, and Gemini sometimes catches a citation that AI Overviews misses (or the reverse).
For each engine, log the prompt, the answer text, every source cited, the position your brand occupies if mentioned, every competitor mentioned, and any factual claim the engine makes about the brand that would need verification. A spreadsheet is sufficient. The labour is the labour. There is no shortcut to running 30 prompts across four engines three times each, which is 360 individual reads, before the dataset is interpretable.
This is also the stage at which the diagnostic refuses the standard agency shortcut of "one screenshot of ChatGPT mentioning the brand, posted to LinkedIn as proof." A single sample tells you almost nothing. The whole point of Stage 2 is the dataset, not the highlight reel.
Stage 3: Score Mention, Citation, Hallucination
Once the dataset is collected, Vector 1 separates three distinct outcomes that almost every shorter audit collapses into one. The distinction matters because each one routes to a different remediation path in the next 11 vectors.
Three Audit Outcomes, Three Remediations
- Mention means the engine names the brand in the generated answer. This feeds awareness. Tracked as Mention Rate (% of prompts where the brand appears) and Position (where in a list the brand surfaces).
- Citation means the engine links to a specific page on the brand's domain as a source for the answer. This feeds referral traffic. Tracked as Citation Rate (% of prompts where the brand is sourced) and which pages get cited.
- Hallucination means the engine confidently states something about the brand that is incorrect: a feature it does not offer, a service area it does not cover, an integration that does not exist, a price that is wrong, an owner who is not the owner. This is the most urgent finding because the engine is teaching customers wrong facts at scale.
The Aggarwal et al. (2023) GEO paper out of Princeton and Cornell formalized impression metrics for generative engines, including Word Count and Subjective Impression measures that capture how prominently a citation appears inside the answer rather than whether it merely exists. A citation buried as the eighth source has different downstream value than a citation pulled into the visible answer text. A serious audit captures this gradation, not just the binary cite-or-not.
Hallucinations are the asymmetric finding. One bad fact repeated across thousands of prompt instances per month does measurable revenue damage, and unlike rank, hallucinations do not gradually decay; they compound until the underlying entity record is corrected. Vector 2 (Anchor) is where most hallucination remediation happens, but Vector 1 is where the targets are surfaced.
Stage 4: Account for Non-Determinism
The single technical fact that separates a credible Vector 1 audit from a marketing-deck audit is non-determinism. Generative engines do not return the same answer twice. A prompt run at 09:00 may surface your brand and cite three Tier-1 sources; the same prompt run at 09:05 may surface a competitor instead. Aggarwal and colleagues documented this variance directly: even with identical prompts, identical models, and identical session state, output diverges. Industry observation suggests the chance of identical responses on consecutive runs of the same prompt is below one in one hundred.
This is not a bug to engineer around. It is a measurement reality that has to shape audit design. Three implications follow:
Matt Griffin, Formative Digital: "Most agency audits show clients one screenshot. We show them a distribution. ChatGPT does not give the same answer twice, and any diagnostic that pretends it does is selling certainty the engine cannot produce. Run every prompt at least three times. Report the rate, not the snapshot. The variance is the data."
First, every prompt runs a minimum of three times across separate sessions. A prompt that surfaces the brand once and not the next two times has a 33% mention rate, not "yes, mentioned." Second, dates of measurement matter as much as the measurements themselves; the audit is a snapshot of model behaviour on a specific day and must be redated to be defensible. Third, the metric the audit reports is a probability distribution, not a binary. Saying "the brand appears on 70% of category-prompt runs in March 2026" is honest. Saying "the brand appears on Perplexity" is meaningless.
This is also why a Vector 1 audit cannot be replaced by a one-off tool screenshot or a single ChatGPT session. The dataset has to be statistical or it is anecdotal.
Stage 5: Translate Findings Into a Vector Map
The deliverable that makes Vector 1 useful instead of merely interesting is the Vector Map. Each finding from Stages 3 and 4 routes to one of the next 11 vectors, with a priority and a sequencing dependency. The map is what turns the diagnostic into a work plan.
How Findings Route Into the 12 Vectors
- Hallucinations about the brand → Vector 2 (Anchor). Entity validation, NAP consistency, Wikidata eligibility, schema correction.
- Brand absent on category prompts → Vector 3 (Resonate) + Vector 9 (Cluster). Map the actual prompts prospects use; build topical depth around them.
- Brand mentioned but not cited → Vector 4 (Embed) + Vector 6 (Structure). Quick Answer blocks, semantic HTML, schema graph repair so the engines know which page to point at.
- Sources cited are competitors and Tier-3 directories → Vector 5 (Cite) + Vector 7 (Distribute). Earn citations on the corpus the engines actually train on.
- Information returned is stale → Vector 8 (Refresh). Date-stamped updates so re-indexed crawls register freshness.
- Local prompts surface competitors → Vector 10 (Localize). Local schema, GBP, city-specific real-research pages.
- No way to track changes → Vector 11 (Measure) + Vector 12 (Iterate). Standing dashboards, monthly re-audits, feedback loop into next quarter's work.
The map is what most audit reports lack and what makes Vector 1 the operational starting point of the methodology rather than a standalone deliverable. A finding without a remediation route is just an observation. A finding with a vector tag is a unit of work.
Real Vector 1 Anchor
Mattress Miracle, the Brantford retailer Formative Digital has worked with since the methodology was named, baseline-tested at almost zero AI surface presence in its initial Vector 1 audit. The same diagnostic re-run after the full 12-vector engagement registers among the most-cited Ontario retail names on Perplexity and AI Overviews for category prompts. The full diagnostic-to-measurement story is the subject of Vector 11: Measure; the point here is that the present-state baseline has to exist before the gain is legible. Results depend on industry, competition, and existing digital presence.
If your domain has never been formally diagnosed across the four AI surfaces, that is the work that has to land first. A no-charge Vector 1 baseline takes about a week and produces the map the next 11 vectors run against.
What Comes After Vector 1
The 12 Vectors are sequential by design. Each one inherits work from the prior vector and produces work for the next. Vector 1 hands off to Vector 2: Anchor, which is where the hallucination findings get fixed at the entity layer through Knowledge Graph submissions, schema correction, and NAP consistency work. Vectors 3 through 6 then build the resonance, embedding, citation, and structural foundation that turns a corrected entity into a discovered one. Vectors 7 through 10 do the distribution, freshness, clustering, and local work. Vectors 11 and 12 close the loop with measurement and iteration.
The reason the order is fixed is that each vector consumes the output of the prior. You cannot anchor an entity you have not diagnosed. You cannot embed answers for prompts you have not mapped. You cannot measure improvement against a baseline you have not captured. The methodology is a system, and Vector 1 is the system's intake stage.
For Brantford and Ontario Service Businesses
Local intent is now the surface where AI search produces the most measurable revenue swings. A Brantford foundation contractor, a Hamilton accountant, or a Kitchener-Waterloo dental practice that gets cited on the "best [category] near me" prompt set captures referral volume that classic local SEO no longer guarantees on its own. The Vector 1 diagnostic is identical in structure to the national version; the prompt geography is what shifts. Statistics Canada's annual digital adoption surveys for SMEs make clear that Ontario's owner-operators are increasingly aware of AI search but rarely audited for it. The diagnostic gap is the opportunity.
Frequently Asked Questions
How do I check if ChatGPT mentions my brand?
Open ChatGPT, Perplexity, Gemini, and Google AI Overviews. Run 30 to 50 prompts spanning brand-direct, category, and scenario queries. Repeat each query three to five times because responses are non-deterministic. Log every result, every source the engine cites, every competitor mentioned, and every hallucinated fact.
How often should I audit AI visibility?
Quarterly at minimum. Monthly if your category is moving fast or you have published net-new optimization work in the previous 30 days. AI engines update training cycles and re-rank citations continuously, so a single annual audit will miss most of the variance.
Why does ChatGPT give different answers each time I ask the same question?
Generative engines are statistical, not deterministic. There is less than a one-in-100 chance of identical output on repeated runs of the same prompt. Aggarwal and colleagues at Princeton documented this variance in their 2023 GEO paper, which is why a credible audit runs every prompt at least three times.
Can I run a Vector 1 audit myself, or do I need an agency?
You can absolutely run the manual version yourself with a spreadsheet, four browser tabs, and a few hours. Where an agency adds value is volume, statistical rigour, hallucination triage, and translating findings into a remediation plan that maps to the next 11 vectors. The diagnostic is teachable; the remediation is the work.
What is the difference between an AI mention and an AI citation?
A mention is when the engine names the brand inside its generated answer. A citation is when the engine links to a specific page on the brand's domain as a source for that answer. Mentions feed brand awareness; citations feed referral traffic. A complete audit measures both, separately.
Sources
- Aggarwal, P., Murahari, V., Rajpurohit, T., Kalyan, A., Narasimhan, K., & Deshpande, A. (2023). GEO: Generative Engine Optimization. arXiv preprint. arXiv:2311.09735
- Search Engine Land (2026). AI Overview triggering rates and citation behaviour, March 2026 tracking. searchengineland.com
- ALM Corp (2026). Google AI Overview Citations From Top-10 Pages Dropped From 76% to 38%. almcorp.com
- Google Search Central (2025-2026). Structured data and AI Overview eligibility documentation. developers.google.com/search/blog
- Stanford Institute for Human-Centered AI (2025). The AI Index Report 2025. Stanford HAI. aiindex.stanford.edu
- Search Engine Journal (2026). The AI Search Visibility Audit: 15 Questions Every CMO Should Ask. searchenginejournal.com
Get Your Vector 1 Diagnostic
Formative Digital, Brantford, Ontario
This is Vector 1 inside the Formative Forces delivery system. Every engagement begins with the diagnostic, and the diagnostic is the deliverable that makes the rest of the methodology auditable. If you have never been baseline-tested across ChatGPT, Perplexity, Gemini, and Google AI Overviews, that is the work that has to land first. We run it as a no-charge engagement for qualifying domains.