Methodology · Version 1.1

The Catalyst AI Voice Share Index

Every voice share %, AEO score, and pipeline figure Catalyst publishes — in the scanner, the Industry Maps, the State of B2B AEO Report, and any client deliverable — is computed from the formula and coefficients on this page. Read this before citing our numbers externally.

What the Index measures

The Catalyst AI Voice Share Index is the percentage of buyer-intent AI queries in your category where your company is cited by name. We test five AI engines — ChatGPT, Claude, Gemini, Perplexity, and Google AI Overviews — across 10 prompts that represent real buyer questions in your product category, twice per engine per prompt, with different buyer personas across runs.

A company is counted as “mentioned” if its name or domain appears in the engine’s output. Voice share is the ratio of mentions to total possible mentions across all prompts and engines. The same methodology runs for the on-demand scanner at /aeo-audit, every Industry Map leaderboard (Vol 06 / 09 / future), and the upcoming /tools/[category] category pages.

Pipeline at stake translates voice share into a monthly dollar estimate — how much B2B pipeline is flowing through AI-cited buyer journeys in your category, and what share your company is capturing vs. losing to competitors. The full formula is in section three below.

The Index is refreshed continuously for on-demand scans and bi-weekly for industry-level leaderboards. Per-company scans timestamp their inputs so historical comparisons stay honest.

How the scan runs

1
Category research
We fetch the company's homepage, pricing page, and structurally similar companies via Exa. We call Apollo to retrieve employee count and funding stage. Claude synthesizes all four signals — self-description, named customers, regulatory designations, and technical integrations — to classify the product category and buyer persona.
2
Prompt generation
Claude generates 15 candidate buyer-intent query pairs for the identified category. Each pair includes a short keyword (for search volume lookup) and a natural-language question (for LLM polling). DataForSEO provides monthly search volume for each keyword. We keep the top 10 by volume.
3
LLM polling
Each of the 10 prompts is sent to 5 AI engines — ChatGPT (gpt-4o-mini), Claude (claude-haiku), Gemini (gemini-2.5-flash), Perplexity (sonar), and Google AI Overviews (via DataForSEO SERP API) — twice per engine per prompt. Run 1 uses a decision-maker persona; run 2 uses a practitioner persona. This surfaces vendors cited for different buyer roles. We process prompts in batches of 3 to avoid rate limits. Responses are scanned for mentions of your domain and any competitors in scope.
4
Voice share calculation
Voice share = total mentions ÷ total possible mentions (prompts × engines × runs). A company mentioned in either run of a prompt-engine pair receives credit for that pair. Competitor rankings are derived from the same data — how often each company in your category was cited across the full prompt set.
5
Pipeline calculation
Monthly AI queries = category search volume × (1 + LLM shift coefficient). Pipeline at stake = AI queries × citation CTR × opportunity rate × ACV. Captured pipeline = pipeline at stake × your voice share. Lost pipeline = pipeline at stake − captured.
6
Entity and technical checks
In parallel with LLM polling, we run domain authority checks (DataForSEO organic keyword count), Wikipedia presence, brand SERP dominance, press coverage detection, structured data analysis (FAQ, HowTo, Organization, Article schema), and content infrastructure checks (blog volume, original research, executive LinkedIn cadence).

The pipeline formula

monthly_ai_queries = search_volume × (1 + llm_shift_coefficient)

pipeline_at_stake = monthly_ai_queries × citation_ctr × opportunity_rate × acv

captured_pipeline = pipeline_at_stake × (voice_share_pct / 100)

lost_pipeline = pipeline_at_stake − captured_pipeline

LLM Shift Coefficient

2.0×

What: Multiplier applied to Google search volume to estimate total AI query volume for a category.

Why: B2B buyers increasingly start research in AI engines before or instead of Google. For enterprise buying intent queries, AI platforms collectively receive roughly 3× the query volume of Google search (1 + 2.0 = 3×). This coefficient is applied uniformly across auto-research scans and is updated as AI adoption data matures.

Citation CTR

2.5%

What: Estimated percentage of AI responses that result in a click-through to a vendor website.

Why: AI engines frequently answer questions without linking out. When they do cite a vendor, a fraction of users click through. 2.5% reflects conservative early-stage industry estimates for B2B category queries. This will increase as AI engines adopt more link-forward formats (e.g. Perplexity's citations model).

Opportunity Rate

What: Estimated percentage of citation visits that become a qualified pipeline opportunity.

Why: Calibrated to B2B SaaS inbound conversion benchmarks. A buyer arriving via an AI citation is already in active research mode — higher intent than average organic traffic — but not every visit converts to a qualified opportunity.

ACV

Estimated per scan

What: Annual contract value used to translate opportunity volume into dollar pipeline.

Why: Derived in priority order: (1) pricing page — if the company publishes per-seat or per-agent prices, we multiply by a typical enterprise seat count; (2) Apollo enrichment — employee count and funding stage provide a reliable proxy when pricing is hidden; (3) category benchmarks — industry median ACV for the classified product category as a last resort.

Voice share tiers

60–100%

Elite

Consistently cited across all four AI engines for your category's highest-intent queries. Competitors are losing citations to you.

40–59%

Strong

Cited in the majority of responses. Meaningful AI presence, room to consolidate further.

20–39%

Mid

Cited in some responses. Visible but not dominant. Gap analysis will show which engines and queries to prioritize.

0–19%

Emerging

Rarely or never cited. Competitors are capturing the category in AI responses. This is the highest-leverage starting point.

Limitations and disclaimers

Read before citing these figures in board decks or press releases.

Context-free measurement

Voice share is measured via standardized, context-free API calls — no conversation history, no account state, fresh sessions per query. This is the most reproducible and comparable baseline, but it differs from what individual users see. A buyer who has been researching your category for 20 minutes will get different results than a cold query. Our score measures baseline citation behavior, not personalized results.

Non-determinism

LLM outputs are stochastic. The same query can return different results across runs. We run each prompt twice per engine (2 runs × 10 prompts × 5 engines = 100 data points) and take the union of both runs — a company mentioned in either run receives credit. Each run uses a different buyer persona (decision-maker vs. practitioner) to further reduce variance and capture vendor mentions for different buyer roles.

Query coverage

We test 10 buyer-intent queries per scan, generated dynamically from your company's category and buyer persona. Real buyers use thousands of query variations. Our prompts are designed to represent the highest-signal buying intent queries for your category, not to be exhaustive.

Coefficient estimates

Citation CTR (2.5%) and opportunity rate (5%) are calibrated estimates, not audited figures. B2B AI citation behavior is an emerging measurement area with limited published benchmarks. We update these coefficients as more data becomes available and will publish a changelog at this URL.

Point-in-time snapshot

LLM training data, retrieval augmentation, and citation behavior all change over time. A score today reflects today's AI landscape. Companies that improve their entity authority, publish more original research, or earn more citations will see scores improve on future scans. Scores are not permanent rankings.

Pipeline figures are estimates

The monthly pipeline figures are directional estimates intended to size the opportunity, not audited revenue projections. They represent the estimated value of AI-driven buyer attention in your category — not a guarantee of recoverable pipeline. Treat them as a signal for prioritization, not a forecast.

Data sources

DataForSEO

Keyword search volume, organic keyword overlap for competitor discovery

Apollo.io

Company employee count, funding stage, and logo for ACV estimation and entity enrichment

Exa

Competitor discovery via semantic search and structural similarity (findSimilar)

OpenAI (GPT-4o mini)

LLM voice share polling

Anthropic (Claude Haiku)

LLM voice share polling + category research and prompt generation

Google (Gemini 2.5 Flash)

LLM voice share polling

Perplexity (Sonar)

LLM voice share polling

Google AI Overviews (via DataForSEO)

AI Overview voice share polling — surfaces Google's AI-generated answers for buyer queries

Target domain

Homepage text, pricing page, and sitemap for content and technical checks

Challenge our methodology

We update coefficients as better data becomes available. If you have published research on AI citation CTR, LLM query volume, or B2B opportunity rates that we should incorporate, we want to hear it.

will@gotcatalyst.com

Last updated: May 20, 2026. Methodology version v1.1.
Change log: v1.1 — branded the metric as the Catalyst AI Voice Share Index, added Article + FAQPage JSON-LD for citation eligibility, documented the bi-weekly refresh cadence rationale (Perplexity 2-3 day citation drop). v1.0 — initial publication (2026-05-05).