More buyers now open ChatGPT before they open a search engine, asking it directly which tools to shortlist and which vendor to pick. If you don't know whether your brand is named in those answers — or whether a competitor is named instead — you're flying blind at the exact moment a decision is forming.
- Tracking ChatGPT mentions means measuring whether the model names you in the prompts your buyers actually ask — not whether you rank somewhere.
- Separate a mention (your brand in the prose) from a citation (your domain as a source); both matter and are measured differently.
- Answers shift between runs and over time, so probe a fixed prompt-set on a schedule through the official API — never by scraping.
- The payoff is visibility intelligence: knowing where you appear, who appears instead of you, and which sources to win to close the gap.
Why track brand mentions at all
When a prospect asks ChatGPT "what's a good self-hosted analytics tool?" or "which of these two vendors is better for a small team?", the model writes one synthesized answer. There is no list of ten links to scroll. Either your brand is named in that paragraph or it isn't — and if it isn't, you're invisible at the precise moment the buyer is forming a shortlist.
That influence happens before any click. A prospect can read a recommendation, narrow their options, and walk into a sales conversation already leaning toward a competitor, all without ever visiting your site. You never see the impression in your analytics, so the absence is silent. Tracking mentions is how you make that silence visible.
Be honest about the frame, though: this is not a promise of a traffic surge. AI referral traffic is still a small share of the web today, and anyone quoting you a specific uplift is guessing. The real near-term value is intelligence — knowing whether ChatGPT speaks well of you, ignores you, or recommends someone else — which is cheap to measure and expensive to keep ignoring.
Mention vs citation
Two outcomes look similar but are not the same, and conflating them produces noisy data. A mention is ChatGPT naming your brand in the body of its answer. A citation is your domain appearing as a linked source the model drew from. They move independently.
ChatGPT can describe your product accurately from what it learned during training without ever linking to you — a mention with no citation. It can also lean on a third-party review or comparison that talks about you, citing that source while never naming you in the prose — a citation with no mention. Both signals carry information: mentions tell you how the model talks about your category; citations tell you which sources it trusts to build that answer.
Measure them separately. A visibility rate counts how often you're mentioned; a citation rate counts how often your domain is cited. When the two diverge — say you're mentioned often but rarely cited — that itself is a finding, usually pointing at where you need stronger first-party or third-party presence.
Build a prompt set that mirrors real questions
Your measurement is only as good as the questions you ask. A prompt set should mirror how buyers actually talk to an assistant in your category, not the flattering phrasing you wish they used. Asking "why is [your brand] the best?" is a vanity prompt: it puts your name in the question, so a mention in the answer proves nothing. The useful prompts are the ones where your name is not given, and you're watching to see whether the model brings it up on its own.
Cover the categories of questions that lead to a decision:
- Comparison — "[competitor A] vs [competitor B] for a small team" — does the model add you to the comparison unprompted?
- Alternatives — "alternatives to [well-known tool]" — are you on the list of substitutes?
- Best-of — "best [category] tool for [use case]" — do you make the recommended shortlist?
- How-to — "how do I [job your product does]?" — are you named as a way to get it done?
Keep the set fixed and representative — a few dozen prompts spanning these categories is far more useful than hundreds of near-duplicates. Stability matters more than volume, because you'll be re-running the same prompts over time and comparing like with like.
Probe on a schedule, not once
Asking ChatGPT a question once tells you almost nothing. The model samples its responses, so the same prompt can name you in one run and skip you in the next. Retrieval and the underlying model also change over weeks and months. A single snapshot is a coin flip dressed up as a measurement.
The fix is repetition. Run each prompt several times to average out run-to-run variance, and re-run the whole set on a cadence — daily or weekly — so you can watch the trend rather than react to one lucky or unlucky answer. The trend is the signal; the single answer is noise.
Do this through the official API, sending your prompts and recording the answers and any citations returned. Never scrape the ChatGPT web interface: it's brittle, it breaks the terms of service, and it gives you data you can't stand behind. This is exactly the loop runs — a fixed prompt-set, sampled repeatedly on a schedule through official engine APIs, turned into metrics over time.
Read the signal
Once you're collecting answers consistently, a few plain metrics do most of the work. Visibility rate is the share of probes where you're mentioned at all — your baseline presence. Watch its trend, not its absolute value on any one day.
Then look at position within the answer. Being named first in a recommended shortlist is very different from being a footnote in the last sentence. ChatGPT's ordering tends to mirror how confidently it associates you with the question, so a low but consistent position is a different problem than not appearing at all.
Finally, track who is named instead of you. Every prompt where a competitor appears and you don't is a concrete gap. Aggregated across your set, the competitors that keep surfacing reveal your real share-of-voice in AI answers — and a shortlist of exactly where you're losing the conversation.
Act on the gaps
A gap is only useful if it points to an action. When ChatGPT names a competitor instead of you, the question to ask is why the model reached for them — and the answer usually lives in the sources it leaned on. Where citations are available, look at which domains recur in the answers that exclude you: comparison sites, directories, documentation, independent reviews. Those are the places the model trusts, and the places where you're either absent or poorly represented.
From there the work is concrete: be accurately and clearly represented on the sources that keep getting cited, and make your own pages the cleanest available answer to the prompts you're losing — direct, well-structured, current. Then re-run the set and check whether visibility, position, and share-of-voice actually moved. A few prompts you can put to work:
- "For the prompt where [competitor] is recommended and we aren't, list the sources you used to form that answer."
- "What would a page need to say to be a credible answer to [buyer question] in this category?"
- "Which independent sources discuss [our category], and how are we represented there versus [competitor]?"
None of this is a trick. Tracking mentions in ChatGPT is the discipline of watching the answers, finding the gaps, earning presence on the sources that fill them, and measuring again — turning scattered, invisible impressions into a trend you can act on.
Frequently asked questions
Can I track ChatGPT without using the API?
Not reliably. You could ask questions by hand in the web app, but you can't do that repeatedly, consistently, or at the volume needed to average out variance — and the results aren't reproducible. The official API is the only stable way to send a fixed prompt-set, record answers and citations, and track them over time.
How often should I probe?
Often enough to see a trend rather than a snapshot. Daily or weekly works for most brands, with each prompt run several times per cycle to smooth out run-to-run variance. The exact cadence matters less than keeping it fixed so you compare like with like.
What if the answer changes every time?
That's expected — ChatGPT samples its responses, so variation is normal. You handle it by sampling: run each prompt multiple times and look at the rate at which you're mentioned, not any single answer. Variance is exactly why one-off checks are misleading and repeated probing is the point.
Does this work for non-English markets?
Yes. Write your prompt-set in the language your buyers actually use, since answers and the sources behind them differ by language and region. The method is identical; only the prompts and the trusted sources change.
Is scraping ChatGPT against the rules?
Scraping the ChatGPT web interface generally violates its terms of service, and it's brittle besides — interfaces change and break your collection without warning. Use the official API instead: it's permitted, stable, and gives you data you can stand behind.
See if ChatGPT names you.
Probe ChatGPT with the prompts your buyers actually ask, and see where you're mentioned, cited, or quietly missing — and who's named instead. No traffic miracle promised, just a clear picture of your AI visibility.
Start tracking