AISEO

GEO Measurement Without the Hype: A Practical Prompt Audit Workflow

Stop guessing about AI visibility. Learn a simple, repeatable prompt audit workflow to track citations, brand mentions, and answer drift across AI engines.

By Jim, 27 Mar 2026, 8 min

If you're trying to measure GEO results, you've probably noticed something: the numbers don't always match, the tools don't agree, and nobody can tell you exactly why your content was or wasn't cited in an AI answer.

That's not a bug. It's the nature of generative systems right now.

But that doesn't mean you're stuck guessing. This post shows you a simple, repeatable workflow that works despite the uncertainty.

The Measurement Problem in GEO

Traditional SEO measurement is straightforward: rank for a keyword, track your position, measure clicks. AI systems break this model.

Here's why:

Non-deterministic outputs. Run the same prompt in ChatGPT twice and you might get different citations, different answer structure, or a completely different answer. This is by design.
No standard ranking metric. There's no "position 1" in an AI answer. Your content might be the primary source, one of many, or mentioned without a link.
Tool blindness. Third-party trackers sample prompts and try to extrapolate, but they can't know what individual users actually see.

So how do you measure impact? You stop trying to measure everything, and instead track what actually matters: are we being cited, and how often?

What "Being Cited" Actually Means

In AI answers, citation is the new visibility metric.

Your content might:

Be the primary source (the AI attributes an answer or stat to you)
Be one of several sources (mentioned as supporting evidence)
Be mentioned by name (without a link)
Appear in a list (of resources or alternatives)
Not appear at all

Each of these is a different outcome. A prompt audit workflow helps you track all of them.
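If you paste answers into a script rather than eyeballing them, a coarse first pass could look like the sketch below. It only separates three of the outcomes (cited, mentioned by name, absent); telling "primary source" from "one of several" still needs a human read. The `Outcome` enum, the `classify` helper, and the example domain are hypothetical names for illustration, not part of any tool.

```python
from enum import Enum
from urllib.parse import urlparse


class Outcome(Enum):
    CITED = "cited"          # your domain appears in the answer's source list
    MENTIONED = "mentioned"  # brand named in the text, but no link to you
    ABSENT = "absent"        # neither of the above


def classify(answer_text: str, source_urls: list[str], brand: str, domain: str) -> Outcome:
    """Coarse classification of one AI answer (hypothetical helper)."""
    # Normalise each source URL to a bare host so "www." variants still match.
    hosts = {urlparse(u).netloc.lower().removeprefix("www.") for u in source_urls}
    if domain.lower() in hosts:
        return Outcome.CITED
    if brand.lower() in answer_text.lower():
        return Outcome.MENTIONED
    return Outcome.ABSENT
```

The check is deliberately simple: a substring match on the brand name will miss misspellings and abbreviations, so treat the result as a prompt for review rather than a verdict.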

Building Your Prompt Audit Set

Start simple. Your "prompt set" is a fixed list of prompts you run regularly. It doesn't have to be huge.

Choose 10-20 prompts that represent:

Awareness stage questions your customers ask (e.g., "What is a heat pump and how does it work?")
Consideration stage questions (e.g., "Heat pump vs air conditioning: which is better for my home?")
Decision stage questions (e.g., "How to find a local heat pump installer")
Branded and local searches (e.g., "Best heat pump companies in Dorset", "Acme Heating reviews")

Example set for a plumbing business:

What should a boiler service include?
How often do you need a boiler service?
Boiler service vs central heating maintenance
Cost of boiler service 2026
Best local plumber Shaftesbury
How to find an emergency plumber
Boiler repair vs replacement
What to look for when choosing a plumber
My boiler is leaking water
Central heating not heating

Keep the same set for at least 4-6 weeks so you can spot trends.
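If you'd rather keep the set in a file than in your head, here is a minimal sketch that stores the plumbing example with a stage label per prompt and checks the basics: 10-20 prompts, no duplicates, every stage covered. The stage assigned to each prompt is my own guess for illustration, and `check_prompt_set` is a hypothetical helper.

```python
from collections import Counter

# Stage labels are assumptions; adjust them to how your customers actually search.
PROMPT_SET = [
    ("awareness", "What should a boiler service include?"),
    ("awareness", "How often do you need a boiler service?"),
    ("consideration", "Boiler service vs central heating maintenance"),
    ("consideration", "Cost of boiler service 2026"),
    ("branded", "Best local plumber Shaftesbury"),
    ("decision", "How to find an emergency plumber"),
    ("consideration", "Boiler repair vs replacement"),
    ("decision", "What to look for when choosing a plumber"),
    ("awareness", "My boiler is leaking water"),
    ("awareness", "Central heating not heating"),
]

STAGES = {"awareness", "consideration", "decision", "branded"}


def check_prompt_set(prompts):
    """Return a list of problems; an empty list means the set looks usable."""
    problems = []
    if not 10 <= len(prompts) <= 20:
        problems.append(f"expected 10-20 prompts, got {len(prompts)}")
    # Compare prompts case-insensitively so near-identical entries are caught.
    counts = Counter(text.strip().lower() for _, text in prompts)
    dupes = [text for text, n in counts.items() if n > 1]
    if dupes:
        problems.append(f"duplicate prompts: {dupes}")
    missing = STAGES - {stage for stage, _ in prompts}
    if missing:
        problems.append(f"no prompts for stages: {sorted(missing)}")
    return problems
```

Keeping the list in one place also makes it harder to quietly reword a prompt mid-audit, which would break week-on-week comparisons.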

Running the Audit: Where and How

You don't need to run prompts everywhere. Start with 2-3 systems:

1. Google AI Overviews (where available)

Search Google for your prompt. If an AI Overview appears, screenshot it and note:

Does your content appear in the sources list?
Is your brand mentioned in the answer text?
Which competitors are cited?
Is the answer stable? Run it again tomorrow and see whether it changes.

2. ChatGPT or Perplexity

You can run the same prompts here. ChatGPT lists sources when it searches the web; if none appear, ask "where did this come from?" and treat any sources it recalls from memory with caution. Perplexity shows citations inline.

Note the same things: your brand, your competitors, source stability.
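"Source stability" is easier to compare if you put a number on it. One option, sketched below, is the Jaccard overlap of the domains cited in two runs of the same prompt: 1.0 means the same set of sources, 0.0 means nothing in common. This is my suggestion rather than an established GEO metric, and the function names and URLs are made up for the example.

```python
from urllib.parse import urlparse


def cited_domains(source_urls):
    """Reduce a list of source URLs to a set of bare hostnames."""
    return {urlparse(u).netloc.lower().removeprefix("www.") for u in source_urls}


def source_overlap(run_a, run_b):
    """Jaccard overlap of cited domains between two runs of one prompt."""
    a, b = cited_domains(run_a), cited_domains(run_b)
    if not a and not b:
        return 1.0  # neither run cited anything, so nothing changed
    return len(a & b) / len(a | b)


# Hypothetical source lists from two runs of the same prompt.
monday = [
    "https://www.acmeheating.example/boiler-service",
    "https://gov.example/gas-safety",
    "https://rival.example/blog",
]
tuesday = [
    "https://acmeheating.example/boiler-service",
    "https://gov.example/gas-safety",
    "https://another.example/guide",
]

print(source_overlap(monday, tuesday))  # 0.5: two shared domains out of four
```

A single low score means little on its own, because outputs vary between runs. What matters is whether your domain stays in the shared part.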

3. One industry-specific AI tool (optional)

If your industry has a specialised tool (legal research, medical info, property search), use that too.

What to Record

You don't need to log everything. Use a simple spreadsheet or Notion table with these columns:

Date
Prompt (the exact question)
Engine (Google, ChatGPT, Perplexity)
Your brand mentioned? (Yes / No / Position in answer)
Your content cited? (Yes / No / Which page?)
Competitors mentioned (list the top 3)
New findings (anything interesting about how the answer changed)

Run it once a week. Spend 20 minutes per engine. That's it.
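A spreadsheet is fine, but if you prefer a plain CSV file, the columns above map onto a small record like this. It's a minimal sketch: the `AuditRow` field names and the `append_row` helper are my own choices, and the values stay as free text so you can write "Yes, second paragraph" or a page URL.

```python
import csv
from dataclasses import asdict, dataclass, fields
from pathlib import Path


@dataclass
class AuditRow:
    date: str             # ISO format, e.g. "2026-03-27"
    prompt: str           # the exact question, unchanged week to week
    engine: str           # e.g. "Google", "ChatGPT", "Perplexity"
    brand_mentioned: str  # "Yes" / "No" / position in answer
    content_cited: str    # "Yes" / "No" / which page
    competitors: str      # top 3, separated by semicolons
    notes: str = ""       # anything interesting about how the answer changed


def append_row(path, row):
    """Append one audit row, writing the header only when the file is new or empty."""
    path = Path(path)
    is_new = not path.exists() or path.stat().st_size == 0
    with path.open("a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=[fld.name for fld in fields(AuditRow)])
        if is_new:
            writer.writeheader()
        writer.writerow(asdict(row))
```

A call such as `append_row("audit.csv", AuditRow("2026-03-27", "How often do you need a boiler service?", "Perplexity", "Yes", "No", "Rival A; Rival B"))` adds one line per prompt per engine, and the file opens directly in any spreadsheet tool.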

Interpreting the Data (Without Overclaiming)

Here's what you're looking for:

Good signs

Your brand appears in more prompts week-on-week
You're cited before competitors on your core topics
The answer structure favours your expertise (e.g., a local search puts local sources first)

Warning signs

Competitors are cited more consistently than you
Your content appears in only one or two engines
When you appear, you're always listed as "also consider" rather than primary

Noise (ignore)

Occasional variations in which sources are cited (this is normal)
Being absent from a single prompt (some queries simply don't call for your type of content)
Different answer phrasing on re-run (expected)

The point is: you're looking for trends over 4-6 weeks, not single-prompt data points.
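To see the trend rather than the individual data points, you can roll the log up by week. The sketch below assumes each logged row is a dict with a `date` in ISO format and a `brand_mentioned` value that starts with "Yes" or "No"; both field names are assumptions, so rename them to match your own sheet.

```python
from collections import defaultdict
from datetime import date


def weekly_mention_rate(rows):
    """Share of logged runs per ISO week in which the brand was mentioned."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for row in rows:
        year, week, _ = date.fromisoformat(row["date"]).isocalendar()
        key = (year, week)
        totals[key] += 1
        # "Yes", "yes" and "Yes, first paragraph" all count as a mention.
        if row["brand_mentioned"].strip().lower().startswith("yes"):
            hits[key] += 1
    return {key: hits[key] / totals[key] for key in sorted(totals)}


# Hypothetical log entries from two consecutive Friday audits.
rows = [
    {"date": "2026-03-27", "brand_mentioned": "Yes"},
    {"date": "2026-03-27", "brand_mentioned": "No"},
    {"date": "2026-04-03", "brand_mentioned": "Yes, first paragraph"},
    {"date": "2026-04-03", "brand_mentioned": "yes"},
]
rates = weekly_mention_rate(rows)
```

With only 10-20 prompts per week, one answer flipping moves the rate by several points, so read the direction over the full 4-6 weeks rather than any single jump.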

Turning Findings into Action

If your audit shows gaps, what do you do?

If you're not being cited at all on a core topic:

Check that the content exists and answers the prompt clearly
Make sure the answer is extractable (short paragraphs, lists, tables)
Ensure your credentials and business details are visible (About page, team bios, schema markup)
Refresh the page (add recent data, update examples, add an "updated" date)

If competitors are cited more:

Read what they wrote and how they structured the answer
Do you cover the same ground? If not, add that content
Is theirs more recent? Refresh yours
Do they have stronger credentials? Strengthen your About/Author/Entity schema

If you're mentioned but rarely linked:

Improve your entity markup so the AI knows your business name, location, and expertise
Add unique, original data the AI can attribute to you (test results, local data, original research)
Make sure your business name is consistent across your site and directories
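For the entity markup step, one common approach is a JSON-LD block using the schema.org `LocalBusiness` type. The sketch below generates one in Python. The type and property names (`name`, `url`, `address`, `addressLocality`, `sameAs`) are real schema.org vocabulary, but the business details are placeholders, the `local_business_jsonld` helper is hypothetical, and nothing here shows that a given AI engine reads this markup.

```python
import json


def local_business_jsonld(name, url, locality, same_as):
    """Build a schema.org LocalBusiness description as a JSON-LD string."""
    data = {
        "@context": "https://schema.org",
        "@type": "LocalBusiness",
        "name": name,  # use the exact spelling you use everywhere else
        "url": url,
        "address": {"@type": "PostalAddress", "addressLocality": locality},
        "sameAs": same_as,  # your profiles on directories and social sites
    }
    return json.dumps(data, indent=2)


# Placeholder details; replace with your real business information.
markup = local_business_jsonld(
    name="Acme Heating",
    url="https://www.acmeheating.example/",
    locality="Shaftesbury",
    same_as=["https://directory.example/acme-heating"],
)
```

The output goes inside a `<script type="application/ld+json">` tag on your site. Generating it from one source of truth is a simple way to keep the business name consistent across pages.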

Common Questions

"How long before I see results?"

Probably 2-4 weeks, though it varies. AI systems don't update in real time: a new or refreshed page has to be crawled and indexed before an engine can retrieve and cite it.

"Do I need to change my SEO strategy?"

No. The fundamentals still matter: crawlability, helpfulness, clear structure, original value. You're not optimising for AI in a special way. You're optimising because of AI.

"What if I'm in a niche where no AI Overviews appear?"

Focus on ChatGPT and Perplexity instead. These engines answer your prompt directly, so they don't depend on Google showing an AI Overview. Run the same audit there.

"Is this guaranteed to increase visibility?"

No. There are no guarantees in AI systems right now. But tracking your actual visibility is better than following a checklist and hoping. Data beats assumptions.

Next Steps

Build your prompt set this week (10-20 questions your customers actually ask)
Run the first audit on Friday (pick Google and one other engine)
Record your baseline
Run again the same day next week
Compare and adjust your content based on what you find

That's the workflow. It's not magic, but it's honest, repeatable, and it works.
