SEO

How to do keyword research that actually ranks

SEOany · July 1, 2026 · 7 min read

Most keyword research produces a spreadsheet nobody ranks with. The problem is rarely the tool — it's chasing big numbers instead of winnable, intent-matched queries. This guide is our own methodology, stated plainly: how to read intent before volume, weigh difficulty against the authority you actually have, group keywords into clusters instead of hunting them one by one, bank quick wins while the big terms mature, and adapt all of it now that AI search answers a growing share of queries before anyone clicks.

Why does keyword research fail even when the volume looks big?

Most keyword research fails because it optimizes for search volume instead of search intent. A high-volume keyword you rank for but that never matches what the searcher wanted converts no one. The first question is never 'how many people search this?' — it's 'what does this searcher expect to find, and can that page be us?'

Search intent sorts every query into four buckets — informational, commercial, transactional, and navigational — and those buckets rarely mix on one page. When Google shows ten product pages for a term, your blog post will not rank there no matter how thorough it is.

The fastest intent check is free: search the keyword and read the top ten results. The SERP is Google telling you, in public, exactly which intent it has decided the query deserves — buying guides, comparisons, tools, or definitions.

Search volume still matters, but only after intent qualifies the keyword. A 200-search-a-month term you can win and convert beats a 20,000-search term that sends buyers to a page which only explains.

Informational — 'what is', 'how to'; the searcher wants to learn, not buy.
Commercial — 'best', 'vs', 'review'; comparing before a decision.
Transactional — 'buy', 'pricing', 'near me'; ready to act now.
Navigational — a specific brand or product searched by name.

Is a keyword's difficulty score telling you the truth?

Only partly. Keyword difficulty estimates how hard a term is to rank in the abstract, but it does not know your site. The honest metric is difficulty relative to your own authority — a keyword is 'hard' only when the pages already ranking outclass what you can realistically publish and back with links.

Difficulty scores are modeled mostly on the backlinks of the current top results, so they reflect the incumbents, not you. A DR-30 site and a DR-80 site see the same number and should draw opposite conclusions from it.

Authority is the other half of the equation, and it is the half you carry into every query. New and low-authority sites should deliberately target lower-difficulty, longer-tail terms first, then climb — competing head-on for high-difficulty heads is how thin sites burn months for nothing.

Read the SERP for beatability, not just the number. If the top results are thin, outdated, or off-intent, a genuinely better page can outrank a scary difficulty score; if they are comprehensive and fresh, a green score is lying to you.

Difficulty also assumes your site is technically sound. A brilliant, low-competition keyword still won't rank on a page crawlers can't reach or render, so a technical SEO audit is the floor beneath any keyword bet.

Where do keywords worth targeting actually come from?

Good keyword lists come from five sources, not one: competitor pages that already rank, SEO tools' related-term databases, AI-generated semantic expansions, Google's People-Also-Ask and autocomplete, and real discussion on Reddit and forums. Any single source is biased; the value is combining them, then verifying every candidate against real data.

Competitor mining shows you demand that is already proven to convert, but it can only ever surface what rivals have already covered — it finds no gaps.

AI expansion is the opposite: it invents angles and long-tail keyword phrasings that tools miss, but it cannot know real demand. Every AI-suggested term is a hypothesis until you backfill actual volume and difficulty from a live data source — unverified AI words are noise, not findings.

People-Also-Ask and autocomplete hand you the exact phrasing real users type, which is gold for question-shaped content. Reddit and Quora threads surface the pain points and vocabulary no keyword tool captures until much later.

The discipline that turns five noisy sources into one usable list is deduplication and data backfill: merge everything, collapse near-duplicates, then attach real volume, difficulty, and SERP data before a single keyword earns a place on the plan.

Competitor pages — demand already proven to convert, but no blind spots revealed.
SEO-tool databases — high candidate volume with built-in metrics, plus noise to clean.
AI expansion — semantic and long-tail angles tools miss; must be data-verified.
People-Also-Ask & autocomplete — the real questions, in users' own words.
Reddit & forums — raw pain points and emerging phrasing before tools catch up.

Why group keywords into clusters instead of chasing them one by one?

Because Google ranks pages by intent, not by keyword, and dozens of phrasings often share one intent. Grouping every query that resolves to the same answer into a single cluster — one article per cluster — concentrates your effort and authority instead of splitting it across near-duplicate pages that cannibalize each other.

The reliable test for 'same cluster' is not how similar two keywords look — it's SERP overlap. If the top ten results for two queries are largely the same URLs, Google treats them as one intent and they belong on one page; if the results diverge, they need separate pages.

SERP-overlap clustering beats guessing by wording, because it catches two very different phrases that mean the same thing to Google — and splits two similar-looking phrases that Google actually reads as distinct intents.

One cluster becomes one page with a primary keyword, its long-tail variants, and the People-Also-Ask questions woven into subheadings. Clusters also map onto structure: related ones link to a shared pillar page, and disciplined internal linking between them tells Google you cover the whole topic, not one stray query.

How do you find keywords you can actually rank for right now?

Quick wins are keywords where intent fits, difficulty is low, and something about the current top results is beatable — thin content, stale dates, or off-intent pages. For a young site, a cluster of long-tail queries you can genuinely win this quarter is worth more than one head term you might reach next year.

The highest-value quick win is usually a long-tail term with clear commercial intent: lower volume, far lower competition, and a searcher much closer to acting. Ten such terms often out-earn one high-volume head term that only brings curious readers.

Existing pages that already rank on page two are the fastest wins of all — refreshing, expanding, and re-linking a position 11–20 page moves it up faster than any brand-new article ranks from zero.

Weakness in the current results is an opening: when the ranking pages are outdated, shallow, or written for a different intent than the query implies, a focused, genuinely better page can leapfrog a scary difficulty number.

Intent matches what you can credibly publish.
Difficulty is low relative to your site's authority.
Top results are thin, stale, or off-intent — genuinely beatable.
Page-two pages you already own, ready to refresh and re-link.

How does AI search change keyword research?

AI search shifts keyword research from keywords toward questions. As ChatGPT, Perplexity, and Google's AI Overviews answer more queries before a click, you increasingly optimize for the natural-language prompts people ask assistants — and for being the source those answers cite, not just a blue link they replace.

Head terms lose clicks to zero-click AI answers first, while specific, long-tail, intent-rich questions still pull users through to a page — so weighting your research toward genuine questions is now a hedge, not a nicety.

A practical new source is prompt mining: turning seed keywords into the natural-language questions people actually ask assistants — 'best X for Y', 'is X worth it', 'X vs Y' — and treating those as first-class keywords with their own pages.

The payoff shifts from ranking to citation. Structuring a keyword's answer as a tight, self-contained passage is how you get cited by ChatGPT, Perplexity, and AI Overviews, which is increasingly where the query ends.

You can even guide which pages engines lean on. Publishing an llms.txt manifest points them at your best answer pages — a small, cheap lever on a surface classic keyword tools cannot see.

Weight research toward questions and long-tail intent, not just head volume.
Mine the prompts people ask assistants as first-class keywords.
Format each answer to be quotable — win the citation, not only the rank.
Point engines at your best pages with an llms.txt manifest.

Let the agent run this playbook for you

Start free

GEOEntity SEO: build a brand AI engines cite SEOHow to pass Core Web Vitals in 2026 GEOHow to get cited by ChatGPT, Perplexity & AI Overviews SEOA technical SEO audit checklist for 2026 AEOWhat is llms.txt and do you need one?

← All posts