The Modern Search & AI Visibility Glossary (2026 Edition)

SEO and AI terms guide

SEO · AEO · GEO · LLMO · Entity Optimization

A reference for marketers, content strategists, and technical practitioners working across the full discovery ecosystem — traditional search engines, AI assistants, generative answer engines, and everything in between.


How to use this glossary

The lines between SEO, AEO, GEO, and LLMO blur in practice. A single piece of content can be crawled by Googlebot, retrieved by Perplexity, cited by ChatGPT, and summarized in Google’s AI Mode — all from the same URL. Terms are grouped by their primary domain, but most concepts cross over. Where a term has a common abbreviation, it is shown in parentheses.


1. Foundational Search Concepts

TermDefinition
Search Engine Optimization (SEO)The discipline of improving a website’s visibility in search engines (Google, Bing, etc.) to earn unpaid, organic traffic.
Search Engine Results Page (SERP)The page returned after a search query, now often a mix of links, AI summaries, ads, maps, videos, and rich features.
Organic TrafficVisitors who arrive from unpaid search listings, as opposed to paid ads, social, email, or direct visits.
Ranking FactorAny signal a search engine or AI system uses to decide the order or eligibility of results.
KeywordA word or phrase users type or speak when searching. Still useful, but increasingly secondary to intent and entities.
Search IntentThe underlying goal of a query, typically classified as informational, navigational, transactional, or commercial investigation.
QueryThe exact phrase entered into a search engine or AI assistant.
Long-Tail QueryA longer, more specific query (often 4+ words). Long-tail queries dominate AI assistant usage because users speak more naturally.
Zero-Click SearchA search where the user’s need is satisfied directly on the results page (snippet, AI Overview, map pack) without clicking any website.
Click-Through Rate (CTR)Percentage of users who click a given result after seeing it.
Bounce RateThe share of sessions that end without further interaction. Less emphasized today than engagement metrics.
Dwell TimeHow long a user remains on a page before returning to the SERP. A rough proxy for content satisfaction.
Search SatisfactionWhether the user’s underlying need was met. The end goal that most modern ranking signals try to approximate.

2. Crawling, Indexing & Infrastructure

TermDefinition
CrawlThe process of a bot fetching pages from the web.
IndexingStoring and organizing crawled pages so they can be retrieved in response to queries.
DeindexingRemoval of a URL from a search engine’s index, intentionally or otherwise.
Crawl BudgetThe number of URLs a search engine is willing to crawl on a site within a given period. Matters mainly for very large sites.
Bot / CrawlerAutomated software that fetches web pages. Includes traditional search crawlers and AI/LLM crawlers.
GooglebotGoogle’s primary web crawler.
BingbotMicrosoft Bing’s crawler, which also feeds ChatGPT Search and Copilot.
GPTBotOpenAI’s crawler used for training and retrieval.
ClaudeBotAnthropic’s crawler.
PerplexityBotPerplexity’s crawler.
Google-ExtendedA user-agent token Google uses to let publishers opt out of Gemini and Vertex AI training without affecting search rankings.
SitemapAn XML file listing a site’s important URLs to help crawlers discover content.
Robots.txtA plain-text file at the root of a site that tells crawlers which paths they may or may not access. Honor depends on the bot.
llms.txtA proposed plain-text file at the root of a site that gives LLMs a curated, structured map of the site’s most important content for retrieval and citation.
Canonical URLThe preferred version of a page when duplicates or near-duplicates exist.
HTTP Status CodesServer responses such as 200 (OK), 301 (permanent redirect), 302 (temporary), 404 (not found), 410 (gone), 500 (server error).
301 RedirectA permanent redirect that passes ranking signals to the destination URL.
404 ErrorA response indicating the requested page does not exist.
Soft 404A page that returns a 200 status but offers no real content, often misclassified by Google.
JavaScript SEOThe practice of ensuring JS-rendered content can be crawled, rendered, and indexed correctly.
RenderingThe step where a crawler executes a page’s JavaScript to see the final DOM.
Server Response TimeHow quickly a server begins returning a page; a component of Core Web Vitals.
Render-Blocking ResourcesCSS or JS that delays the browser from showing visible content.
Lazy LoadingDeferring the load of images, videos, or scripts until they are needed.
CDN (Content Delivery Network)A geographically distributed network that caches and serves assets closer to users.
Edge SEOApplying SEO changes (redirects, headers, A/B tests, schema injection) at the CDN edge rather than in the origin application.
Headless CMSA content management system that exposes content via API, with the front end built separately.
APIAn interface that lets systems exchange data programmatically.
Log File AnalysisReviewing server logs to study how bots crawl a site — what they hit, what they miss, what they waste.
SSL / TLS CertificateEncryption that enables HTTPS. A baseline trust and ranking signal.
Core Web VitalsGoogle’s set of user-experience metrics: LCP (loading), INP (interactivity, replaced FID in 2024), and CLS (visual stability).
Mobile-First IndexingGoogle’s standard practice of using the mobile version of a page as the primary basis for indexing and ranking.
Page ExperienceA composite signal covering Core Web Vitals, HTTPS, and absence of intrusive interstitials.

3. On-Page, Content & Semantic Optimization

TermDefinition
On-Page SEOOptimization applied directly to a page: content, headings, internal links, metadata, schema.
Off-Page SEOExternal factors influencing rankings: backlinks, brand mentions, citations, reputation.
Technical SEOOptimization of the underlying infrastructure: crawlability, indexing, performance, rendering, architecture.
Meta Title (Title Tag)The HTML <title> element, used as the clickable headline in most SERP listings.
Meta DescriptionA short summary in the page’s <meta> tag, often shown beneath the title in results.
Heading TagsHTML elements <h1> through <h6> that signal content structure to both users and machines.
Alt TextThe alt attribute describing an image — important for accessibility and for AI/image understanding.
URL SlugThe human-readable portion of a URL identifying the page.
Internal LinkingLinks between pages on the same domain, used to distribute authority and signal topical relationships.
Pillar ContentA comprehensive, central piece of content covering a broad topic in depth.
Topic ClusterA pillar page plus interlinked supporting pages, designed to demonstrate topical depth and authority.
Evergreen ContentContent that remains relevant and accurate over long periods.
Thin ContentPages with little original or useful information — a known risk for demotion.
Duplicate ContentSubstantially similar content on multiple URLs, on the same site or across sites.
Content FreshnessHow recently a page has been meaningfully updated. Important for time-sensitive topics.
Content DecayThe gradual decline in traffic and rankings as content ages or competitors improve.
Helpful ContentGoogle’s framing (since the 2022 Helpful Content Update) for content created primarily for people, not search engines.
ReadabilityHow easily a human reader can understand a piece of content. Often measured with formulas like Flesch-Kincaid.
NLP OptimizationStructuring writing — clear subjects, plain syntax, defined entities — so natural language processing systems can parse and reuse it.
Semantic SearchSearch that interprets meaning and context rather than matching exact keywords.
Semantic RelevanceHow closely a piece of content aligns with the meaning and intent behind a query, not just its words.
LSI Keywords“Latent Semantic Indexing” terms — a popular but largely debunked SEO concept. Google has stated it does not use LSI. The useful underlying idea is “topically related terms.”

4. Authority, Trust & E-E-A-T

TermDefinition
BacklinkAn inbound link from another website pointing to yours. Still a core authority signal.
Anchor TextThe clickable text of a hyperlink, which gives search engines context about the destination.
Domain Authority (DA)A third-party score (originally from Moz) estimating a domain’s ranking strength. Not used by Google itself.
Domain Rating (DR)Ahrefs’ equivalent metric, based on backlink profile.
Topical AuthorityThe degree to which a site is recognized as expert across a defined subject area.
E-E-A-TGoogle’s quality framework: Experience, Expertise, Authoritativeness, and Trustworthiness. The first “E” (Experience) was added in 2022.
YMYL (“Your Money or Your Life”)Google’s classification for topics that can materially affect health, finances, safety, or wellbeing — held to a higher E-E-A-T standard.
Brand MentionAn unlinked reference to a brand. Increasingly important as an authority and entity signal for both search and LLMs.
Citation (Local)An online listing of a business’s name, address, and phone number.
NAP ConsistencyKeeping Name, Address, and Phone identical across directories and platforms.
Reputation SignalsReviews, ratings, press coverage, and discussion across the web that shape both human and AI perception.

5. Structured Data & Entities

TermDefinition
Structured DataMachine-readable code that explicitly labels what a page is about.
Schema MarkupThe structured data vocabulary maintained at Schema.org, typically implemented as JSON-LD.
JSON-LDThe recommended format for adding schema, embedded in a <script> tag.
Rich ResultsEnhanced SERP listings — stars, FAQs, product info, recipe cards — driven by structured data.
Featured SnippetA highlighted answer box at the top of Google’s results, extracted from a ranking page.
Position ZeroCommon name for the featured snippet position, above the standard “blue links.”
Knowledge GraphGoogle’s database of entities (people, places, things, concepts) and the relationships between them.
Knowledge PanelThe branded info box appearing on the right side of Google results, drawn from the Knowledge Graph.
EntityA distinct, identifiable concept — a person, organization, place, product, or idea — that search engines and LLMs can recognize.
Entity SEOOptimizing for clearly defined entities and their relationships, not just keyword strings.
Brand EntityThe cluster of signals — name, descriptions, mentions, schema, Wikipedia/Wikidata presence — that establishes a brand as a recognized entity to machines.
Entity-Based OptimizationBuilding content and signals around concepts and their connections, often validated via knowledge graphs.
Wikidata / Wikipedia PresenceStrong external signals used by both Google and LLMs to verify and disambiguate entities.
Speakable SchemaA schema.org property designed to flag content suitable for voice assistant readout.
FAQ SchemaStructured data marking up question/answer pairs (note: Google has reduced FAQ rich result eligibility since 2023).

6. Local & Multi-Channel Search

TermDefinition
Local SEOOptimization for geographically targeted queries and map-based results.
Google Business Profile (GBP)Google’s business listing platform (formerly Google My Business), powering Maps and local pack results.
Map Pack / Local PackThe block of local business results shown with a map in Google search.
Local CitationA mention of a business’s NAP information on a third-party site.
Search Everywhere OptimizationThe practice of optimizing for visibility across every place users discover information — Google, Bing, YouTube, TikTok, Reddit, Amazon, Apple/Google Maps, and AI assistants.
Omni-Search VisibilityA brand’s combined presence across all of these discovery surfaces.
Platform SEOOptimization tailored to a specific platform’s algorithm — YouTube, TikTok, Amazon, Pinterest, App Store, etc.
Forum / Community OptimizationBuilding presence and helpful contributions on Reddit, Quora, Stack Exchange, and niche communities — increasingly important because LLMs heavily cite these sources.

7. AI Search, LLMs & Generative Optimization

This section covers the overlapping disciplines often labeled AEO (Answer Engine Optimization), GEO (Generative Engine Optimization), and LLMO (LLM Optimization). The boundaries between them are fuzzy; in practice they describe the same goal — being surfaced and cited by AI-mediated discovery — from slightly different angles.

7a. Core AI search vocabulary

TermDefinition
Large Language Model (LLM)A neural network trained on massive amounts of text to understand and generate language. Examples: OpenAI’s GPT, Anthropic’s Claude, Google’s Gemini, Meta’s Llama.
Generative AIAI systems that produce new content (text, images, audio, video) rather than only classifying or retrieving.
AI SearchA search experience powered primarily by generative AI, which synthesizes an answer from multiple sources rather than listing links.
AI AssistantA conversational AI product such as ChatGPT, Claude, Gemini, Copilot, or Perplexity.
Answer EngineA system designed to deliver direct answers (Perplexity, ChatGPT Search, Google AI Mode) rather than ten blue links.
AI SERPA search results page enhanced or replaced by AI-generated content.
AI OverviewGoogle’s AI-generated summary block that appears above traditional results for many queries (the successor to “Search Generative Experience” / SGE).
AI Mode (Google)Google’s dedicated generative search experience offering full conversational answers, launched broadly in 2025.
ChatGPT SearchOpenAI’s search feature inside ChatGPT, which retrieves and cites live web sources.
PerplexityA standalone answer engine that combines retrieval, citation, and conversation.
Copilot (Microsoft)Microsoft’s AI assistant, integrated with Bing search results.
AI SnapshotA generic term for any AI-generated summary appearing in a search interface.
Conversational SearchSearch expressed in natural, often multi-turn dialogue rather than terse keywords.
Multimodal SearchSearch combining text, images, voice, and/or video inputs and outputs.
Voice SearchSearch performed by speaking, typically through a phone, smart speaker, or in-car assistant.
Agentic SearchSearch performed by an autonomous AI agent that can browse, compare, and take actions (book, buy, summarize) on the user’s behalf.

7b. How AI systems find and use content

TermDefinition
Retrieval-Augmented Generation (RAG)An architecture where an LLM retrieves relevant external documents at query time and uses them to generate a grounded answer.
AI RetrievalThe lookup step in which an AI system gathers supporting documents before generating an answer.
Vector SearchRetrieval based on semantic similarity in an embedding space, rather than exact keyword matching.
EmbeddingA numerical vector that represents the meaning of text, an image, or another input — the unit of comparison in vector search.
ChunkingSplitting long documents into smaller, semantically coherent segments so they can be embedded and retrieved efficiently.
GroundingAnchoring an AI’s response in verified, retrievable source material to reduce hallucination.
AI HallucinationConfidently stated but incorrect or fabricated AI output.
Source AttributionThe AI system identifying which sources it used to construct an answer.
AI CitationA specific in-response reference (link, footnote, badge) pointing to a source.
AI MentionAny reference to a brand, product, or person inside an AI-generated response, with or without a link.
Knowledge RetrievalThe general process of an AI system locating and extracting information from indexed sources.
Context WindowThe maximum amount of text an LLM can consider in a single request — relevant to how much content a system can ingest before answering.
PromptThe input given to an AI model.
Prompt EngineeringThe craft of writing prompts to reliably produce useful outputs.
Prompt InjectionAn attack in which hidden instructions in a webpage or document attempt to manipulate an LLM’s behavior. A real risk for AI-readable content.
AI Training DataThe corpus used to train a model. Distinct from retrieval data, which is fetched at query time.
Fine-TuningAdditional training applied to a base model to specialize its behavior or knowledge.

7c. Optimizing for AI visibility

TermDefinition
Answer Engine Optimization (AEO)Structuring content so that answer engines and AI assistants can extract a direct, accurate response. Heavy on clear question-answer formatting, schema, and concise lead-ins.
Generative Engine Optimization (GEO)The broader practice of optimizing content to be retrieved, synthesized, and cited by generative AI search systems.
LLM Optimization (LLMO)Optimizing so that LLMs — both at training time and at retrieval time — can understand, attribute, and reproduce information about a brand or topic.
AI VisibilityHow often, and how favorably, a brand or source appears inside AI-generated answers. The AI-era equivalent of share-of-voice.
AI DiscoverabilityHow easily an AI system can find a brand or piece of content when it would be relevant.
AI CrawlabilityWhether AI bots can technically access a site (robots.txt, authentication, rendering).
AI IndexabilityWhether a site’s content can be parsed, chunked, and stored by AI systems for later retrieval.
Machine ReadabilityHow cleanly a system can interpret a page — clear HTML, semantic markup, plain language, accessible structure.
AI-Friendly ContentContent explicitly structured for machine consumption: clear claims, direct answers, defined entities, attributable statements.
Citation OptimizationWriting and structuring content to maximize the chance of being cited by AI systems (concrete facts, unique data, clear attribution, stable URLs).
Citation GraphThe network of who cites whom across the web — increasingly used by AI systems to weight authority.
Citation AuthorityThe likelihood that a given source will be referenced by AI systems on a given topic.
Contextual AuthorityAuthority that comes from covering the entire ecosystem of a topic, not just a single page.
AI Trust SignalsSignals — author bios, citations, schema, consistent brand entity, third-party validation — that lead AI systems to treat a source as reliable.
Retrieval SignalsWhatever cues a retrieval system uses to select content: freshness, relevance, authority, structure, embeddings quality.
AI Ranking SignalsThe factors that determine whether and how prominently an AI system features a source.
Source AuthorityThe perceived overall trustworthiness of a source as judged by an AI system.
Question OptimizationWriting content around the actual questions users ask, often in their own phrasing.
Conversational ContentContent that reads as if it directly answers a question, in the register of a knowledgeable conversation.
FAQ OptimizationStructuring FAQs (both on-page and in schema) for snippet and AI retrieval.
Knowledge EntityA clearly defined entity that AI systems can recognize and reason about.
Trust Layer OptimizationBuilding credibility signals — reviews, mentions, authorship, third-party validation — across the wider web, not just on-site.
Machine-First SEODesigning for machine consumption and human consumption simultaneously, rather than treating them as competing goals.
Parasite SEO / AI Parasite MarketingThe practice of ranking or being cited via high-authority third-party platforms (Reddit, LinkedIn, YouTube, major publications) rather than your own domain.
Digital Entity FootprintThe total picture of a brand across the web — owned, earned, and third-party — that defines it as an entity.
AI Search EcosystemThe combined environment of traditional engines, AI assistants, and answer engines that now shapes discovery.
Human + AI Search JourneyThe reality that a single buying or research journey now spans Google, ChatGPT, Reddit, YouTube, and others before a decision is made.

8. Analytics & Measurement

TermDefinition
ImpressionsThe number of times a piece of content has appeared in a results interface.
SessionsVisits to a website, as tracked in analytics.
UsersUnique visitors over a given period.
Engagement RateThe share of sessions considered meaningful (by duration, depth, or conversion). The metric that largely replaced bounce rate in GA4.
Conversion RateThe percentage of visitors who complete a defined goal.
Organic ConversionsConversions attributable to unpaid search traffic.
AttributionThe methodology used to assign credit for a conversion across the channels that touched it.
Key Performance Indicator (KPI)A specific, measurable metric tied to business outcomes.
Google Search Console (GSC)Google’s free tool for monitoring crawl, index, and search performance.
Bing Webmaster ToolsMicrosoft’s equivalent, also useful for understanding how content surfaces in Bing, Copilot, and ChatGPT Search.
Crawl ErrorsIssues — server, redirect, blocking, or not-found — that prevent crawlers from accessing content.
Index CoverageA report in Search Console showing which pages are indexed, excluded, or in error.
Share of AI VoiceAn emerging metric estimating how often a brand is named in AI-generated responses to relevant prompts.
AI Citation TrackingMonitoring which AI systems cite a brand or page, for which queries, with what framing.

Quick reference: the four “O”s

AcronymStands forPrimary focus
SEOSearch Engine OptimizationRanking in traditional search results (Google, Bing).
AEOAnswer Engine OptimizationBeing chosen as the direct answer in snippets and AI assistants.
GEOGenerative Engine OptimizationBeing retrieved, synthesized, and cited inside AI-generated answers.
LLMOLLM OptimizationBeing understood and reproduced correctly by large language models, both via training data and live retrieval.

In practice, the same well-structured, authoritative, machine-readable content tends to win across all four. The acronyms describe emphasis, not separate disciplines.


Last updated: 2026.

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *