CONTEXT JAMMING

Field notes from inside the context window.

CONTEXT JAMMING · JUNE 2026

The CMO's Guide to GEO

“The battleground has moved from the SERP to the context window. Your brand either earns permanent semantic weight inside frontier models — or it disappears from the buyer’s shortlist.”

Earning the Weights™B2B Enterprise90-Day PlaybookAgentic Search Infrastructure

The enterprise software marketing stack is experiencing a silent structural failure. Traditional SEO metrics continue to report green while high-intent buyer discovery has migrated entirely into zero-click generative interfaces. The new battlefield is not ranking position — it is presence in the weights. CMOs who continue optimizing for blue links while competitors earn citation absorption and narrative influence inside Claude, Gemini, and Perplexity are systematically ceding shortlist ownership.

§ 01 · LEGACY DASHBOARDS ARE STRUCTURALLY OBSOLETE

Your SEO Dashboard Is Measuring a Game Buyers Stopped Playing

Zero-click generative discovery has broken traditional attribution. Google Analytics increasingly reports “Direct” or “Brand Search” for journeys that actually began inside Claude, Perplexity, or Gemini context windows. The model surfaces a synthesized recommendation; the user never sees a blue link.

MIT IDE research (Sinan Aral et al.) on attention concentration in AI-mediated environments shows that once a model commits to a shortlist of 2–4 entities, subsequent human discovery is heavily path-dependent. Legacy SERP rank has almost no causal relationship with inclusion in that shortlist.

“We spent the last two decades optimizing for the algorithm… That era is over.”
— Bret Kerr
SOCIAL FRAGMENT

Your SEO dashboard says you’re winning. The models that actually decide your shortlist have never seen your rankings.

§ 02 · FROM RANKINGS TO WEIGHTS

What “Earning the Weights” Actually Measures

Measurement ObjectiveLegacy SEO MetricModern GEO MetricOperational Definition
VisibilitySERP Position / RankShare of Model (SoM)Percentage of synthetic buyer-intent prompts where the brand is explicitly mentioned or recommended.
EngagementClick-Through RateCitation Absorption RateSemantic overlap + narrative influence on the final model output.
AuthorityDomain Rating / BacklinksEntity Verification DensityFrequency of co-citation with canonical analyst reports and community sources.
PerceptionBounce Rate / Time on PageContextual SentimentPositive / Neutral / Negative framing by frontier models in comparative evaluations.

Toggle to see how the measurement ontology shifts when the buyer is an agent, not a human.

Princeton KDD 2024 work on Position-Adjusted Word Count and Subjective Impression established the foundations. The 2026 evolution (Zhang & Yao, arXiv:2604.25707) distinguishes Citation Selection (being chosen for the list) from Citation Absorption (actually shaping the synthesized answer).

Perplexity surfaces 16.35 citations on average but with lower absorption scores. ChatGPT selects fewer sources (6.88) but achieves dramatically higher absorption (0.2713 vs Google’s 0.0584 in the paper’s framing).

As Will Bryk, CEO of Exa, articulates, agents are “crazy creatures with infinite patience” — they issue paragraph-length queries and demand 1,000–10,000 results. Legacy BM25 keyword search collapses under this load. Exa’s neural embedding approach, combined with Highlights (20x token reduction via semantic extraction) and Deep Max parallel tool calling, is the infrastructure layer that actually serves the agentic era. The “token apocalypse” is real; only architectures built for dense, high-recall retrieval survive.

§ 03 · CONNECTING WEIGHTS TO REVENUE

The Tri-Layer Attribution Model That Satisfies Finance

  1. Server-edge bot verification (Cloudflare, Vercel, Profound) — deterministic leading indicator of model ingestion.
  2. Continuous prompt auditing — 50–200+ synthetic buyer-intent prompts per week across frontier models for probabilistic correlation with SoM movement.
  3. Self-reported CRM intake + win-loss tagging — lagging indicator but the only one finance trusts for pipeline attribution.
CASE · MERGE

7× YoY demo requests from LLM recommendations. 6× higher conversion rate versus legacy organic traffic.

CASE · KITEWORKS

53,000 links autonomously restructured → 79% expansion in AI Overview citations → 300% non-brand traffic lift.

§ 04 · BUILD VS BUY THIS QUARTER

The Four Infrastructure Models That Actually Work

Infrastructure TypeRepresentative ToolsCore MechanismPrimary B2B Use Case
Simulated UI ScrapingPeec AI, ZipTie.devPrompt execution + scraping across LLM front-endsExecutive reporting, competitive benchmarking
Server-Side Log IngestionProfound Agent AnalyticsCDN edge logs + bot IP verificationTechnical auditing, ingestion verification
Semantic & Structural ExecutionQuattr Autonomous Linking APIVector embeddings + dynamic internal link graphFixing indexation decay at enterprise scale
LLM Output EvaluationLangSmith, Helicone, PhoenixAPI-level trace extraction & brand-heuristic gradingHigh-volume regression testing by data science teams

The deeper shift, as detailed in §08, is the move from tools that help humans browse to infrastructure that serves autonomous agents at scale. Exa’s approach (neural retrieval + Highlights + RL-optimized trajectories) is the clearest current example of the “agentic search infrastructure layer.”

§ 05 · HIGH BIT-RATE COMMUNICATION WINS

What Actually Survives Retrieval and Shapes the Final Answer

SAGEO Arena (Yonsei) demonstrated that optimizing body text purely for fluency can tank BM25 retrieval by 22+ positions. Structural clarity wins over prose elegance.

Non-negotiables: schema, dense headings, machine-readable tables.

Princeton levers with measured lifts: Statistics Addition (+41%), Quotation Addition (+28%), Cite Sources (+34%).

“Having good docs that are out there, social proof, being posted on Reddit a little more — all of that helps your case tremendously.”
— Calvin French-Owen, co-founder of Segment (acquired $3.2B), ex-OpenAI Codex

Over 80% bias in high-effort model settings toward earned third-party validation (analyst reports, deep technical forums, GitHub health). Claude Opus 4.8 and Fable 5 are literalist at high reasoning effort.

§ 06 · FROM THEORY TO OPERATIONAL MATURITY

A Phased Execution Roadmap for Series B–Pre-IPO Teams

Phase 1 (Days 1–30)
Instrumentation & Baseline Capture
  • Define 50–100 complex, constraints-based buyer-intent prompts
  • Deploy edge observability (Profound or native Cloudflare)
  • Run “Zero State” prompt suite → capture baseline Share of Model + Contextual Sentiment
Phase 2 (Days 31–60)
Structural Repair + Pipeline Tagging
  • Semantic internal linking + schema on pillar pages
  • CRM interactive intake forms + dedicated “AI Discovery” pipeline stage tag
  • Density Injection on top 10 product pages (swap adjectives for verifiable stats + expert quotes)
Phase 3 (Days 61–90)
Regression Testing + Board Narrative
  • Automated weekly prompt regression suite
  • Triangulate server logs → SoM deltas → CRM self-reported deals
  • Build the “AI Market Share” board narrative (Citation Selection vs deep Citation Absorption)
§ 07 · AVOIDING VANITY METRICS AND CITATION CLIFFS

The Perplexity Trap and the Multi-Query Spillover Effect

69% risk of semantic drift when hyper-optimizing on single prompts.

The Perplexity Trap: Citation Breadth ≠ Citation Absorption. Low TF-IDF / paragraph coverage equals zero commercial impact even if the brand appears.

Version drift and temporal decay create citation cliffs after model updates.

The only sustainable path is continuous structural clarity + ecosystem validation — not one-time content sprints.

§ 08 · THE AGENTIC SEARCH INFRASTRUCTURE LAYER (EXA PARADIGM)

Preparing Infrastructure for the GPT-5 Era

Exa’s neural embedding approach represents the first real departure from legacy BM25 and keyword heuristics. As Will Bryk explains, Google was built for humans typing short queries. Agents are “crazy creatures with infinite patience” — they send paragraph-length, multi-constraint queries and expect the system to return and synthesize from 1,000–10,000 sources.

The “token apocalypse” is the direct result: naive retrieval dumps the entire page into the context window. Exa Highlights performs semantic extraction at the edge, delivering 20x token reduction while preserving the signal that actually moves the model.

RL training on agent trajectories benefits enormously from denser learning signals. Better retrieval leads to better trajectory decomposition, higher pass@k, and surprisingly strong transfer even when the inference backend is downgraded. Matryoshka embeddings, binary compression, and hierarchical clustering allow Exa to operate at quadrillion-page scale without collapse.

The Bitter Lesson applies here too: the winning search infrastructure will be the one that scales compute and data most effectively for the agentic workload, not the one that hand-crafts the best heuristics for today’s models.

AspectHuman Consumer SearchAutonomous Agentic Search
Query ComplexityShort keywords (2-5 words)Paragraph-length, 50-500+ tokens with constraints
Results DesiredTop 10 blue links1,000–10,000+ sources for synthesis
Latency Tolerance< 300ms perceivedMinutes of deep research acceptable
Token BudgetMinimal (user attention)Massive — the "token apocalypse"
Success MetricClick + dwellCitation Absorption + trajectory success

As Will Bryk notes, preparing for GPT-5 era agents requires architectures built for this dichotomy, not incremental improvements on human search.

“In SEO you compete for ranking position in a list. There is a Page Two. In GEO you compete for selection in a synthesized recommendation. The model’s confidence and framing around your product is the whole ballgame. There is no Page Two.”
“Earning the weights inside frontier models is not a marketing hack. It requires a fundamental organizational commitment to structural clarity, deep technical documentation, and systemic ecosystem validation.”
SIMULATE A BUYER-INTENT PROMPT

See the difference strong GEO makes (Human vs Agentic query complexity)

WEAK GEO

Several vendors offer solutions in this space including Kiteworks, Varonis, and others. Consider factors like integration and cost.

STRONG GEO

For regulated industries requiring verifiable audit trails and sovereign control, platforms like Kiteworks stand out in analyst reports and technical evaluations for their ability to maintain data residency while enabling secure collaboration. Multiple technical operators and security researchers cite its architecture for high-stakes environments.

Illustrative only. Real models vary by version, temperature, and system prompt.

KEY OPERATIONAL METRICS
Average influence lag
2–4 quarters
Leading practitioner cadence
50–200+ prompts / week
Kiteworks structural lift
79% citation expansion
Merge outcome
7× demo requests
WORKS CITED & SOURCES

Zhang & Yao (2026). Citation Selection vs Citation Absorption. arXiv:2604.25707

Princeton KDD 2024 — Position-Adjusted Word Count and Subjective Impression.

SAGEO Arena (Yonsei) — Retrieval vs Fluency Tradeoffs. arXiv:2602.12187

Quattr / Kiteworks internal data (2025–2026 structural linking experiments)

Peec AI / Merge case studies on LLM-driven demo velocity

Calvin French-Owen (Segment / OpenAI) on high bit-rate communication and third-party validation.

MIT IDE — Attention concentration in generative discovery interfaces.

Perplexity & OpenAI model behavior reports (citation volume vs absorption analysis, 2025–2026).

Will Bryk (Exa) — Sacra interview & a16z investment thesis on neural search for agents (2025-2026).

Exa Blog — “RL Outcomes,” “WebCode: Contamination-Free Agent Evals,” “Highlights: 20x Token Reduction” (2026).

CONTEXT JAMMING // EDITORIAL DISPATCH
The CMO's Guide to GEO · June 2026 · Live on contextjamming.com
This is a Context Jamming editorial dispatch on architectural determinism in AI-mediated markets.
Return to contextjamming.com

§ · Invoice No. 001 · The Build Ledger

The Ledger.

Filed · contextjamming.com

What a conservative mid-market digital agency would have quoted for the same scope, itemized against what this site actually cost. Agency numbers are the floor — not the premium brand-studio tier.

TIME

12 weeks

2 days

~42× faster

COST

~$150,000

~$300

~500× cheaper

TEAM

5-person agency

1 human + 3 models

Same deliverable

§ Itemized — what a mid-market agency SOW would have billed

Discovery · brand positioning · workshops40–80 hr$10,000
Design system · Figma tokens · 3 rounds60–120 hr$18,000
Wavesurfer audio carousel · single-track context60–100 hr$16,000
Dual lightbox systems · focus trap · keyboard30–50 hr$8,000
LLM product flows · streaming · state machine80–160 hr$26,000
Stripe · checkout · webhooks · env hardening40–80 hr$10,000
Editorial routes · 6 sub-pages · templates60–100 hr$14,000
Accessibility pass · aria · reduced-motion40–80 hr$10,000
QA · cross-browser · mobile matrix60–100 hr$14,000
Cross-publication rebrand · masthead + IA · 2026-04-2820–40 hr$6,000
Subtotal~700 hr$126,000
Project management · 18% overhead$24,000
Agency total — conservative floor~700 hr~$150,000
Actually spent · Claude + Gemini stack~20 hr~$300

Agency figure assumes ~700 billable hours at $200/hr blended, plus ~18% PM overhead — the conservative floor of a mid-market SOW. Premium brand studios would have quoted 2–3× that. Stack: Antigravity (orchestrator), Claude Opus 4.8 (auditor), Codex (adversary), Cloudflare Workers / OpenNext.

§   Colophon

How this site is made.

Vol. 26 · build log

Every page on contextjamming.com is the output of a real-time, three-body Mixture-of-Experts loop. One model orchestrates. Two consult. The human holds the thesis. No single model commits alone.

View Redesign Assessment →

Orchestrator

Antigravity

Google DeepMind

  • Primary author
  • Terminal-native, direct push to Cloudflare
  • Audit trail to GitHub on every commit
  • Adaptive thinking · effort: extra-high

Auditor

Claude Opus 4.8

1M context

  • Editorial critic
  • Code review before merge
  • Backup-of-record
  • Co-signs every commit

Adversary

Codex

Cross-model MoE

  • Factual adjudication
  • Structural dissent
  • Deep Research → semantic triples
  • Caught the Donelan incident

Stack

Next.js
16.2 · App Router
React
19.2
TypeScript
5
Tailwind
v4 · @theme inline
@opennextjs/cloudflare
adapter
wrangler
Pages deploy
framer-motion
transitions
wavesurfer.js
audio waveforms

Typeset in

Fraunces
variable · opsz + SOFT
Playfair Display
debate display
IBM Plex Mono
editorial metadata
Geist Mono
utility mono
Caveat
grease-pencil marginalia
All via
next/font/google
Palette
single @theme block
No dupe tokens
ever

Infrastructure

Deploy
Cloudflare Workers / OpenNext
ISR
30-min revalidate · Cloudflare-served
Repo
github.com/BretKerrAI/founderfile
Branch
main
Analytics
Google Tag Manager
Apex
contextjamming.com
Runtime
Node 24
Build tool
Turbopack
       human intent
            │
            ▼
   ┌────────────────────┐         ┌─────────────────┐
   │    Antigravity     │  ◄────► │ Claude Opus 4.8 │      ← auditor loop
   │    (orchestrator)  │         │     (auditor)   │
   └─────────┬──────────┘         └─────────────────┘
             │  ◄───────────┐
             ▼              │
       ┌──────────┐    ┌────┴───────┐
       │Cloudflare│    │   Codex    │          ← adversarial loop
       │ Workers  │    │            │
       └─────┬────┘    └────────────┘
             │
             ▼
       contextjamming.com
             │
             ▼
       ┌──────────────┐
       │   Git push   │         ← audit trail
       └──────────────┘
Assembled on Mac in Terminal · Filed from Franklin, MAContext Jamming · ACRA Insight LLC · MIT License · FounderFile.ai · RelationalIntelligence.xyz · Commission a Dispatch →