CONTEXT JAMMING

Field notes from inside the context window.

MULTIMODAL UNIX · MMPipe

The Unix pipe,
for everything.

The Unix pipe (|) assumes everything is text. When you pipe binary media—an image, a video frame, an audio clip—the receiving program has no idea what it is looking at. MMPipe fixes this with a lightweight framing protocol so vision models return plain UTF-8 that grep, awk, and sed can consume exactly as Thompson intended.

Universal Media Pipe ProtocolZero DependenciesLegacy Tools Untouched
THE SACRED CONTRACT

Universal Media Pipe Protocol (UMPP)

Every UMPP stream begins with a signature header block that any downstream program can read instantly:

MMP/1.0
Content-Type: image/png
Content-Length: 84321

<raw binary payload>

The magic bytes MMP/1.0\n let tools distinguish a framed media stream from raw bytes or plain text. No magic-byte sniffing required in the happy path (though analyze_vision gracefully falls back when you just cat a file).

THE PIPELINE

How it works

StepCommandRole
01mcatEmits UMPP-framed binary to stdout
02|Standard Unix pipe — completely unchanged
03analyze_visionReads frame, calls Gemini, prints UTF-8 text
04grep / awk / sedConsumes plain text exactly as before

The multimodal complexity is fully contained inside analyze_vision. The boundary between programs stays plain text — the original Unix contract is restored.

IMMEDIATE VALUE

Usage

BASIC — DESCRIBE AN IMAGE
mcat photo.jpg | analyze_vision
WITH A TARGETED PROMPT
mcat diagram.png | analyze_vision "List every label visible in this diagram."
PIPE THE RESULT INTO CLASSIC TOOLS
mcat screenshot.png | analyze_vision "What colors appear?" | grep -i red
WORKS WITH PLAIN CAT (FALLBACK)
cat photo.jpg | analyze_vision "What is this?"

Without GEMINI_API_KEY, analyze_vision runs in mock mode and emits a deterministic placeholder. Perfect for testing pipelines.

V2 — ERGONOMICS LAYER

Three progressive enhancements

mmpick

Native macOS file picker (osascript). Emits one MMP/1.0 envelope per selected file with an X-MMP-Source header. Supports multi-select.

Source once via shell/mmpick.sh

mmkitty

Non-destructive Kitty Graphics Protocol bridge. Renders an inline thumbnail and passes the original UMPP envelope downstream unchanged.

Works in Kitty; graceful text fallback elsewhere.

mmtui

Full-screen Textual TUI. File browser + pipeline builder + live output. Keyboard-driven composition of complex media → text pipelines.

pip install textual

PHILOSOPHY

The Sacred Contract restored

All the fancy multimodal reasoning happens inside a single, well-behaved program. Everything on either side of the pipe continues to speak the only language Unix ever promised: plain text.

MMPipe is deliberately dependency-free in the core (Python stdlib only) and ships as a single executable script plus a handful of optional ergonomic bridges. It is the minimal, correct extension that makes the 1970s pipe model work in 2026.

ONE-LINE INSTALL
git clone https://github.com/bretkerr/MMPipe.git
cd MMPipe
bash install.sh
source ~/.zshrc   # or ~/.bashrc
Requirements: Python 3.10+ (stdlib only for core), GEMINI_API_KEY (optional — mock mode works without it). No virtualenvs, no pip for the base experience.
VIEW SOURCE ON GITHUB →

Imported from bretkerr/MMPipe. A pure expression of Unix philosophy applied to the multimodal era.

CONTEXT JAMMING // MULTIMODAL UNIX
MMPipe — because the pipe was never the problem. The assumption that everything is text was.

§ · Invoice No. 001 · The Build Ledger

The Ledger.

Filed · contextjamming.com

What a conservative mid-market digital agency would have quoted for the same scope, itemized against what this site actually cost. Agency numbers are the floor — not the premium brand-studio tier.

TIME

12 weeks

2 days

~42× faster

COST

~$150,000

~$300

~500× cheaper

TEAM

5-person agency

1 human + 3 models

Same deliverable

§ Itemized — what a mid-market agency SOW would have billed

Discovery · brand positioning · workshops40–80 hr$10,000
Design system · Figma tokens · 3 rounds60–120 hr$18,000
Wavesurfer audio carousel · single-track context60–100 hr$16,000
Dual lightbox systems · focus trap · keyboard30–50 hr$8,000
LLM product flows · streaming · state machine80–160 hr$26,000
Stripe · checkout · webhooks · env hardening40–80 hr$10,000
Editorial routes · 6 sub-pages · templates60–100 hr$14,000
Accessibility pass · aria · reduced-motion40–80 hr$10,000
QA · cross-browser · mobile matrix60–100 hr$14,000
Cross-publication rebrand · masthead + IA · 2026-04-2820–40 hr$6,000
Subtotal~700 hr$126,000
Project management · 18% overhead$24,000
Agency total — conservative floor~700 hr~$150,000
Actually spent · Claude + Gemini stack~20 hr~$300

Agency figure assumes ~700 billable hours at $200/hr blended, plus ~18% PM overhead — the conservative floor of a mid-market SOW. Premium brand studios would have quoted 2–3× that. Stack: Antigravity (orchestrator), Claude Opus 4.8 (auditor), Codex (adversary), Cloudflare Workers / OpenNext.

§   Colophon

How this site is made.

Vol. 26 · build log

Every page on contextjamming.com is the output of a real-time, three-body Mixture-of-Experts loop. One model orchestrates. Two consult. The human holds the thesis. No single model commits alone.

View Redesign Assessment →

Orchestrator

Antigravity

Google DeepMind

  • Primary author
  • Terminal-native, direct push to Cloudflare
  • Audit trail to GitHub on every commit
  • Adaptive thinking · effort: extra-high

Auditor

Claude Opus 4.8

1M context

  • Editorial critic
  • Code review before merge
  • Backup-of-record
  • Co-signs every commit

Adversary

Codex

Cross-model MoE

  • Factual adjudication
  • Structural dissent
  • Deep Research → semantic triples
  • Caught the Donelan incident

Stack

Next.js
16.2 · App Router
React
19.2
TypeScript
5
Tailwind
v4 · @theme inline
@opennextjs/cloudflare
adapter
wrangler
Pages deploy
framer-motion
transitions
wavesurfer.js
audio waveforms

Typeset in

Fraunces
variable · opsz + SOFT
Playfair Display
debate display
IBM Plex Mono
editorial metadata
Geist Mono
utility mono
Caveat
grease-pencil marginalia
All via
next/font/google
Palette
single @theme block
No dupe tokens
ever

Infrastructure

Deploy
Cloudflare Workers / OpenNext
ISR
30-min revalidate · Cloudflare-served
Repo
github.com/BretKerrAI/founderfile
Branch
main
Analytics
Google Tag Manager
Apex
contextjamming.com
Runtime
Node 24
Build tool
Turbopack
       human intent
            │
            ▼
   ┌────────────────────┐         ┌─────────────────┐
   │    Antigravity     │  ◄────► │ Claude Opus 4.8 │      ← auditor loop
   │    (orchestrator)  │         │     (auditor)   │
   └─────────┬──────────┘         └─────────────────┘
             │  ◄───────────┐
             ▼              │
       ┌──────────┐    ┌────┴───────┐
       │Cloudflare│    │   Codex    │          ← adversarial loop
       │ Workers  │    │            │
       └─────┬────┘    └────────────┘
             │
             ▼
       contextjamming.com
             │
             ▼
       ┌──────────────┐
       │   Git push   │         ← audit trail
       └──────────────┘
Assembled on Mac in Terminal · Filed from Franklin, MAContext Jamming · ACRA Insight LLC · MIT License · FounderFile.ai · RelationalIntelligence.xyz · Commission a Dispatch →