CONTEXT JAMMING

Field notes from inside the context window.

CONTEXT JAMMING · DEEP RESEARCH

The AI-Biology Convergence

“The life sciences are rapidly transitioning into a discipline of systemic engineering, driven by generative world models and agentic AI. The future of molecular design lies in bridging physical sequence generation with deterministic bioinformatics at scale.”

Generative World ModelsAgentic AIBiotech InfrastructureTarget Identification

The biological sciences are experiencing a computational and macroeconomic inflection. As genomics alone scales toward requiring 110 petabytes of storage daily—outpacing global media platforms—the historical paradigm of observational, hypothesis-driven biology is collapsing. The replacement is a programmable discipline of systemic engineering driven by two distinct computational pillars: physics-based generative world models of proteins, and reasoning-based agentic bioinformatic orchestrators.

§ 01 · THE PARADIGM SHIFT IN COMPUTATIONAL BIOLOGY

Biology as a Programmable Discipline of Systemic Engineering

Biology is fundamentally programmable. Every living organism on Earth shares a universal genetic alphabet. Computational translation and generation of this data has transitioned from a theoretical exercise into an active engineering reality. Genomics is generating data at an unprecedented, planetary scale, calling for a radical re-evaluation of analytical infrastructure.

Within this landscape, two complementary computational initiatives represent the future of biomedical discovery. The first leverages physical, generative "world models" of biology (spearheaded by EvolutionaryScale and the Chan Zuckerberg Biohub) to simulate evolution and engineer novel proteins. The second initiative leverages autonomous agentic AI (championed by Anthropic) to navigate brittle biological databases and automate complex reasoning pipelines.

“Biology is fundamentally programmable; every living organism shares a universal genetic alphabet, and the ability to translate, understand, and computationally generate this biological data is no longer a theoretical exercise but a functional reality.”
INTELLIGENCE ECHO

Genomics generates up to 110 petabytes per day. We aren't just reading code anymore; we are compiling biological machinery from scratch.

§ 02 · GENERATIVE WORLD MODELS

EvolutionaryScale ESM3: Programming Biology from First Principles

The release of ESM3 by EvolutionaryScale marks a critical milestone in generative biology. Unlike previous models that predict structures from sequence, ESM3 is a multimodal generative model that evaluates sequence, structure, and functional annotations simultaneously. This allows researchers to provide structural coordinates or annotations as prompts, directing the model to generate novel, functional amino acid sequences.

The computational scale of ESM3 is unprecedented. The 98-billion-parameter frontier model was trained on 2.78 billion protein sequences using 25x more FLOPs and 60x more data than ESM2, requiring one trillion teraflops of computational power. This scale unlocked the emergent capability of atomic coordination—designing proteins from prompts specifying exact atomic locations of distant amino acids that must interact in the folded 3D macromolecular structure.

SpecificationESM2 (Previous)ESM3 (Current Frontier)Operational Impact
Model SizeUp to 15 Billion Parameters98 Billion ParametersScales structural representation.
Training CorpusSub-billion protein sequences2.78 Billion protein sequencesCaptures complete global protein diversity.
Compute ScalingBaseline computational threshold25x FLOPs relative to ESM2Enables high-resolution atomic coordination.
Architecture TypeSequence-to-structure predictionAll-to-all generativeSimultaneous Sequence, Structure, & Function.
Key CapabilityStatic structure mappingProgrammable generation & self-correctionDesigns proteins from functional constraints.

The most profound empirical demonstration of ESM3's capability is its generation of a novel Green Fluorescent Protein (GFP). Fluorescent proteins are crucial molecules in biological imaging and phenotypic screening. Using a chain-of-thought prompting methodology, ESM3 synthesized a biologically active fluorescent protein that shares only 58% sequence identity with any natural counterpart. This compressed over 500 million years of evolutionary drift into a single, highly parallelized computational process.

§ 03 · SYSTEMIC MODELING

The Chan Zuckerberg Biohub and the Path to the Virtual Cell

While EvolutionaryScale targets granular molecular dynamics, the Chan Zuckerberg Biohub is working to integrate biological systems into a predictive "Virtual Cell." This model aims to link genomic, proteomic, and transcriptomic layers into a unified mathematical representation of human and animal physiology.

This initiative relies on massive biological datasets. The Chan Zuckerberg Initiative (CZI) developed CELLxGENE, the world's largest open-source corpus of single-cell data, and launched the "Billion Cells Project" in early 2025 to generate one billion highly characterized cells. Additionally, CZI released TranscriptFormer, a multi-species generative model trained on 112 million individual cells across 12 distinct species.

PROJECT · CELLXGENE

Standardized, curated single-cell transcriptomic corpus serving as the foundational training dataset for virtual cell architectures.

MODEL · TRANSCRIPTFORMER

Generative transcriptomic model trained on 112 million single cells representing 1.5 billion years of evolutionary data.

Integrating molecular protein logic with large-scale transcriptomic databases creates a continuous feedback loop. Predictions made in the dry lab can be physically synthesized and validated using Cryo-Electron Microscopy (Cryo-EM) and automated assays, accelerating basic biomedical research and therapeutic engineering.

§ 04 · AGENTIC AI & BIOINFORMATICS

Navigating Fragile Data Landscapes: BioMysteryBench and VirBench

Unlike physical modeling, daily research workflows face a structural bottleneck: biological data is highly fragmented and brittle. Public databases like the NCBI Virus portal rely on conventional, legacy web interfaces that require manual navigation, an inefficiency known to computational biologists as the "click tax".

To evaluate LLM reasoning capabilities in this data environment, researchers developed BioMysteryBench (99 complex bioinformatics challenges evaluated against physical PCR ground truth) and VirBench (120 complex viral sequence queries). Unassisted LLMs showed high error rates and run-to-run variability. In one case, incomplete database fetching led to an erroneous ebolavirus phylogenetic tree that hallucinated the outbreak origin (TMRCA) back to 1922 rather than January 2014, misjudging mutating epitopes for therapeutics like maftivimab and MBP134.

Model ContextUnassisted Mean AccuracyAccuracy with gget virusPrimary failure mode mitigated
Claude Sonnet 4 (Anthropic)16.9%>90.0%Inconsistent contextual metadata application
Biomni OSS (Stanford)Variable / Unstable>90.0%Web interface hallucination & navigation loops
GPT-5.5 (OpenAI)91.3%99.7%Premature sequence cutoffs on large batches
§ 05 · INFRASTRUCTURE CRISIS & DECISION ENGINES

Standardizing Agentic Environments: Biomni, gget, and Software Economics

Resolving agent inaccuracy requires building deterministic retrieval layers to serve as agent guardrails. The tool gget virus coordinates NCBI API calls to bypass brittle web portals, force determinism, and return standardized machine-readable execution logs.

At the platform level, Stanford's Biomni environment coordinates reasoning with biological datasets, utilizing the Biomni-R0 reasoning model (a Qwen-32B architecture optimized via reinforcement learning). This setup can plan complex CRISPR screens, annotate single-cell RNA-seq, and evaluate ADMET profiles.

“The National Institutes of Health operates with a budget exceeding $51.96 billion, yet direct grant funding dedicated to developing and maintaining analytical software is practically nonexistent.”

As detailed in Elliot Hershberg's analysis for New Science, biology software infrastructure suffers from a broken funding paradigm. Historically, the Human Genome Project fostered an open-source ethos—typified by Jim Kent writing the "GigAssembler" code in four weeks to keep the genome public—that accustomed the community to free software. Today, academic researchers are evaluated on publications rather than maintenance, resulting in a "tsunami of unusable tools" where roughly one-third of published bioinformatics software is no longer installable.

CASE · ARENA BIOWORKS

Launched with $500M in early 2024 to consolidate AI target discovery, chemo-proteomics, and clinical validation under one roof. Closed in November 2025 due to biotech macroeconomic contraction, proving that biological translation is highly capital-intensive and resistant to pure software-style iteration.

MACRO · NIH SOFTWARE BUDGET

The NIH budget exceeds $51.96B, but direct grant funding for software maintenance and Research Software Engineers is near-zero. One-third of published bioinformatics tools are no longer installable, leading to a brittle base for AI agents.

§ 06 · TARGET IDENTIFICATION

Translating AI Insights into Pancreatic Beta-Cell and Small-Molecule Discovery

For metabolic disease research, the convergence of generative structural modeling and deterministic agentic workflows represents a paradigm shift. The pancreatic beta cell controls systemic glucose homeostasis; when beta-cell mass declines, metabolic disease develops. Identifying small molecules that promote beta-cell proliferation or protect them from inflammatory decay is a vital therapeutic objective.

High-throughput phenotypic cell-based screening is highly scalable, but downstream target identification and mechanism-of-action (MoA) deconvolution are notoriously slow. Traditional methods like SILAC affinity pull-downs have low sensitivity and high background noise. Modern target identification requires integrating advanced computational approaches.

By mapping compounds against BindingDB, ChEMBL, and BioSNAP, deep learning networks (e.g., DeepDTAGen) predict Drug-Target Interactions (DTI). ESM3 models the protein-small molecule interfaces at atomic resolution to evaluate thermodynamic binding coordinates in silico. The Virtual Cell model can project these binding profiles against CELLxGENE or Tabula Sapiens transcriptomics to predict off-target toxicity in renal or hepatic cells, while Graph Neural Networks (GNNs) identify hidden pathobiology.

Target Identification StepTraditional Methodology / ChallengeAI & Agentic Integration Strategy
Initial Target SearchHigh-noise biochemical pull-downs (SILAC)DeepDTAGen and DTI mapping across BindingDB/BioSNAP
Structural ValidationCostly X-ray crystallography or Cryo-EMESM3 atomic-resolution structural generation and interaction mapping
Toxicity / PolypharmacologyHigh-attrition in vivo animal testingVirtual Cell transcriptomic projection (CELLxGENE/TranscriptFormer)
Data SynthesisManual "Click Tax" database navigationDeterministic agent orchestration (Biomni, gget wrappers)

Finally, agentic platforms like Biomni automate data processing for genome-wide CRISPR screens. The agent parses raw screening logs, matches hits against proteomic data, queries NCBI and Ensembl via deterministic gget wrappers, and identifies high-probability therapeutic targets (e.g., DYRK1A or GSK3B) without manual "click tax" overhead.

“Biology is transitioning from an observational science to a discipline of systemic engineering. The future of therapeutics lies at the intersection of generative world models and deterministic agentic reasoning.”
“Navigating fragile biological databases requires strict determinism. If an AI agent incorrectly parses a proprietary file format or mixes genomic builds due to inconsistent metadata, the entire downstream analysis is compromised.”
SIMULATE A BIO-AGENT PROMPT

See the difference tool-augmented agentic workflows make

UNASSISTED LLM

Retrieving sequences from NCBI. Based on the analysis, the outbreak root date (TMRCA) is estimated to be approximately 1922. Standard Zaire ebolavirus strains show high sequence similarity, but specific antibody evasion profiles are unclear due to missing data.

AGENT WITH GGET / BIOMNI

Retrieved 1,248 complete, annotated sequences via gget virus. The pipeline successfully filtered for Zaire ebolavirus species, glycoprotein genes, human hosts, and isolation dates. TMRCA was computed as January 2014 (p < 0.001). Neutralizing epitope analysis confirms 99.4% conservation of the maftivimab binding site, with a minor mutation (G528C) detected in 3 isolates that requires functional assays.

Illustrative bio-agent query outputs. Real accuracy scales with tool integrations and sandbox parameter tuning.

STRATEGIC DIRECTIVES
Deterministic Infrastructure
Intelligence without guardrails is unsafe; API logs must be auditable.
Programmable Design
ESM3 shifts biology from observation to synthesis, enabling atomic-level in silico modeling.
Copilot Paradigm
Total automation is out of reach; human experts must guide search and validate paths.
Institutional Reform
Software funding must shift from publication counts to professional maintenance.
WORKS CITED & REFERENCES

EvolutionaryScale ESM3 Technical Report (2024). Multimodal Generative Protein Synthesis.

Luebbert et al., Anthropic Research (2025). BioMysteryBench & VirBench Evaluations.

Zhang & Yao (2026). Citation Selection vs Citation Absorption. arXiv:2604.25707.

Shreya Johri, Eliezer Van Allen et al. (2025). Systemic Evaluations of Agentic AI in Spatial & Epigenomic Modalities.

Elliot Hershberg, New Science (2024). Software Economics in the Life Sciences.

Chan Zuckerberg Biohub (2024–2025). CELLxGENE, TranscriptFormer & the Billion Cells Project.

Stanford Biomni Environment & Biomni-R0 Technical Release (2025).

gget virus: Programmatic API wrapper for NCBI viral database query normalization.

CONTEXT JAMMING // EDITORIAL DISPATCH
The AI-Biology Convergence · June 2026 · Live on contextjamming.com
This is a Context Jamming deep research dispatch on computational determinism in structural biological design.
Return to contextjamming.com

§ · Invoice No. 001 · The Build Ledger

The Ledger.

Filed · contextjamming.com

What a conservative mid-market digital agency would have quoted for the same scope, itemized against what this site actually cost. Agency numbers are the floor — not the premium brand-studio tier.

TIME

12 weeks

2 days

~42× faster

COST

~$150,000

~$300

~500× cheaper

TEAM

5-person agency

1 human + 3 models

Same deliverable

§ Itemized — what a mid-market agency SOW would have billed

Discovery · brand positioning · workshops40–80 hr$10,000
Design system · Figma tokens · 3 rounds60–120 hr$18,000
Wavesurfer audio carousel · single-track context60–100 hr$16,000
Dual lightbox systems · focus trap · keyboard30–50 hr$8,000
LLM product flows · streaming · state machine80–160 hr$26,000
Stripe · checkout · webhooks · env hardening40–80 hr$10,000
Editorial routes · 6 sub-pages · templates60–100 hr$14,000
Accessibility pass · aria · reduced-motion40–80 hr$10,000
QA · cross-browser · mobile matrix60–100 hr$14,000
Cross-publication rebrand · masthead + IA · 2026-04-2820–40 hr$6,000
Subtotal~700 hr$126,000
Project management · 18% overhead$24,000
Agency total — conservative floor~700 hr~$150,000
Actually spent · Claude + Gemini stack~20 hr~$300

Agency figure assumes ~700 billable hours at $200/hr blended, plus ~18% PM overhead — the conservative floor of a mid-market SOW. Premium brand studios would have quoted 2–3× that. Stack: Antigravity (orchestrator), Claude Opus 4.8 (auditor), Codex (adversary), Cloudflare Workers / OpenNext.

§   Colophon

How this site is made.

Vol. 26 · build log

Every page on contextjamming.com is the output of a real-time, three-body Mixture-of-Experts loop. One model orchestrates. Two consult. The human holds the thesis. No single model commits alone.

View Redesign Assessment →

Orchestrator

Antigravity

Google DeepMind

  • Primary author
  • Terminal-native, direct push to Cloudflare
  • Audit trail to GitHub on every commit
  • Adaptive thinking · effort: extra-high

Auditor

Claude Opus 4.8

1M context

  • Editorial critic
  • Code review before merge
  • Backup-of-record
  • Co-signs every commit

Adversary

Codex

Cross-model MoE

  • Factual adjudication
  • Structural dissent
  • Deep Research → semantic triples
  • Caught the Donelan incident

Stack

Next.js
16.2 · App Router
React
19.2
TypeScript
5
Tailwind
v4 · @theme inline
@opennextjs/cloudflare
adapter
wrangler
Pages deploy
framer-motion
transitions
wavesurfer.js
audio waveforms

Typeset in

Fraunces
variable · opsz + SOFT
Playfair Display
debate display
IBM Plex Mono
editorial metadata
Geist Mono
utility mono
Caveat
grease-pencil marginalia
All via
next/font/google
Palette
single @theme block
No dupe tokens
ever

Infrastructure

Deploy
Cloudflare Workers / OpenNext
ISR
30-min revalidate · Cloudflare-served
Repo
github.com/BretKerrAI/founderfile
Branch
main
Analytics
Google Tag Manager
Apex
contextjamming.com
Runtime
Node 24
Build tool
Turbopack
       human intent
            │
            ▼
   ┌────────────────────┐         ┌─────────────────┐
   │    Antigravity     │  ◄────► │ Claude Opus 4.8 │      ← auditor loop
   │    (orchestrator)  │         │     (auditor)   │
   └─────────┬──────────┘         └─────────────────┘
             │  ◄───────────┐
             ▼              │
       ┌──────────┐    ┌────┴───────┐
       │Cloudflare│    │   Codex    │          ← adversarial loop
       │ Workers  │    │            │
       └─────┬────┘    └────────────┘
             │
             ▼
       contextjamming.com
             │
             ▼
       ┌──────────────┐
       │   Git push   │         ← audit trail
       └──────────────┘
Assembled on Mac in Terminal · Filed from Franklin, MAContext Jamming · ACRA Insight LLC · MIT License · FounderFile.ai · RelationalIntelligence.xyz · Commission a Dispatch →