Building AI agents, exploring cognitive science, unraveling bioinformatics

Creative engineering where philosophy meets technology

Latest Posts

AI Safety via Debate: How Adversarial Argumentation Solves RL's Hardest Problem

#ai-safety #reinforcement-learning #scalable-oversight #debate #alignmentCited

Reinforcement learning works when you can check the answer. A chess engine wins or loses. A code-generation model passes or fails the test suite.

Feb 20, 202616 min read

Inverse Graphics as RL Environments: Testing Whether VLMs Can Actually See

#reinforcement-learning #vision-language-models #inverse-graphics #benchmarks #spatial-reasoningCited

Vision-language models can describe a scene in paragraph-length detail and still fail to tell you whether a red cube is in front of or behind a blue cylinder.

Feb 20, 202612 min read

Socrates Was a Terrible Prompt Engineer (That's the Point)

#prompt-engineering #socratic-method #llm-reasoning #ai-philosophy #chain-of-thoughtCited

Six ancient questioning techniques map onto the most effective LLM prompting strategies with uncomfortable precision, and what that reveals about these models is more interesting than the performance gains.

Feb 11, 202620 min read

Making Science Machine-Readable: The Epistemological Challenge of Verifying Knowledge at Scale

#machine-learning #scientific-publishing #ai #knowledge-graphs #epistemologyCited

How do you verify scientific knowledge when there are 2.9 million papers on arXiv alone, with thousands more added every day? A new paper extracts nearly two million claims from 16,087 manuscripts and compares machine evaluation to human peer review, with 81% agreement.

Feb 4, 202619 min read

When AI Writes the Code, Verification Becomes the Job

#formal-verification #ai-code-generation #software-security #devops #llmCited

Over 80% of developers now use AI assistants for code generation, yet at least 62% of AI-generated code contains vulnerabilities. As AI writes code faster than humans can review it, the engineer's primary job shifts from writing code to verifying it through formal methods.

Feb 4, 202611 min read

The Historical Accident That Split Drug Design in Two (And the Contrastive Model That Reunites It)

#drug discovery #contrastive learning #computational biology #virtual screening #protein-ligand interactions #graph neural networks #machine learningCited

Structure-based and ligand-based drug design evolved as separate fields solving the same problem. ConGLUDe, a contrastive geometric learning model, unifies both approaches and outperforms specialist methods on realistic benchmarks without requiring pre-defined binding pockets.

Jan 30, 202612 min read

AlphaGenome: One Model for the Other 98% of Your DNA

#deep-learning #genomics #alphagenome #deepmind #variant-prediction #non-coding-dnaCited

Google DeepMind's AlphaGenome reads 1 million base pairs of DNA and predicts thousands of regulatory functions at single-nucleotide resolution, beating 25 of 26 specialized models.

Jan 29, 202612 min read

How AlphaGenome Tackles Variant Effect Prediction

#genomics #deep learning #variant effect prediction #alphagenome #computational biologyCited

AlphaGenome processes 1 million DNA base pairs to predict variant effects across 7,000+ genomic tracks in one second, outperforming specialized models on 25 of 26 VEP benchmarks.

Jan 29, 20268 min read

How AlphaGenome Models Gene Regulation: 2D Embeddings, Splicing, and the Race to Read Non-Coding DNA

#alphagenome #genomics #deep learning #splicing #computational biology #google deepmind #variant interpretationCited

A technical look at AlphaGenome's architecture, its 2D pairwise embeddings for splicing prediction, and what the model means for clinical variant interpretation.

Jan 29, 202618 min read

EDEN: 28 Billion Parameters for Programming Biology

#foundation models #computational biology #gene therapy #metagenomics #drug discovery #eden #basecamp researchCited

Basecamp Research's EDEN model trains on proprietary environmental metagenomics to design gene-insertion enzymes, antimicrobial peptides, and synthetic microbiomes -- all validated in the wet lab.

Jan 28, 202616 min read

A Bioinformatician's Guide to Choosing Genomic Foundation Models

#foundation models #genomics #bioinformatics #deep learning #protein language models #dna models #esm-2 #dnabert-2 #hyenadna #scgptCited

A practical guide to selecting genomic foundation models for bioinformatics tasks. Covers ESM-2, DNABERT-2, HyenaDNA, Nucleotide Transformer, scGPT, and Evo with specific recommendations for DNA sequence analysis, protein structure prediction, and single-cell analysis based on hardware requirements, inference speed, and task type.

Jan 19, 202622 min read

End-to-End Test-Time Training: Making Long Context Work Without the Memory Tax

#llm #long context #test-time training #machine learning #transformers #inference optimizationCited

How TTT-E2E achieves constant inference latency regardless of context length by treating long context as a learning problem rather than an architecture problem.

Jan 15, 202616 min read

Engram: How DeepSeek Added a Second Brain to Their LLM

#deep learning #llm architecture #memory #mixture of experts #deepseek #sparse computationCited

A technical deep dive into DeepSeek's Engram architecture, which introduces conditional memory as a new axis of sparsity for large language models.

Jan 13, 202618 min read

ChemBERTa: When Language Models Learned to Speak Chemistry

#machine learning #chemistry #transformers #drug discovery #molecular property predictionCited

How the same transformer architecture powering GPT learned to predict molecular properties by treating chemistry as a language problem

Jan 8, 202620 min read

MolBERT: Teaching Transformers to Read the Language of Chemistry

#machine learning #drug discovery #transformers #chemistry #smiles #bertCited

How researchers adapted BERT for molecular property prediction, turning SMILES strings into drug discovery insights

Jan 8, 202621 min read

View All 73 Posts