Jarrod Barnes

I'm a researcher and founder working on open-ended scientific discovery.

I run Dynamical Systems, where I build environments, evaluations, and verification systems that make open-ended scientific work trainable.

Research threads

Scientific discovery
Training scientific judgment
Verified campaign environments convert search, trust, escalation, and revision into a multi-turn RL problem with physics-grounded oracle reward.

Verification
Scaling test-time verification for novel materials
Probe-gradient guidance extracts band-gap signal from an unconditional crystal diffusion model and steers sampling without retraining.

Post-training
Self-improving pretraining as a substrate for agentic post-training
I adapted self-improving pretraining, interleaved-thought SFT, and RL mid-training to Qwen3-0.6B-Base.

Interpretability
Do language models know when to change their mind?
Signal detection framework for LLM belief revision across six open-weight models.

Learning systems
Continual learning for production agents
ATLAS converts production agent trajectories into inference-time learning and on-policy distillation loops.

More writing

A Theory On Becoming an Expert
Learning systems / May 4, 2026

Rethinking Evaluation for Agents That Never Stop Learning
Evaluation / Jan 13, 2026

Building a World Model of Consequence
Learning systems / Nov 19, 2025

My Agents Keep Failing. Yours Will Too.
Learning systems / Jul 16, 2025

Everything is Changing...Again
Learning systems / Apr 30, 2025