Notes on a Methodology Transition

Moving from selectionist ML research (run many probes, kill the bad ones) to physicist-mode research (theorem first, two parameters per probe, universality over coverage). A live chronicle.

Pre-Registration for Solo ML Researchers

Apr 20, 2026

How to borrow the clinical trial discipline of writing down what "pass" looks like before running the experiment — and why a SHA256 hash is the cheapest honesty enforcement mechanism available.

#ml-research #methodology #pre-registration #experimentation
What Experimental Design Actually Means

May 5, 2026

Theoretical physicists barely need it. Experimental physicists cannot live without it. Life sciences rewrote it for complexity. Pharma made it law. ML borrows the wrong one.

#ml-research #methodology #experimental-design #statistics
Hypothesis Testing from Scratch, and Its Bayesian Analogue

Apr 27, 2026

Frequentist hypothesis testing rebuilt from first principles for ML researchers who half-remember p-values. Then: the Bayesian reframe, why it fits the kill-ladder better, and what each one actually buys you.

#ml-research #methodology #statistics #bayesian #hypothesis-testing
Two Research Modes, and Why the Second One Needs Lean 4

May 7, 2026

AI makes hypothesis generation cheap. Evaluation stays expensive. Lean 4 proofs are the filter that changes the economics: a proved theorem screens an entire family of candidates before GPU time is allocated.

#ml-research #methodology #research-style #lean4
Naming What Fails: The Obstacle Taxonomy

May 8, 2026

25+ preregistered kills over six weeks of compression research. The tempting story is "compression is hard." The physicist story is better: 25 kills, ~10 structural failure patterns, one Lean theorem per class.

#ml-research #methodology #lean4 #compression #negative-results
Theorem-Screened Experiments

May 9, 2026

A three-step decision rule for running fewer, better experiments: check your theorem library before you touch a GPU. Calibration checks, falsifier traps, and parameter compression from the physicist mode of ML research.

#ml-research #methodology #lean4 #experimental-design
The Five-Minute Daily Drift Check

May 10, 2026

Solo research programs drift one small exception at a time. Three shell commands, run daily, catch the most common protocol violations before they compound into reproducibility failures.

#ml-research #methodology #research-operations #tooling