← All tags

Posts tagged #lean4

Theorem-Screened Experiments

May 9, 2026

A three-step decision rule for running fewer, better experiments: check your theorem library before you touch a GPU. Calibration checks, falsifier traps, and parameter compression from the physicist mode of ML research.

#ml-research #methodology #lean4 #experimental-design
Naming What Fails: The Obstacle Taxonomy

May 8, 2026

25+ preregistered kills over six weeks of compression research. The tempting story is "compression is hard." The physicist story is better: 25 kills, ~10 structural failure patterns, one Lean theorem per class.

#ml-research #methodology #lean4 #compression #negative-results
Two Research Modes, and Why the Second One Needs Lean 4

May 7, 2026

AI makes hypothesis generation cheap. Evaluation stays expensive. Lean 4 proofs are the filter that changes the economics: a proved theorem screens an entire family of candidates before GPU time is allocated.

#ml-research #methodology #research-style #lean4
Zero-Sorry Discipline: What a Lean 4 Appendix Actually Costs

May 6, 2026

Two theorems in this paper — MoEGauge and JensenFloor — had to reach zero sorries before the paper shipped. What that process looks like, why sorry is dangerous, and what JensenFloor actually says.

#ml-research #lean4 #formal-verification #methodology
A Survey as a Living Document

May 3, 2026

What it means to maintain a formal proof corpus that stays in sync with its own coverage badge, and what happened when RWKV was added as a new architectural family without invalidating existing theorems.

#ml-research #lean4 #formal-verification #RWKV #methodology
Lean 4 as a Soundness Oracle for Security Properties

May 2, 2026

Type stubs catch misuse at type-check time. But can we prove they are sound — that a well-typed program cannot trigger the dangerous code path? Enter Lean 4.

#security #lean4 #formal-verification #python #type-theory
The Microsite as Interactive Publication

Apr 24, 2026

Building a GitHub Pages research microsite with d3 widgets, a Lean 4 theorem status page, and a reproducibility shim. One gotcha with Jekyll and markdown-inside-divs. One real answer to whether live widgets are worth the effort.

#ml-research #d3 #jekyll #lean4 #reproducibility #visualization