Research

ML research on neural network compression, formal verification with Lean 4, and empirical methodology that can falsify its own claims. All experiments run on RTX 3060 12GB — hardware constraints are the motivation, not the limitation. Current focus: MoE weight quantization with machine-checked invariants. Research supplements and informs engineering work on LLM agent infrastructure.

Papers

Phase-Collapse Defragmentation: Provable Bounds on 1-bit KV-Cache Quantization in MoE

Moment-ratio gauge construction that determines — from weight statistics alone, before inference — whether 1-bit KV-cache quantization is achievable for a given layer. MoEGauge and JensenFloor theorems proved in Lean 4 with zero sorries. Evaluated on OLMoE-1B-7B.

Microsite + Lean source Blog series

Verified Neural Compilation: Formal Symmetries and Impossibility Boundaries

Machine-checked proofs of RoPE commutant classification and block-diagonal frequency boundaries in Lean 4. Formalizing zero-cost compilation invariants and proving the impossibility of key-only routing under standard MoE interfaces.

Microsite + Lean source Blog series

RoPE-Provenance: Subspace-Split Positional Encodings for Token-Role Auditing

Allocating a low-frequency subspace of RoPE positional encodings to serve as an out-of-band role channel. Applied to SmolLM2-135M to dynamically isolate instruction execution from untrusted data payloads. Pre-registered benchmarks evaluate selectivity (SEP scores) and instruct compliance under adversarial injection.

Microsite + Lean source Blog series

Other Research

Falsifying LoRA Alignment Geometry: srank as Overfitting Signature in DPO Fine-Tuning

Pre-registered study of whether DPO fine-tuning leaves a geometrically distinct fingerprint in LoRA weight updates. Two claims killed: the orthogonal-complement hypothesis (directional violation) and cross-probe srank (length-bias artifact). Interactive microsite with d3 exploration widgets.

Microsite + Lean source Blog series

A Survey of Symmetries Compression Must Respect

Living catalogue of algebraic invariants — RoPE, SignPhase, BoundedArithmetic, DenseSparse, dormancy — that any compression scheme must preserve to avoid silent correctness failures. Lean 4 coverage across 152/161 symmetry claims.

Microsite + Lean source Blog series

Methodology

The empirical methodology used across these projects — pre-registration, kill ladders, τ-baselines, theorem-screened experiments — is documented as a standalone series. It borrows from clinical trial design and applies it to solo ML research. Start with Notes on a Methodology Transition or the Compression Falsification Ladder.

Blog series

The Geometry of MoE Compression (Part E)

Expanding on Phase-Collapse: β transfer in FFNs, formal Softmax soundness, and recovering dense performance from quantized layers.

The β-lift and FFN Transfer: MoE Compression Part E Apr 29, 2026
Formalizing the Softmax and Hessian: Lean 4 Soundness Proofs May 9, 2026
Dense Activation-Fit Recovery: Healing Quantized Layers May 16, 2026

View series →

Notes on a Methodology Transition

Moving from selectionist ML research (run many probes, kill the bad ones) to physicist-mode research (theorem first, two parameters per probe, universality over coverage). A live chronicle.

Pre-Registration for Solo ML Researchers Apr 20, 2026
What Experimental Design Actually Means May 5, 2026
Hypothesis Testing from Scratch, and Its Bayesian Analogue Apr 27, 2026
Two Research Modes, and Why the Second One Needs Lean 4 May 7, 2026
Naming What Fails: The Obstacle Taxonomy May 8, 2026
Theorem-Screened Experiments May 9, 2026
The Five-Minute Daily Drift Check May 10, 2026

View series →