Debanjan Basu

Research Engineer / ML Systems Engineer · LLM Infrastructure · Observability

debanjan.basu.ds@gmail.com · Berlin, Germany
LinkedIn · GitHub · HuggingFace · Medium

CV — Engineering (PDF) CV — Research (PDF)

Engineering profile → Research profile →

Profile

ML systems engineer with independent research bridging post-training methods, formal verification, and empirical methodology for safety-relevant claims. LoRA-DPO scaling work on Pythia 70M–1B documents geometry–behaviour decoupling: γ-overlap doesn't predict reward margin, suggesting behavioral interventions don't necessarily reorganize underlying representations — a structural finding with implications for evaluating alignment techniques. Lean 4 invariants formalize the empirical scaling (74 theorems across two papers, all in Mathlib). Pre-registered adversarial validation methodology — trap-cell design, kill-fast sequential testing — applied across MoE compression rungs. Six years production ML engineering at Nexern: LLM agent observability with Arize Phoenix, distributed pipelines, deployment infrastructure. Physics background (IISER Kolkata BS-MS; doctoral research under Peter Blöchl at TU Clausthal).

Experience

Senior Python Engineer (GenAI & Data) Aug 2020 – Jun 2026

Nexern GmbH, Berlin

Built in-house HR platform in 3 months — multi-entity, multi-jurisdiction across 5 EU/EMEA countries; Django/PostgreSQL, ADR + spec-driven sprints, invariants enforced across backend, frontend and the database
Built LLM agent observability with Arize Phoenix; trajectory analysis for debugging agent failures in production
Achieved 5–10× pipeline speedups via Dask distributed processing, Celery task queues, and vectorized operations
Led greenfield GenAI agent projects: LLM-based prediction pipelines and automated web data extraction
Built and maintained Django data platform; large-scale JSON ingestion and web crawlers
Deployed agents with Docker on VPS; S3/MinIO storage; GitLab CI/CD

Data Scientist May 2018 – Jul 2020

KUGU Home GmbH, Berlin

Developed physics-informed heat models (Fourier equation) and time series forecasting with TensorFlow/XGBoost
Built online changepoint detection for anomaly detection in IoT boiler systems
Co-developed IoT pipeline for hundreds of devices; deployed to OpenStack via Ansible
Ensured GDPR compliance in data handling workflows

Doctoral Researcher (Physics) Aug 2012 – Oct 2016

TU Clausthal (Institute for Theoretical Physics) & University of Göttingen

Supervisor: Prof. Peter E. Blöchl

Investigated phonon dynamics and thermoelectric transport via classical molecular dynamics
Extended MD codebase (Fortran/C): force constant extraction, phonon bandstructure calculation, thermal transport properties
Published in Physica Status Solidi A (DOI: 10.1002/pssa.201532488)
Tutored ab-initio electronic structure methods

Skills

ML Systems / Infrastructure

Python, Django, Docker, Kubernetes, AWS (S3, EC2), GitLab CI/CD, Ansible, PostgreSQL, S3/MinIO

LLM Observability / Evals

Arize Phoenix, trajectory analysis, agent debugging, TensorFlow, scikit-learn, XGBoost

Distributed Data

Dask, Celery, vectorised ops, large-scale JSON ingestion, web crawlers

Research & Formal Methods

PyTorch, HuggingFace Transformers/Accelerate, LoRA/PEFT, quantization (GPTQ, INT4/INT8), SVD/spectral methods, CUDA, Fortran, C, Rust

Research

Independent research, Oct 2025 – present.

Phase-Collapse Defragmentation: Provable Bounds on 1-bit KV-Cache Quantization in MoE

2026 · preprint microsite with Lean source

Learned orthogonal rotation lifts moment-ratio cosine β from SRHT floor (0.80) to 0.92–0.97 across four architectures (Gemma-4 e2b/e4b/26B-MoE, DeepSeek-MoE-16B, OLMoE-1B-7B)
Discovered Stiefel frustration: MoE expert banks resist single-rotation alignment at β ≈ 0.83
74 Lean 4 theorems proved (Mathlib); includes convergence bounds and quadratic last-mile hardness

Microsite Repo

Verified Neural Compilation: Formal Symmetries and Impossibility Boundaries

2026 · preprint microsite with Lean source

Machine-checked proofs of RoPE commutant classification and block-diagonal frequency boundaries in Lean 4.
Formalizing zero-cost compilation invariants and proving the impossibility of key-only routing under standard MoE interfaces.

Microsite Repo

RoPE-Provenance: Subspace-Split Positional Encodings for Token-Role Auditing

2026 · preprint microsite with Lean source

Allocating a low-frequency subspace of RoPE positional encodings to serve as an out-of-band role channel.
Applied to SmolLM2-135M to dynamically isolate instruction execution from untrusted data payloads.
Pre-registered benchmarks evaluate selectivity (SEP scores) and instruct compliance under adversarial injection.

Microsite Repo

Other Research

Falsifying LoRA Alignment Geometry: srank as Overfitting Signature in DPO Fine-Tuning

2026 · preprint microsite with Lean source

Stable rank ≈ 3.6 floor across 4× width scaling under fixed LoRA-DPO recipe
Geometry–behavior decoupling: γ-overlap does not predict reward margin
Lean-verified: stableRank_smul_invariant, rsLoraUpdate_frob_bounded
All adapter checkpoints released on HuggingFace (~1.9 GB)

Microsite Repo HuggingFace

A Survey of Symmetries Compression Must Respect

2026 · in progress

Catalog of transformer weight symmetries (RoPE-commuting rotations, sign/phase gauges, parabolic stabilizers) with Lean 4 companion

Microsite

The Compression Falsification Ladder — empirical methodology

Pre-registered protocols: SHA-locked configs, trap-cell adversarial validation, τ-hardened random baselines, kill-fast sequential design
Applied to 10+ compression rungs on OLMoE-1B-7B; 7 clean kills, 1 deepen-strict result (per-channel INT4), 1 impact-rung in flight

Publications

Peer-reviewed

Basu, D. & Blöchl, P.E. (2016). Phonon dynamics and thermoelectric transport in thermoelectric materials. Physica Status Solidi A. DOI: 10.1002/pssa.201532488

Preprints and research artifacts (2026)

Basu, D. Phase-Collapse Defragmentation: A Moment-Ratio Framework for 1-Bit KV-Cache Quantization in MoE Transformers. Microsite with Lean source.

Basu, D. Low Stable-Rank Structure in LoRA-DPO Adapters on Pythia 70M–1B: Empirical Scaling and Formal Invariants. Microsite with Lean source.

Basu, D. RoPE-Provenance: Subspace-Split Positional Encodings for Token-Role Auditing. Microsite with Lean source.

Basu, D. Verified Neural Compilation: Formal Symmetries and Impossibility Boundaries. Microsite with Lean source.

Education

Doctoral research (not completed) 2012 – 2016

University of Göttingen / TU Clausthal

B.S.–M.S. in Physics 2007 – 2012

Indian Institute of Science Education and Research (IISER) Kolkata

Languages

English (fluent) · German (B1) · Hindi (native) · Bengali (native)

Links

GitHub: d3banjan HuggingFace: d3banjan Paper: KV-Cache Quantization Paper: LoRA-DPO Stable Rank Paper: Symmetry Survey Paper: RoPE Provenance Paper: Verified Neural Compilation Methodology: Compression Falsification Ladder