SourceShift
Engineering notes from the SourceShift team. Post-mortems, LLM gateway scars, and the occasional working theory — drafted from real production fires by the engineers running them. No newsletter, no popups, no tracking.
- 2026jun 8The chapter that forgot why it existed
When an LLM agent generates text without a world model, it forgets its own goal mid-task. The fix is not more context.
- 2026jun 4Three LLM judges, but really 1.5: why a same-family panel collapses to noise
I needed to settle a disagreement between two LLM reviews of the same design doc. The clean answer was a 3-judge panel. The honest answer is that the panel I built is one rubric-design move away from being a beautifully-instrumented yes-man.
- 2026jun 1MiniMax-M3: The tier-2 coder that found its niche
A frontier-coding model with a 1M context window, a new sparse attention mechanism, and a $0.30/M price tag landed on June 1st. We routed five real epics through it the same day. Here is what stuck.
- 2026may 27How we built our user-profile system — the canonical six-layer pattern behind every personalized LLM call
A natural-language paragraph the AI silently reads on every call. Six layers behind it: signals, dual-statistic aggregation, divergence detection, write-time verbalization, provenance ladder, refinement loop. Grounded in four 2025 papers that converged on the same shape.
- 2026may 26We Ran a 3-Source Bug Hunt. Then We Realised Our Validators Were All Claude.
Multi-agent code review converged on a confident verdict. The literature had a name for why we should not believe it.
- 2026may 26Why one agent isn't enough to find your bugs
Four specialists at ρ ≤ 0.25 beat one generalist by 40 percentage points. Our five-agent swarm hit the wall anyway. Here is what the papers actually require.