Distil SN97: Scoring Rules, Goodhart Fix, Six Kings
Arbos detailed the 17-axis composite scoring system (covering math, code, reasoning, distillation KL, instruction-following, and probes) that ranks Qwen 4B distillations against a teacher model. A new auto-dethrone canary gate now strips protection from any king that underperforms on held-out benchmarks (GSM8K, HumanEval, BBH, IFEval) for 2+ rounds, to address Goodhart-style overfitting. The crown passed through six kings in 24 hours (UID 207 → UID 228 → UID 123 → UID 208 → UID 194 → UID 238), with the reasoning and probe axes as the common limiting factors.
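As a rough illustration of the canary gate described above, the Python sketch below counts rounds in which a king falls below a held-out baseline and withdraws protection after two of them. The `CanaryGate` class, the baseline numbers, the rule that one benchmark below baseline fails the whole round, and the requirement that failing rounds be consecutive are all assumptions made for illustration; the summary only states that protection is removed after 2+ underperforming rounds.

```python
from dataclasses import dataclass

# Held-out benchmarks named in the summary.
HELD_OUT = ("GSM8K", "HumanEval", "BBH", "IFEval")


@dataclass
class CanaryGate:
    """Hypothetical tracker for the auto-dethrone canary gate."""

    baseline: dict[str, float]   # assumed per-benchmark baseline scores
    max_bad_rounds: int = 2      # "2+ rounds" from the summary
    bad_rounds: int = 0          # consecutive failing rounds seen so far

    def update(self, scores: dict[str, float]) -> bool:
        """Record one round; return True while the king keeps protection."""
        # Assumption: a round fails if any held-out benchmark is below baseline.
        failed = any(scores[name] < self.baseline[name] for name in HELD_OUT)
        # Assumption: failing rounds must be consecutive, so a pass resets the count.
        self.bad_rounds = self.bad_rounds + 1 if failed else 0
        return self.bad_rounds < self.max_bad_rounds


# Illustrative numbers only, not real benchmark results.
gate = CanaryGate(
    baseline={"GSM8K": 0.60, "HumanEval": 0.40, "BBH": 0.50, "IFEval": 0.55}
)
good = {"GSM8K": 0.62, "HumanEval": 0.41, "BBH": 0.52, "IFEval": 0.56}
bad = {"GSM8K": 0.55, "HumanEval": 0.41, "BBH": 0.52, "IFEval": 0.56}

history = [gate.update(scores) for scores in (good, bad, bad)]
print(history)  # [True, True, False]
```

In this sketch the king stays protected through the first weak round and loses protection on the second; a single passing round in between would reset the counter.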
- The 17-axis composite score ranks models by their worst (min) axis at a >3% threshold, falling back to the weighted mean.
- A new auto-dethrone canary gate removes king protection if held-out benchmark scores stay below baseline for 2+ rounds.
- Suggested training strategy: combine KL distillation with reasoning/chain-of-thought, code, and instruction-following data, and avoid public datasets.
- Schema v28 is live; the dashboard now plots the held-out trend against composite.worst and shows a Goodhart divergence warning.
- The reasoning axis is the current bottleneck across competing models; chat_turns_probe and judge_probe are also limiting.
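One possible reading of the worst-axis rule in the first bullet can be sketched in Python as follows. The function names, the three-axis score dictionaries, the equal weights, and the interpretation of ">3%" as a relative margin on the worst axis are assumptions for illustration; the summary does not spell out the exact comparison or the real axis weights.

```python
def worst_axis(scores: dict[str, float]) -> float:
    """Composite 'worst' value: the minimum score across all axes."""
    return min(scores.values())


def weighted_mean(scores: dict[str, float], weights: dict[str, float]) -> float:
    """Weighted mean across axes; axes without a weight default to 1.0."""
    total = sum(weights.get(axis, 1.0) for axis in scores)
    return sum(s * weights.get(axis, 1.0) for axis, s in scores.items()) / total


def challenger_wins(
    king: dict[str, float],
    challenger: dict[str, float],
    weights: dict[str, float],
    margin: float = 0.03,
) -> bool:
    """Assumed rule: decide on the worst axis if it differs by more than
    `margin` (relative to the king), otherwise fall back to the weighted mean."""
    k_worst, c_worst = worst_axis(king), worst_axis(challenger)
    if abs(c_worst - k_worst) > margin * k_worst:
        return c_worst > k_worst
    return weighted_mean(challenger, weights) > weighted_mean(king, weights)


# Illustrative scores on three of the axes; not real leaderboard data.
king = {"math": 0.70, "code": 0.65, "reasoning": 0.50}
clear_winner = {"math": 0.68, "code": 0.66, "reasoning": 0.56}
near_tie_better = {"math": 0.75, "code": 0.65, "reasoning": 0.505}
near_tie_worse = {"math": 0.60, "code": 0.65, "reasoning": 0.505}

results = [
    challenger_wins(king, c, weights={})
    for c in (clear_winner, near_tie_better, near_tie_worse)
]
print(results)  # [True, True, False]
```

The first challenger wins outright because its weakest axis beats the king's by more than the margin; the other two are within the margin on the worst axis, so the weighted mean decides.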
Distilled from 21 team messages in the official Bittensor Discord. Generated by Claude Haiku 4.5.
Original messages:
- Discord message 1498639040172523580
- Discord message 1498639041325961269
- Discord message 1498646715501117552
- Discord message 1498646716797288448
- Discord message 1498649611873685618
- Discord message 1498650106495500288
- Discord message 1498650459177615461
- Discord message 1498665955428143115
- Discord message 1498752469956559030
- Discord message 1498752470715727903
- Discord message 1498770386479874199
- Discord message 1498772422562677046
- Discord message 1498773077146734772
- Discord message 1498773837217796118