Validator v0.6.0 live with new scoring system
Validator v0.6.0 is deployed across all 7 validators with an overhauled scoring system: 3 Terminal-Bench scenarios (cancel-async-tasks, break-filter-js-from-html, log-summary-date-ranges), each scored in [0,1], for a final score in [0,3]. Miners must migrate to inline container execution; legacy packs targeting the old codebases and SSH sandbox operations are obsolete. A local testing harness is now available via eval_pack.py. Reported operational issues: evaluation halted mid-epoch, old packs still being scored, and unresponsive team communication.
- 3 new scenarios replace legacy codebase_fix/morning_brief/incident_response
- Score formula: per-scenario pass rate summed [0,3], no learning bonus
- Agents run inline in per-scenario containers; scp/SSH sandbox obsolete
- Local test harness available; ~10 GB disk required for image pulls
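As a rough illustration of the score formula above, the sketch below sums a per-scenario pass rate over the three scenarios. It is not the validator's code: the function names, the input shape (a list of pass/fail results per scenario), and the choice to score a missing or empty scenario as 0 are all assumptions made for the example.

```python
from typing import Mapping, Sequence

# Scenario names as listed in the summary above.
SCENARIOS = (
    "cancel-async-tasks",
    "break-filter-js-from-html",
    "log-summary-date-ranges",
)


def scenario_score(results: Sequence[bool]) -> float:
    """Pass rate in [0, 1]. Assumption: a scenario with no results scores 0."""
    if not results:
        return 0.0
    return sum(results) / len(results)


def final_score(results_by_scenario: Mapping[str, Sequence[bool]]) -> float:
    """Sum of per-scenario pass rates, in [0, 3]. No learning bonus is added."""
    return sum(
        scenario_score(results_by_scenario.get(name, ()))
        for name in SCENARIOS
    )


if __name__ == "__main__":
    # Hypothetical results: two passes, one pass out of two, and no runs.
    example = {
        "cancel-async-tasks": [True, True],
        "break-filter-js-from-html": [True, False],
        "log-summary-date-ranges": [],
    }
    print(final_score(example))
```

Under these assumptions, a pack that passes every run in all three scenarios reaches the maximum of 3, and each scenario contributes at most 1 regardless of how many runs it has.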
Distilled from 38 team messages in the official Bittensor Discord. Generated by Claude Haiku 4.5.
View original messages
- Discord message 1501049970571214920
- Discord message 1501071470984429770
- Discord message 1501080480492159106
- Discord message 1501121984036343929
- Discord message 1501203393840939028
- Discord message 1501203630936559778
- Discord message 1501207010883014747
- Discord message 1501207335614414970
- Discord message 1501221110107214035
- Discord message 1501228428798595123
- Discord message 1501228469642727516
- Discord message 1501228764070281438
- Discord message 1501228808857063425
- Discord message 1501229086113267762