Oro Reasoning Judge Updated, Validators Fixed
Oro shipped a refinement to its reasoning-quality scoring today: a clearer 5-bin structure and a refreshed judge model pool (GLM-5.1, GLM-5, gemma-4-31B-turbo, Kimi-K2.5), intended to improve score reproducibility and judge availability. Validators went down mid-day but were restored the same day. Allowlisted models will update after today's race, and miners will soon be able to pin specific agent versions for races. Multiple miners reported local test environment failures and permission errors after pulling the new Docker/Git repos.
- Reasoning judge uses 5-bin scoring (0.1–0.9) tied to specific evidence criteria
- Judge model pool refreshed for tighter cross-model agreement and availability
- Allowlisted models updating after today's race; agent pinning feature coming soon
- Local sandbox test failures reported; validator downtime fixed same day
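To make the 5-bin idea concrete, here is a minimal Python sketch of how a raw judge score could be snapped to one of five bins and combined across several judge models. It is an illustration only, not Oro's implementation: the digest gives the range (0.1–0.9) and the bin count but not the bin values, so the evenly spaced values in `BINS` are an assumption, and the median-based `aggregate` function and both function names are hypothetical.

```python
from statistics import median

# ASSUMPTION: five evenly spaced bin values spanning 0.1-0.9.
# The source states only the range and the number of bins.
BINS = (0.1, 0.3, 0.5, 0.7, 0.9)


def snap_to_bin(raw_score: float) -> float:
    """Return the bin value closest to raw_score."""
    return min(BINS, key=lambda b: abs(b - raw_score))


def aggregate(judge_scores: list[float]) -> float:
    """Hypothetical aggregation: bin each judge's score, then take the
    median of the binned scores and snap it back onto a bin."""
    if not judge_scores:
        raise ValueError("need at least one judge score")
    binned = [snap_to_bin(s) for s in judge_scores]
    return snap_to_bin(median(binned))
```

Restricting scores to a few discrete values is one way a scheme like this could make scores easier to reproduce, since small differences between judge models collapse into the same bin.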
Distilled from 36 team messages in the official Bittensor Discord. Generated by Claude Haiku 4.5.