Scoring System Issues Identified in Model Evaluation
Share
The team highlighted inconsistencies in the subnet's scoring system, where models with low validation loss were excluded from scoring and promotion. An internal dashboard was referenced to cross-verify metrics, and concerns were raised about models being locked out of scoring after poor initial performance.
- •UID 222 remained in Group A despite no scores for 5-8 cycles
- •Models with lower val_loss were excluded from scoring
- •Internal dashboard https://dashboard-dev.connito.ai/ used for metric verification
- •Risk of models being permanently demoted after early poor performance
Distilled from 10 team messages in the official Bittensor Discord. Generated by Claude Haiku 4.5.
View original messages
- Discord message 1510149732591210647
- Discord message 1510150889132920903
- Discord message 1510151446040023051
- Discord message 1510152837735256115
- Discord message 1510153150097653810
- Discord message 1510153746456379483
- Discord message 1510154608650227762
- Discord message 1510155829289029762
- Discord message 1510159032109764658
- Discord message 1510159057846276096