Teutonic adjusts evaluation delta, explores larger models
The team reduced the evaluation delta to 0.0005 (1/N samples) to address rapid progression, which was limiting training depth on 8B models. The current 8B model appears saturated after three days, and discussion pivoted to larger architectures, including 27B and 31B models or an experimental Quasar-style 24B model with 8B experts. A new king was crowned (Teutonic-XXIV), but the team notes that the progression may have come too early.
- Evaluation delta reduced to 0.0005 to slow model progression
- 8B model hitting saturation; team considering 27B–100B alternatives
- Exploring Quasar architecture (24B with 8B experts and loop transformer)
- Conviction feature under core design review for long-term implementation
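
For a rough sense of what a delta of 0.0005 (1/N samples) means, the arithmetic can be sketched as below. The only part taken from the summary is the relation delta = 1/N, which implies N = 2000 evaluation samples. The `dethrones` rule, its name, and the idea that the delta is a minimum margin on an average loss are assumptions made for illustration; the summary does not say how Teutonic applies the delta.

```python
def samples_for_delta(delta: float) -> int:
    """Return the N for which delta == 1/N, rounded to the nearest integer."""
    return round(1.0 / delta)


def dethrones(king_loss: float, challenger_loss: float, delta: float) -> bool:
    """Hypothetical rule (an assumption, not from the source): the challenger
    becomes king only if its loss is lower than the king's by more than delta."""
    return (king_loss - challenger_loss) > delta


DELTA = 0.0005  # value reported in the summary

if __name__ == "__main__":
    print(samples_for_delta(DELTA))
    print(dethrones(king_loss=2.0, challenger_loss=1.999, delta=DELTA))
    print(dethrones(king_loss=2.0, challenger_loss=1.9998, delta=DELTA))
```

Under this assumed rule, an improvement of 0.001 would pass the threshold while an improvement of 0.0002 would not.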
Distilled from 96 team messages in the official Bittensor Discord. Generated by Claude Haiku 4.5.
View original messages
- Discord message 1499601746052517958
- Discord message 1499601830949163099
- Discord message 1499614087473135737
- Discord message 1499684361342287932
- Discord message 1499684485556469951
- Discord message 1499689279071322243
- Discord message 1499689325606993920
- Discord message 1499689481752543303
- Discord message 1499695686164156517
- Discord message 1499698441046065173
- Discord message 1499721032599343135
- Discord message 1499752816359506061
- Discord message 1499754182507368588
- Discord message 1499759747077898492