HOLY SMOKES! A new, 200% faster DeepSeek R1-0528 variant appears from German lab TNG Technology Consulting GmbH
3 July 2025 at 13:32

This gain is made possible by TNGโs Assembly-of-Experts (AoE) method โ a technique for building LLMs by selectively merging the weight tensorsRead More