Where Should AI Compress? Inside the Model or Before It Hits the Neural Net?
Understanding Muon: A Revolutionary Neural Network Optimizer
Hidden Redundancy in Self-Attention: Why Transformers Still Work with Less
So You Wanna Get Into Machine Learning?