Replacing Density with Precision: Sparse Activation and Structural Efficiency in Diffusion Models
By Zach Albertson • Medium • Fri, 09 May 2025 18:17:04 GMT
A mid-sized diffusion model (DiT-B/8, ~130M parameters) built with NdLinear recently outperformed a much larger baseline (DiT-L/8, ~457M...