| Age | Commit message | Author | |
|---|---|---|---|
| 30 hours | TODO: 8-bit weight quantization for 2× size reduction | skal | |
| | - Add QAT (quantization-aware training) notes - Requires training with fake quantization - Target: ~1.6 KB weights (vs 3.2 KB f16) - Shader unpacking needs adaptation (4× u8 per u32) | | |
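
The storage layout named in the commit body (4× u8 per u32) can be sketched as follows. This is only an illustration under stated assumptions: the affine min/max scheme, the low-byte-first lane order, and all function names are hypothetical and not taken from the repository. It quantizes after the fact and does not cover the fake-quantization training that the TODO calls for.

```python
def quantize_u8(weights):
    """Affine-quantize floats to codes in 0..255 (assumed scheme).

    Returns (codes, scale, w_min) so the weights can be reconstructed.
    """
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / 255.0 or 1.0  # fall back to 1.0 for constant input
    codes = [max(0, min(255, round((w - w_min) / scale))) for w in weights]
    return codes, scale, w_min


def pack_u8x4(codes):
    """Pack four u8 codes into each u32, lowest byte first (assumed order).

    The tail is zero-padded to a multiple of four codes.
    """
    padded = codes + [0] * (-len(codes) % 4)
    return [
        padded[i] | padded[i + 1] << 8 | padded[i + 2] << 16 | padded[i + 3] << 24
        for i in range(0, len(padded), 4)
    ]


def unpack_u8x4(words, count):
    """Inverse of pack_u8x4: shift and mask one byte lane per weight index."""
    return [(words[i // 4] >> (8 * (i % 4))) & 0xFF for i in range(count)]


def dequantize_u8(codes, scale, w_min):
    """Map u8 codes back to approximate float weights."""
    return [c * scale + w_min for c in codes]
```

With one byte per weight instead of the two bytes of an f16, N weights take N bytes rather than 2N, which matches the 3.2 KB to ~1.6 KB target if the network has roughly 1600 weights (an inference from the sizes quoted, not a figure stated in the commit). The shift-and-mask in `unpack_u8x4` is the operation the shader would need to perform per weight before applying `scale` and `w_min`.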
