feat: TQ4_1S weight compression (Metal only, needs CUDA port) by TheTom · Pull Request #45 · TheTom/llama-cpp-turboquant Comments By Sonic Mustang · April 4, 2026 · 1 min read Source: GitHub Comments