[2606.02823] Qift: Shift-Friendly No-Zero W2 Post-Training Quantization for Rotated W2A4/KV4 LLM Inference

Read full story on arxiv.org
Share
[2606.02823] Qift: Shift-Friendly No-Zero W2 Post-Training Quantization for Rotated W2A4/KV4 LLM Inference
AI disclosure

Summary

Abstract page for arXiv paper 2606.02823: Qift: Shift-Friendly No-Zero W2 Post-Training Quantization for Rotated W2A4/KV4 LLM Inference

Original reporting

Open original source
Read full article on arxiv.org