[2507.23035] OASIS: Outlier-Aware LUT-Based GEMM with Dual-Side Quantization for LLM Inference Acceleration

Read full story on arxiv.org
Share
[2507.23035] OASIS: Outlier-Aware LUT-Based GEMM with Dual-Side Quantization for LLM Inference Acceleration
AI disclosure

Summary

Abstract page for arXiv paper 2507.23035: OASIS: Outlier-Aware LUT-Based GEMM with Dual-Side Quantization for LLM Inference Acceleration

Original reporting

Open original source
Read full article on arxiv.org