Robuta

https://arxiv.org/abs/2308.13137 [2308.13137] OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models