Quantization-Aware Training for Large Language Models with PyTorch (2024)