4.2 Quantization Methods

Apply at least two of the following quantization methods: - **GPTQ** (4-bit, 128 group size): Use the `auto-gptq` library with a calibration dataset of 128-256 examples from your training data. - **AWQ** (4-bit): Use the `autoawq` library. Compare with GPTQ on quality and speed. - **GGUF** (for llam