#model-compression
1 bookmark tagged with "model-compression"
across 1 category: AI Models
-
Intel GPT-OSS 20B INT4 Quantized Model
HuggingFace • Aug 9, 2025 • AI Models
Intel's INT4 quantized version of GPT-OSS 20B using AutoRound technique with symmetric quantization, optimized for efficient inference on CPU/Intel GPU/CUDA hardware.