#inference-optimization
1 bookmark tagged with "inference-optimization"
across 1 category: AI Models
Intel GPT-OSS 20B INT4 Quantized Model
HuggingFace • Aug 9, 2025 • AI Models
Intel's INT4-quantized version of GPT-OSS 20B, produced with the AutoRound technique using symmetric quantization and optimized for efficient inference on CPUs, Intel GPUs, and CUDA devices.
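For readers unfamiliar with the term, symmetric INT4 quantization can be illustrated with a minimal sketch. It assumes plain round-to-nearest over a single weight group and does not reproduce AutoRound, which tunes its rounding decisions rather than rounding naively. The function names and sample weights are hypothetical and are not taken from the model card.

```python
def quantize_symmetric_int4(weights):
    """Round-to-nearest symmetric INT4 quantization of one weight group.

    Illustrative only: AutoRound tunes its rounding, this sketch does not.
    """
    max_abs = max(abs(w) for w in weights)
    # Symmetric scheme: the zero-point is fixed at 0, so only a scale is kept.
    scale = max_abs / 7 if max_abs > 0 else 1.0
    # Signed 4-bit integers cover the range [-8, 7].
    codes = [max(-8, min(7, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map the 4-bit codes back to approximate floating-point weights."""
    return [c * scale for c in codes]


weights = [0.7, -0.33, 0.12, 0.0]  # hypothetical sample values
codes, scale = quantize_symmetric_int4(weights)
restored = dequantize(codes, scale)
print(codes, scale, restored)
```

Each weight is stored as a 4-bit code plus one shared scale per group, so the reconstruction error per weight is at most half the scale as long as no value is clipped.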