Bio: Haihao is a senior AI architect in DCAI/AISE at Intel, leading model quantization and efficient inference for LLMs on Intel platforms.

Description: the talk will give an overview of LLM model quantization and efficient inference based on Intel Extension for Transformers.

Add comment

Your email address will not be published. Required fields are marked *

Categories

All Topics