Guanghua Yu
Head of Large Model Compression Algorithms, Tencent Hunyuan
He is responsible for implementing and advancing compression algorithms for Tencent Hunyuan large models, including quantization, sparsification, and speculative sampling. With nearly 10 years of experience in artificial intelligence, he has extensive expertise in model compression and optimization and has authored more than 10 patents and papers. He led the team that built the AngelSlim large-model compression toolkit from scratch, supporting both internal deployment within Hunyuan and open-source model compression applications. He also developed proprietary large-model compression algorithms that now serve more than 60% of the company’s large-model use cases, reflecting a deep understanding of both business and technology.