![]() (at Grand Canyon, Arizona, US) |
Xikai Meng   孟西恺
I'm currently a Master of Engineering student in Computer Engineering at the
University of Washington, Seattle
(Sep 2025 – Dec 2026). Before that, I received my B.S. in Applied Physics from the
Honors School of Harbin Institute of Technology
(2020 – 2024).
|
[2025/09] Started my M.Eng. in Computer Engineering at the University of Washington, Seattle.
[2025/08] Started research collaboration with University of Waterloo on Eagle-3 + Quantization within the SGLang framework.
[2025/02] Started SD-HC project at AMD — speculative decoding on heterogeneous CPU/NPU/GPU AIPC platforms.
[2024/09] Joined AMD as a Machine Learning Engineer in the Model Compression Team (Beijing).
[2024/06] Graduated from the Honors School of Harbin Institute of Technology with a B.S. in Applied Physics.
[2024/06] Joined SenseTime as an HPC/AI System Research Intern in the LLM Team.
[2023/09] Joined Baichuan AI as an AI Infra Intern; contributed to the Clover speculative-decoding work.
|
University of Washington, Seattle, US (Sep 2025 – Dec 2026) M.Eng. in Computer Engineering
|
|
|
Harbin Institute of Technology, Harbin, China (Sep 2020 – Jun 2024) B.S. in Applied Physics, Honors School of HIT (minor in Information Engineering)
|
|
|
UC San Diego, San Diego, US (Sep 2022 – Mar 2023) Visiting Student, Department of Computer Science & Engineering
|
|
|
Advanced Micro Devices (AMD), Beijing, China (Sep 2024 – Sep 2025) Machine Learning Engineer, Model Compression Team
|
|
|
SenseTime, Beijing, China (Jun 2024 – Sep 2024) HPC / AI System Research Intern, LLM Team
|
|
|
Baichuan AI, Beijing, China (Sep 2023 – Mar 2024) AI Infra Intern, LLM Team
|
|
Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge.
Baichuan AI authors incl. X. Meng et al., 2024.
[paper]
Eagle-3 Quantization for Speculative Decoding. In submission, 2025.
SD-HC: Heterogeneous Functional Pipelining for Speculative LLM Decoding on AI PCs.
X. Meng, S. Fu, Z. Li, W. Wang, C. Li, S. Tiwari, P. Zheng.
In submission, 2025.
[code (coming soon)]
Y. Li, S. Yuan, X. Meng, Y. Wang, J. Li, "Security Research of Intelligent Image Recognition System," Tsinghua Summer Conference on Communications and Networking, 2021. []
|
Eagle-3 Quantization on SGLang — with University of Waterloo, Aug 2025 – Present
|
|
SD-HC: Heterogeneous Functional Pipelining for Speculative LLM Decoding on AI PCs — AMD, Feb 2025 – Aug 2025 | code (soon)
|
|
Open-source contributions to LLM inference stacks
|
|
Systems & algorithms practice projects
|
Self-studied a CS-major curriculum alongside my Physics degree — Stanford 106L (C++), Harvard CS50, UCSD CSE120 (OS), MIT 6.S081, CMU 15-445 (DB), MIT 6.824 (Distributed), Stanford CS144 (Networks), UCB CS61B, and Andrew Ng's Deep Learning specialization.
I enjoy reading widely and am good at summarizing.
Strong, sustained passion for technology — especially LLM systems.
I also love traveling, camping, and being close to nature.