Xuanteng Huang's homepage | pronunciation: Seant (/sɪnt/), Chinese: 黃軒騰
San Francisco, CA, United States, Jun. 2025

I am a third-year Ph.D. student in CSE at Sun Yat-sen University, advised by Assoc. Prof. Xianwei Zhang and Prof. Nong Xiao. My research mainly focuses on leveraging GPUs in machine learning systems.

Currently, I'm a research intern on the Channel recommendation R&D team at Weixin Group, Tencent, exploring optimization opportunities for large-scale embedding-based DLRM online training.

I obtained my B.E. degree in 2022 from SYSU as well. Check out my résumé for more details.

Industry experience

Channel recommendation system
Tencent, WeChat
Guangzhou, China, Nov. 2024 - Now

GPU compute arch intern
NVIDIA, F(ast)K(ernel) team
Shanghai, China, Sept. 2022 - Dec. 2022

Heterogeneous intern
ByteDance, IaaS virtualization team
Hangzhou, China, May 2022 - Aug. 2022

FOSS engagements

cpython
I've made some tiny contributions to the main branch of the CPython interpreter. I'm interested in the free-threaded (no-GIL) build and in bytecode specialization optimizations that make Python faster. I'm also writing articles about CPython internals in wiki/cpython.
debian
Since GSoC '24, I have been maintaining ROCm (the open-source GPU software stack for AMD GPUs) in Debian's official package archive. Unlike the packages provided by AMD, we aim to transparently provide a series of packages that fully comply with the DFSG, to democratize GPU computing. See my QA page for contribution records.