Xuanteng Huang's homepage | pronounciation: Seant (/sɪnt/), Chinese: 黃軒騰
myself

Zhuhai, China, Nov. 2021

I am a third-year Ph.D. student of CSE at Sun Yat-sen University mentored by Assoc. Prof. Xianwei Zhang and Prof. Nong Xiao. My research interest mainly focuses on leveraging GPUs in machine learning systems.

Currently, I'm a research intern at Channel recommendation R&D team, Weixin Group, Tencent, to explore the optimization opportunities for large-scale embedding-based DLRM online training.

I obtained my B.E. degree in 2022 from SYSU as well. Check out my résumé for more details.

Industry experiences

[object Object], ,recommandation system
Channel recommandation systemTencent, WeChat
Guangzhou, ChinaNov. 2024 - Now
GPU compute arch intern
GPU compute arch internNVIDIA, F(ast)K(ernel) team
Shanghai, ChinaSept., 2022 - Dec., 2022
Heterogeneous intern
Heterogeneous internByteDance, IaaS virtualization team
Hangzhou, ChinaMay., 2022 - Aug., 2022

FOSS ? engagements

debian
Since GSoC '24, I'm maintaining ROCm (open source GPU software stack for AMD GPUs) in Debian's official package archive. Unlike the package provided by AMD, we aim to provide a series of packages totally compatible with DFSG transparently for democratizing GPU computing. See my QA page for contribution records.
cpython
I've made some tiny contributions towards the main branch of CPython interpreter. I'm interested in the free-threaded no-GIL build and bytecode specialization optimizations to make Python faster. Also, I'm writing some articles about CPython internals in wiki/cpython.