Xuanteng Huang's homepage 🌿

Xuanteng Huang's homepage _{| pronounciation: Seant (/sɪnt/), Chinese: 黃軒騰}

San Francisco, CA,
United States, Jun. 2025

I am a third-year Ph.D. student of CSE at Sun Yat-sen University mentored by Assoc. Prof. Xianwei Zhang and Prof. Nong Xiao. My research interest mainly focuses on leveraging GPUs in machine learning systems.

Currently, I'm a research intern at Channel recommendation R&D team, Weixin Group, Tencent, to explore the optimization opportunities for large-scale embedding-based DLRM online training.

I obtained my B.E. degree in 2022 from SYSU as well. Check out my résumé for more details.

Industry experiences

Channel recommandation systemTencent, WeChat

Guangzhou, ChinaNov. 2024 - Now

GPU compute arch internNVIDIA, F(ast)K(ernel) team

Shanghai, ChinaSept., 2022 - Dec., 2022

Heterogeneous internByteDance, IaaS virtualization team

Hangzhou, ChinaMay., 2022 - Aug., 2022

FOSS ^? engagements

I've made some tiny contributions towards the main branch of CPython interpreter. I'm interested in the free-threaded no-GIL build and bytecode specialization optimizations to make Python faster. Also, I'm writing some articles about CPython internals in wiki/cpython.

Since GSoC '24, I'm maintaining ROCm (open source GPU software stack for AMD GPUs) in Debian's official package archive. Unlike the package provided by AMD, we aim to provide a series of packages totally compatible with DFSG transparently for democratizing GPU computing. See my QA page for contribution records.

Industry experiences

FOSS ? engagements

FOSS ^? engagements