Long Wu

About me

Hi, I’m Long Wu. I’m a Software Engineer at Microsoft. Currently, I’m also a Research Intern at UC Merced, advised by Prof. Dong Li. My research interests are Computer Systems, HPC, Distributed Computing & Machine Learning.

I prefer ideas that are simple, solid and making true impact. I love building things that can solve real problems.

Recent updates

Personal homepage launched on GitHub.

Education

2026.03 — Now
Research Intern
University of California, Merced · advised by Prof. Dong Li
2018.08 — 2022.06
B.Eng. in Software Engineering
School of Information and Software Engineering,
University of Electronic Science and Technology of China (UESTC)

Work experience

Companies where I’ve worked or interned.

2024.03 — Now
Software Engineer
Microsoft · MAI-A Ads Data, Microsoft AI
2021.11 — 2024.02
Software Development Engineer, Backend
Tencent · Interactive Entertainment Group (IEG)
2021.01 — 2021.06
Software Development Engineer Intern
ByteDance · Data-Monetization Technology

Open source contributions

Selected projects I’ve authored or contributed to. More on GitHubxiaobao520123.

01

vLLM Contributor

A high-throughput, memory-efficient inference and serving engine for LLMs.
  • Batched weight prefetching that yields a >50% per-step latency decrease and ~4% lower TTFT. PR #41474
02

SGLang Contributor

A fast serving framework for large language models and vision language models.
  • Redesigned the Weight OffloaderV2 to support CUDA Graph compilation, delivering >5× increase in token throughputs. PR #24531
03

OpenXMLA Author

High performance data analysis service from an OLAP Cube.
04

MiniVT Author

A simple showcase of Intel CPU’s virtualization technology VT-x on the Windows platform.
05

openclaw-os-activity Author

Enables OpenClaw to read and learn from user activity on the OS to deliver more personalized answers.

Research

A selection of recent work.

Honors & awards

Awards: The First Prize of National Contest at the 9th China Software Cup (2021).