Fu Dai (wuxuedaifu)

Data scientist with a production engineering side. I work on AI systems that have to run reliably at inference time, not just in notebooks.

Current focus: multimodal AI serving (TTS, OCR, ASR, LLM) on GPU infrastructure.

My vLLM-based serving repos include real benchmark data measured on A100 and H200 GPUs.

All expose OpenAI-compatible APIs and ship with Docker.

Python · vLLM · FastAPI · CUDA · Docker · Kubernetes
ClickHouse · PostgreSQL · Java · SQL