  • Xiaomi
  • Beijing

wuxuedaifu/README.md

Fu Dai

Data Scientist with a production engineering side. I work on AI systems that have to run reliably at inference time, not just in notebooks.

Current focus: multimodal AI serving (TTS, OCR, ASR, LLM) on GPU infrastructure.

Production serving systems

vLLM-based serving repos with real benchmark data on A100 / H200.

All expose OpenAI-compatible APIs and ship with Docker.
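
As a rough illustration of what "OpenAI-compatible" means for a client, the sketch below builds (but does not send) a request in the shape of the OpenAI `/v1/audio/speech` endpoint using only the Python standard library. The base URL, the model name `tts-model`, the voice name `default`, and the helper `build_speech_request` are all assumptions made for the example; the actual routes and accepted parameters of each repo may differ, so check the individual READMEs.

```python
import json
import urllib.request


def build_speech_request(base_url, text, model="tts-model", voice="default",
                         response_format="pcm"):
    """Build, without sending, a POST request shaped like OpenAI's /v1/audio/speech.

    `model` and `voice` are placeholder values (assumptions), not names
    taken from any of the repositories listed on this page.
    """
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical local server address; nothing is sent over the network here.
req = build_speech_request("http://localhost:8000", "Hello from a TTS server.")
print(req.get_method(), req.full_url)
print(json.loads(req.data))
```

Passing the request to `urllib.request.urlopen(req)` and reading the response in fixed-size chunks would be one way to consume streamed audio, assuming the server streams its response body.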

Data science work

  • ASR / TTS: speech systems, streaming inference, latency optimization
  • Vision: OCR, face recognition, document AI
  • LLM: RAG pipelines, serving infra, OpenAI-compatible APIs
  • Data: ClickHouse, PostgreSQL, ETL pipelines, AQI/weather data systems

Stack

Python · vLLM · FastAPI · CUDA · Docker · Kubernetes
ClickHouse · PostgreSQL · Java · SQL

Pinned

  1. vllm-surya-ocr Public

    OpenAI-compatible, vLLM-served OCR API for the Surya-OCR-2 model — multilingual document OCR (layout + text recognition) with request batching, a local CLI, and Docker packaging.

    Python 21

  2. xttsv2-vllm-streaming-server Public

    Real-time streaming TTS server for XTTS-v2 on vLLM: OpenAI-compatible API, ~0.5 s time-to-first-byte (TTFB), Docker

    Python 32 2

  3. vllm-chatterbox-stream Public

    OpenAI-compatible multilingual TTS server — Chatterbox on vLLM with real-time PCM audio streaming, low time-to-first-byte (~0.7 s), voice cloning, and 23 languages.

    Python 32 2

  4. pipecat-plugin-tenvad Public

    Local TenVAD voice activity detection plugin for Pipecat voice agent pipelines

    Python 3

  5. insightface Public

    Forked from deepinsight/insightface

    State-of-the-art 2D and 3D Face Analysis Project

    Python 1

  6. deepfilter-stream Public

    Real-time streaming noise cancellation with DeepFilterNet3 on ONNX Runtime

    Python 1