About

Dr. Tao Ge is a Principal Science Lead at Microsoft in Redmond, where he leads research and development of LLMs with a focus on productivity scenarios, spanning synthetic data, mid-/post-training, efficient training recipes, and agentic approaches. Prior to his current role, Tao was a Principal Researcher at Tencent AI Lab (Seattle) and Microsoft Research Asia (MSRA) after earning his Ph.D. in Computer Science from Peking University.

Tao has published more than 60 papers at top AI/ML conferences. Two of his most known and widely adopted tech innovations are:

  1. Speculative Decoding: Tao was the first to propose and name Speculative Decoding in the published literature, a novel paradigm for lossless acceleration of Transformer decoding, and the first to open-source the draft-then-verify speculative execution decoding paradigm:
    • His seminal work began in 2021 (initially referred to as Aggressive Decoding) for seq2seq generation, which is a key idea of Predicted Output used by OpenAI.
    • He was the first to introduce a separate drafter model to achieve lossless speedup (first made public in March 2022) and was the first to coin the term “Speculative Decoding” for this speculative execution paradigm (made public in September 2022).

    His research was subsequently followed by the papers on Speculative Decoding/Sampling for LLMs from Google (first made public in November 2022) and DeepMind (made public in February 2023), sparking the surge of interest and adoption since mid-2023. Today, Speculative Decoding has become an industry standard for LLM inference acceleration, supported in major open-source frameworks (e.g., vLLM, PyTorch, ONNX) and widely integrated into production-scale deployments.

  2. Persona-Driven Synthetic Data Creation: Tao proposed persona-driven synthetic data creation, a novel paradigm for scaling high-quality synthetic training data generation. This innovation has been widely recognized and adopted as a core methodology for synthetic data creation and agentic simulation in the development of leading LLMs, including but not limited to:

Publications (*: equal contributions; : corresponding author)

Tech Report

Peer-reviewed