About

Dr. Tao Ge is a Principal Science Lead at Microsoft in Redmond, leading research and development of state-of-the-art large language models (LLMs), spanning synthetic data creation, mid-/post-training of OpenAI models (GPT-4/5, and o3/o4-mini), and agentic approaches powering Microsoft products (e.g., Office/Copilot). Prior to his current role, Tao was a Principal Researcher at Tencent AI Lab (Seattle) and Microsoft Research Asia (MSRA) after earning his Ph.D. in Computer Science from Peking University.

Tao has published more than 60 papers at top AI/ML conferences. Two of his most known and widely adopted tech innovations are:

  1. Speculative Decoding: Tao pioneered the seminal study of Speculative Decoding beginning in 2021 (initially referred to as Aggressive Decoding). He was the first to introduce a separate drafter model to achieve lossless acceleration of Transformer decoding (first made public in March 2022), and he was also the first to coin the term “Speculative Decoding” for this speculative execution paradigm (made public in September 2022). His research was subsequently followed by the papers on Speculative Decoding/Sampling for LLMs from Google (first made public in November 2022) and DeepMind (made public in February 2023), sparking the surge of interest and adoption since mid-2023. Today, Speculative Decoding has become an industry standard for LLM inference acceleration, supported in major open-source frameworks (e.g., vLLM, PyTorch, ONNX) and widely integrated into production-scale deployments.

  2. Persona-Driven Synthetic Data Creation: Tao proposed persona-driven synthetic data creation, a novel paradigm for scaling high-quality synthetic training data generation. This innovation has been widely recognized and adopted as a core synthetic data methodology in the development of leading LLMs, including (but not limited to):

Publications (*: equal contributions; : corresponding author)

Tech Report

Peer-reviewed