Research

Research Interests

  • High-dimensional statistics and learning
  • Deep learning theory
  • Foundations of artificial intelligence

Publications (*: equal contribution)

  1. Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers
    Siyu Chen, Heejune Sheen, Tianhao Wang, and Zhuoran Yang
    In Advances in Neural Information Processing Systems (NeurIPS), 2024
  2. Approximate Message Passing for orthogonally invariant ensembles: Multivariate non-linearities and spectral initialization
    Xinyi Zhong*, Tianhao Wang*, and Zhou Fan
    Information and Inference: A Journal of the IMA, 2024
  3. Universality of Approximate Message Passing algorithms and tensor networks
    Tianhao Wang, Xinyi Zhong, and Zhou Fan
    The Annals of Applied Probability, 2024
  4. Training dynamics of multi-head softmax attention for in-context learning: emergence, convergence, and optimality
    Siyu Chen, Heejune Sheen, Tianhao Wang, and Zhuoran Yang
    Conference on Learning Theory (COLT), 2024
    Presented at ICLR 2024 Workshop on Bridging the Gap Between Practice and Theory in Deep Learning
  5. Maximum likelihood for high-noise group orbit estimation and single-particle cryo-EM
    Zhou Fan, Roy R. Lederman, Yi Sun, Tianhao Wang, and Sheng Xu
    The Annals of Statistics, 2024
  6. The Marginal Value of Momentum for Small Learning Rate SGD
    Runzhe Wang, Sadhika Malladi, Tianhao Wang, Kaifeng Lyu, and Zhiyuan Li
    In International Conference on Learning Representations (ICLR), 2024
  7. Noise-adaptive Thompson sampling for linear contextual bandits
    Ruitu Xu, Yifei Min, and Tianhao Wang
    In Advances in Neural Information Processing Systems (NeurIPS), 2023
  8. Cooperative multi-Agent reinforcement learning: asynchronous communication and linear function approximation
    Yifei Min, Jiafan He, Tianhao Wang, and Quanquan Gu
    In International Conference on Machine Learning (ICML), 2023
  9. Finding regularized competitive equilibria of heterogeneous agent macroeconomic models via reinforcement learning
    Ruitu Xu, Yifei Min, Tianhao Wang, Michael I. Jordan, Zhaoran Wang, and Zhuoran Yang
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
  10. Fast mixing of stochastic gradient descent with normalization and weight decay
    Zhiyuan Li, Tianhao Wang, and Dingli Yu
    In Advances in Neural Information Processing Systems (NeurIPS), 2022
  11. Learn to match with no regret: Reinforcement learning in Markov matching markets
    Yifei Min, Tianhao Wang, Ruitu Xu, Zhaoran Wang, Michael I Jordan, and Zhuoran Yang
    In Advances in Neural Information Processing Systems (NeurIPS), 2022  (Oral)
  12. A simple and provably efficient algorithm for asynchronous federated contextual linear bandits
    Jiafan He*, Tianhao Wang*, Yifei Min*, and Quanquan Gu
    In Advances in Neural Information Processing Systems (NeurIPS), 2022
  13. Implicit bias of gradient descent on reparametrized models: On equivalence to mirror descent
    Zhiyuan Li*, Tianhao Wang*, Jason D. Lee, and Sanjeev Arora
    In Advances in Neural Information Processing Systems (NeurIPS), 2022
    Abridged version accepted for a contributed talk to ICML 2022 Workshop on Continuous time methods for machine learning
  14. Learning stochastic shortest path with linear function approximation
    Yifei Min, Jiafan He, Tianhao Wang, and Quanquan Gu
    In International Conference on Machine Learning (ICML), 2022
  15. What happens after SGD reaches zero loss?–A mathematical framework
    Zhiyuan Li, Tianhao Wang, and Sanjeev Arora
    In International Conference on Learning Representations (ICLR), 2022  (Spotlight)
  16. North American biliary stricture management strategies in children after liver transplantation: a multicenter analysis from the society of pediatric liver transplantation (SPLIT) registry
    Pamela L Valentino, Tianhao Wang, Veronika Shabanova, Vicky Lee Ng, John C Bucuvalas,  Amy G Feldman and 5 more authors
    Liver Transplantation, 2022
  17. Variance-aware off-policy evaluation with linear function approximation
    Yifei Min*, Tianhao Wang*, Dongruo Zhou, and Quanquan Gu
    In Advances in neural information processing systems (NeurIPS), 2021
  18. Provably efficient reinforcement learning with linear function approximation under adaptivity constraints
    Tianhao Wang*, Dongruo Zhou*, and Quanquan Gu
    In Advances in Neural Information Processing Systems (NeurIPS), 2021
  19. Likelihood landscape and maximum likelihood estimation for the discrete orbit recovery model
    Zhou Fan, Yi Sun, Tianhao Wang, and Yihong Wu
    Communications on Pure and Applied Mathematics, 2022
  20. Continuous and discrete-time accelerated stochastic mirror descent for strongly convex functions
    Pan Xu*, Tianhao Wang*, and Quanquan Gu
    In International Conference on Machine Learning (ICML), 2018
  21. Accelerated stochastic mirror descent: From continuous-time dynamics to discrete-time algorithms
    Pan Xu*, Tianhao Wang*, and Quanquan Gu
    In International Conference on Artificial Intelligence and Statistics (AISTATS), 2018

Preprints (*: equal contribution)

  1. Implicit regularization of gradient flow on one-layer softmax attention
    Heejune Sheen, Siyu Chen, Tianhao Wang, and Harrison H. Zhou
    arXiv:2403.08699, 2024
    Presented at ICLR 2024 Workshop on Bridging the Gap Between Practice and Theory in Deep Learning
  2. How well can Transformers emulate in-context Newton’s method?
    Angeliki Giannou, Liu Yang, Tianhao Wang, Dimitris Papailiopoulos, and Jason D. Lee
    arXiv:2403.03183, 2024
    Presented at ICLR 2024 Workshop on Bridging the Gap Between Practice and Theory in Deep Learning