The author order is alphabetical (following CS theory convention) unless marked with an asterisk.Â
Two Heads are Better than One: Simulating Large Transformers with Small Ones*
Hantao Yu, Josh Alman
In Submission
Fast Attention Mechanisms: a Tale of Parallelism*
Jingwen Liu, Hantao Yu, Clayton Sanford, Alex Andoni, Daniel Hsu
In Submission
Differentially Private Shortest Distances in Continual Release Model
Rachel Cummings, Tamalika Mukherjee, Jalaj Upadhyay, Hantao Yu, Zongrui Zou
Theory and Practice of Differential Privacy Workshop (TPDP), 2025
In Submission to conference
Fundamental Limitations on Subquadratic Alternatives to Transformers [arXiv]
Josh Alman, Hantao Yu
International Conference on Learning Representations (ICLR), 2025
Improving the Leading Constant of Matrix Multiplication [arXiv]
Josh Alman, Hantao Yu
Symposium on Discrete Algorithms (SODA), 2025
Tensor Ranks and the Fine-Grained Complexity of Dynamic Programming [arXiv][20-min talk]
Josh Alman, Ethan Turok, Hantao Yu, Hengzhi Zhang
Innovations in Theoretical Computer Science (ITCS), 2024
Robust Empirical Risk Minimization with Tolerance* [arXiv]
Robi Bhattacharjee, Max Hopkins, Akash Kumar, Hantao Yu, Kamalika Chaudhuri
International Conference on Algorithmic Learning Theory (ALT), 2023
Active Learning Polynomial Threshold Functions [arXiv]
Omri Ben-Eliezer, Max Hopkins, Chutong Yang, Hantao Yu
Neural Information processing Systems (NeurIPS), 2022