Foundation Models
Large language models, diffusion models, and transformers. Research on pre-training, alignment, and generative AI systems.
Selected Publications in Foundation Models
A selection of recent publications. For a complete list, please visit Google Scholar.
13 selected publications
MMaDA: Multimodal large diffusion language models
L Yang, Y Tian, B Li, X Zhang, K Shen, Y Tong, M Wang
arXiv preprint arXiv:2505.15809 · 2025
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations
K Huang, J Guo, Z Li, X Ji, J Ge, W Li, Y Guo, T Cai, H Yuan, R Wang, ...
arXiv preprint arXiv:2502.06453 · 2025
ReasonFlux: Hierarchical LLM reasoning via scaling thought templates
L Yang, Z Yu, B Cui, M Wang
arXiv preprint arXiv:2502.06772 · 2025
Revolutionizing reinforcement learning framework for diffusion large language models
Y Wang, L Yang, B Li, Y Tian, K Shen, M Wang
arXiv preprint arXiv:2509.06949 · 2025
Emergent symbolic mechanisms support abstract reasoning in large language models
Y Yang, D Campbell, K Huang, M Wang, J Cohen, T Webb
arXiv preprint arXiv:2502.20332 · 2025
Training-free guidance beyond differentiability: Scalable path steering with tree search in diffusion and flow models
Y Guo, Y Yang, H Yuan, M Wang
arXiv preprint arXiv:2502.11420 · 2025
Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data
M Chen, K Huang, T Zhao, M Wang
International Conference on Machine Learning · 2024
MaxMin-RLHF: Towards equitable alignment of large language models with diverse human preferences
S Chakraborty, J Qiu, H Yuan, A Koppel, F Huang, D Manocha, AS Bedi, ...
Forty-first International Conference on Machine Learning · 2024
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
M Chen, S Mei, J Fan, M Wang
arXiv preprint arXiv:2404.07771 · 2024
Fast best-of-N decoding via speculative rejection
H Sun, M Haider, R Zhang, H Yang, J Qiu, M Yin, M Wang, P Bartlett, ...
Advances in Neural Information Processing Systems 37, 32630-32652 · 2024
Gradient guidance for diffusion models: An optimization perspective
Y Guo, H Yuan, Y Yang, M Chen, M Wang
Advances in Neural Information Processing Systems 37, 90736-90770 · 2024
TreeBoN: Enhancing inference-time alignment with speculative tree-search and best-of-N sampling
J Qiu, Y Lu, Y Zeng, J Guo, J Geng, H Wang, K Huang, Y Wu, M Wang
arXiv preprint arXiv:2410.16033 · 2024
SpecDec++: Boosting speculative decoding via adaptive candidate lengths
K Huang, X Guo, M Wang
arXiv preprint arXiv:2405.19715 · 2024