Foundation Models

Foundation Models

Large language models, diffusion models, and transformers. Research on pre-training, alignment, and generative AI systems.

Selected Publications in Foundation Models

A selection of recent publications. For a complete list, please visitGoogle Scholar.

13 selected publications

Mmada: Multimodal large diffusion language models

L Yang, Y Tian, B Li, X Zhang, K Shen, Y Tong, M Wang

arXiv preprint arXiv:2505.15809 · 2025

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

K Huang, J Guo, Z Li, X Ji, J Ge, W Li, Y Guo, T Cai, H Yuan, R Wang, ...

arXiv preprint arXiv:2502.06453 · 2025

Reasonflux: Hierarchical llm reasoning via scaling thought templates

L Yang, Z Yu, B Cui, M Wang

arXiv preprint arXiv:2502.06772 · 2025

Revolutionizing reinforcement learning framework for diffusion large language models

Y Wang, L Yang, B Li, Y Tian, K Shen, M Wang

arXiv preprint arXiv:2509.06949, 2025 · 2025

Emergent symbolic mechanisms support abstract reasoning in large language models

Y Yang, D Campbell, K Huang, M Wang, J Cohen, T Webb

arXiv preprint arXiv:2502.20332, 2025 · 2025

Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data

M Chen, K Huang, T Zhao, M Wang

International Conference on Machine Learning · 2024

MaxMin-RLHF: Towards equitable alignment of large language models with diverse human preferences

S Chakraborty, J Qiu, H Yuan, A Koppel, F Huang, D Manocha, AS Bedi, ...

Forty-first International Conference on Machine Learning · 2024

An Overview of Diffusion Models: Applications, Guided generation, Statistical Rates and Optimization

M Chen, S Mei, J Fan, M Wang

arXiv preprint arXiv:2404.07771 · 2024

Fast best-of-n decoding via speculative rejection

H Sun, M Haider, R Zhang, H Yang, J Qiu, M Yin, M Wang, P Bartlett, ...

Advances in Neural Information Processing Systems 37, 32630-32652 · 2024

Gradient guidance for diffusion models: An optimization perspective

Y Guo, H Yuan, Y Yang, M Chen, M Wang

Advances in Neural Information Processing Systems 37, 90736-90770 · 2024

Treebon: Enhancing inference-time alignment with speculative tree-search and best-of-n sampling

J Qiu, Y Lu, Y Zeng, J Guo, J Geng, H Wang, K Huang, Y Wu, M Wang

arXiv preprint arXiv:2410.16033 · 2024

Specdec++: Boosting speculative decoding via adaptive candidate lengths

K Huang, X Guo, M Wang

arXiv preprint arXiv:2405.19715 · 2024