Publications
In reversed chronological order | * denotes equal contribution
2025
- ICLR 2025Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models AlignmentIn International Conference on Learning Representations, 2025
- ICLR 2025Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better DiversityIn International Conference on Learning Representations, 2025
2024
2023
- Ph.D. ThesisUnderstanding Adversarially Robust Generalization: A Learning Theory PerspectiveThe Chinese University of Hong Kong, Shenzhen, 2023
2022
- MLSW 2022Smoothed-SGDmax: A Stability-Inspired Algorithm to Improve Adversarial GeneralizationIn NeurIPS ML Safety Workshop, 2022