LibAUC: A Deep Learning Library for X-Risk Optimization. Total Downloads: 87k+
DisCO: Discriminative Constrained Optimization for Large Reasoning Models. Improves GRPO, DAPO, Dr. GRPO, etc in multiple aspects with significant improvements.
DFT: Discriminative Fine-tuning of LLMs without reward models and human preference data. Improves SFT and is Competitive SF-->PO.
DRRho-CLIP: Model steering for CLIP training. Beats OpenAI CLIP using 8 GPUs with 2 times less data size (192M vs 400M) and 10 times less sample complexity (1.28b vs 12.8b).
SogCLR: Stochastic Optimization for Global Contrastive Learning without Large Mini-batches. Achieves the same performance as SimCLR with 32 times less batch size (256 vs 8192).
Online Optimization with Gradual Variations [bibtex]
with Chao-Kai Chiang, Chia-Jung Lee, Mehrdad Mahdavi, Chi-Jen Lu, Rong Jin, Shenghuo Zhu
In COLT 2012. Best Student Paper Award.
This paper is a merged paper of two COLT submissions. Our manuscript proposes two algorithms for online convex optimization
to obtain a variation based regret bound. One of our algorithms is the same to the algorithm proposed in the other submission.
A commmentary on the merged paper written by Satyen Kale can be found here.