news
Jun 03, 2025 | Check out our latest findings in On the Effect of Negative Gradient in Group Relative Deep Reinforcement Optimization, where we delve into the learning dynamics of GRPO and conduct an in-depth analysis of negative gradients. |
---|---|
Jun 02, 2025 | Delighted to share that I will be interning as a Research Scientist at Meta this summer. |
Apr 01, 2025 | Our paper MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs is online! We use Knowledge Graph(KG)as structured knowledge source to provide fact guidence on medical reasoning data generation. |
Feb 07, 2025 | Our work DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models is selected as Spotlight at ICLR, you can drop more than 99% of your delta parameters without hurt finetuned model performance! Code will be released soon. |