Wangjie Gan

zju-omniai

AI & ML interests

None yet

Recent Activity

authored a paper about 15 hours ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

commented on a paper about 23 hours ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

upvoted a paper about 23 hours ago

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

Organizations

Papers 1

arxiv:2604.14258

Models 0

None public yet

Datasets 0

None public yet