Is there a GRPO/RL/PPO for text classification task using encoder only models like bert/roberta. any github repo , example or help would be really appreciated thanks in advance.

Can we use PPOtrainer to deal with text classification problem opened 08:24AM - 08 Sep 23 UTC (UTC) closed 03:05PM - 04 Dec 23 UTC (UTC) [image] yixiaoer …

GRPO or PPO or some RL

Research

John6666 May 19, 2025, 1:08pm 2

This may be an unresolved issue. The following article may be helpful for general information about GRPO, but it is not specific to classification tasks…

Topic		Replies	Views
GRPO Trainer for VLM? Research	5	432	July 7, 2025
🔧 Beyond Pretraining: A Visual Guide to Post-Training Techniques Show and Tell	1	327	July 27, 2025
New Version of PPOTrainer 🤗Transformers	6	669	November 24, 2024
Scalar Reward Model 🤗Transformers	2	94	April 8, 2025
Fine Tuned GPT2 model performs very poorly on token classification task Models	4	1923	February 1, 2022

GRPO or PPO or some RL

Related topics