This may be an unresolved issue. The following article may be helpful for general information about GRPO, but it is not specific to classification tasks…
John6666