### SFT Model for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards"
[![GitHub](https://img.shields.io/github/stars/THUDM/CaRR)](https://github.com/THUDM/CaRR) [![arXiv](https://img.shields.io/badge/arXiv-2601.06021-b31b1b.svg)](https://arxiv.org/pdf/2601.06021) [![Dataset & Model](https://img.shields.io/badge/🤗%20HuggingFace-CaRR%26C--GRPO-green)](https://huggingface.co/collections/THU-KEG/carr-and-c-grpo)