### SFT Model for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards"
[GitHub repository](https://github.com/THUDM/CaRR)
[Paper (arXiv)](https://arxiv.org/pdf/2601.06021)
[Hugging Face collection](https://huggingface.co/collections/THU-KEG/carr-and-c-grpo)