UniSteer: Text-Guided Flow Matching in Activation Space for Versatile LLM Steering Paper • 2605.30076 • Published 15 days ago • 26
Crafter: A Multi-Agent Harness for Editable Scientific Figure Generation from Diverse Inputs Paper • 2605.30611 • Published 15 days ago • 192
MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation Paper • 2605.20183 • Published 24 days ago • 14
No One Knows the State of the Art in Geospatial Foundation Models Paper • 2605.12678 • Published about 1 month ago • 4
Mem-π: Adaptive Memory through Learning When and What to Generate Paper • 2605.21463 • Published 23 days ago • 8
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution Paper • 2605.18401 • Published 25 days ago • 126
WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation Paper • 2605.10912 • Published May 11 • 46
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 102