AmanPriyanshu/tool-reasoning-sft-TOOLS-context-management-handling Viewer • Updated 12 days ago • 75k • 87
AmanPriyanshu/regularizer-250K-from-reasoning-and-tool-use-sft-4M-random-compilation Viewer • Updated 16 days ago • 250k • 78 • 1
AmanPriyanshu/regularizer-250K-from-reasoning-sft-3M-random-compilation Viewer • Updated 21 days ago • 250k • 42
AmanPriyanshu/tool-reasoning-sft-RESEARCH-rlvr-env-retrieval-source Viewer • Updated 29 days ago • 156k • 55
AmanPriyanshu/tool-reasoning-sft-RESEARCH-openresearcher-dataset-sft-deep-research-agent-data-cleaned Updated 29 days ago • 562 • 1
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenHands-CodeScout_Training_Rollouts Viewer • Updated 30 days ago • 56.8k • 36
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenSeeker-v1-Data Viewer • Updated 30 days ago • 7.19k • 33
AmanPriyanshu/tool-reasoning-sft-RESEARCH-REDSearcher_SFT_10K Viewer • Updated 30 days ago • 9.05k • 32
AmanPriyanshu/reasoning-sft-minimax-microsoft-orca-agentinstruct-1M-v1 Viewer • Updated Mar 16 • 945k • 97 • 1
AmanPriyanshu/reasoning-sft-minimax-stratified-kmeans-diverse-reasoning-842K-only Viewer • Updated Mar 15 • 843k • 47
AmanPriyanshu/tool-reasoning-sft-TOOLS-toucan-1.5m-sft-tool-use-data-cleaned-rectified-333k Viewer • Updated Mar 14 • 566k • 47
AmanPriyanshu/RLVR-Env-Retrieval-Source-Retrieval-Synthetic-NVDocs-v1 Viewer • Updated Mar 14 • 100k • 21
AmanPriyanshu/tool-reasoning-sft-CODING-nvidia-Nemotron-Agentic-v1 Viewer • Updated Mar 14 • 331k • 42
AmanPriyanshu/reasoning-sft-Nemotron-Instruction-Following-Chat-v1 Viewer • Updated Mar 14 • 158k • 18
AmanPriyanshu/tool-reasoning-sft-RESEARCH-grill-lab-browsecomp-plus-runs-data-cleaned-rectified Viewer • Updated Mar 11 • 49.9k • 37
AmanPriyanshu/tool-reasoning-sft-CODING-allenai-SERA-data-cleaned-rectified Viewer • Updated Mar 10 • 211k • 45
AmanPriyanshu/tool-reasoning-sft-TOOLS-hermes-reasoning-tool-style-data-cleaned-rectified-115k Viewer • Updated Mar 10 • 115k • 32
AmanPriyanshu/RLVR-Env-Retrieval-Source-code-search-net-javascript Viewer • Updated Mar 10 • 100k • 23