AI & ML interests
None yet
Organizations
None yet
cjgs20017/qwen_rl_8k_28k_7k_all
cjgs20017/qwen_rl_8k_28k_7k
cjgs20017/qwen_rl_8k_28k_7k_low
cjgs20017/deepseek_distill_sft
4.09M • Updated • 1
cjgs20017/mamba_longalign
4.09M • Updated • 3
8B • Updated • 2
cjgs20017/deepseek_qwen_rl
8B • Updated cjgs20017/deepseek_qwen_p
8B • Updated cjgs20017/deepseek_qwen_new
Updated
8B • Updated • 1
cjgs20017/qwen3_small_new_04
2B • Updated cjgs20017/qwen3_4b_low_large_long_179
4B • Updated 2B • Updated • 1
8B • Updated • 1
cjgs20017/qwen3_4b_low_large
4B • Updated cjgs20017/qwen3_4b_long_low_large
4B • Updated • 1
cjgs20017/qwen3_4b_long_low
4B • Updated • 1
4B • Updated • 1
4B • Updated • 2
4B • Updated • 1
cjgs20017/deepseek_qwen_7b
8B • Updated • 1
Text Generation
• 7B • Updated • 3
cjgs20017/gemma2_32k_base
Text Generation
• 3B • Updated • 2
Text Generation
• 3B • Updated • 4
cjgs20017/gemma2_2b_longred
Text Generation
• 3B • Updated • 4