fasa-org/MiniCPM-4-8B-DashAttention
PyTorch
llama
arxiv: 2605.18753
License: apache-2.0
main
MiniCPM-4-8B-DashAttention
16.4 GB
2 contributors
History: 18 commits
hyx21, Update README.md, d1d53b1 (verified), 17 days ago
File | Size | Last commit | Updated
.gitattributes | 1.52 kB | initial commit | 24 days ago
README.md | 6.96 kB | Update README.md | 17 days ago
added_tokens.json | 216 Bytes | Upload 7 files | 23 days ago
check_act.py | 1.08 kB | Add check_act.py (ModelScope adansa source, verbatim) | 22 days ago
config.json | 3.42 kB | Update config.json | 20 days ago
config_benchmark.json | 3.42 kB | Rename config.json to config_benchmark.json | 20 days ago
configuration_minicpm.py | 9.66 kB | Add configuration_minicpm.py (ModelScope adansa source, verbatim) | 22 days ago
generation_config.json | 182 Bytes | Upload 7 files | 23 days ago
modeling_llama_long_infllmv2.py | 102 kB | Add modeling_llama_long_infllmv2.py (ModelScope adansa source, verbatim) | 22 days ago
modeling_llama_long_infllmv2_64.py | 102 kB | Add modeling_llama_long_infllmv2_64.py (ModelScope adansa source, verbatim) | 22 days ago
modeling_minicpm.py | 103 kB | Add modeling_minicpm.py (ModelScope adansa source, verbatim) | 22 days ago
pytorch_model.bin | 16.4 GB (xet) | Add pytorch_model.bin weights (ModelScope adansa source) | 22 days ago
special_tokens_map.json | 630 Bytes | Upload 7 files | 23 days ago
stage1.py | 10.9 kB | Add stage1.py (ModelScope adansa source, verbatim) | 22 days ago
tokenizer.json | 6.7 MB | Upload 7 files | 23 days ago
tokenizer.model | 1.18 MB (xet) | Upload 7 files | 23 days ago
tokenizer_config.json | 2.94 kB | Upload 7 files | 23 days ago
topk_sparse_attention_decode.py | 12.6 kB | Add topk_sparse_attention_decode.py (ModelScope adansa source, verbatim) | 22 days ago
topk_sparse_attn.py | 46.5 kB | Add topk_sparse_attn.py (ModelScope adansa source, verbatim) | 22 days ago
transform_score.py | 4.55 kB | Add transform_score.py (ModelScope adansa source, verbatim) | 22 days ago