Improve language tag

#2
by lbourdois - opened
Files changed (1)
  1. README.md +82 -68
README.md CHANGED
@@ -1,69 +1,83 @@
- ---
- base_model:
- - Qwen/Qwen2.5-72B-Instruct
- tags:
- - conversational
- - roleplay
- - chat
- license: other
- license_name: qwen
- ---
- # Qwen 2.5 72b RP Ink
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/M9KSL64gppBVatmTdoQnG.png)
- A roleplay-focused LoRA finetune of Qwen 2.5 72b Instruct. Methodology and hyperparams inspired by [SorcererLM](https://huggingface.co/rAIfle/SorcererLM-8x22b-bf16) and [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush).
- Yet another model in the Ink series, following in the footsteps of [the 32b one](https://huggingface.co/allura-org/Qwen2.5-32b-RP-Ink) and [the Nemo one](https://huggingface.co/allura-org/MN-12b-RP-Ink)
-
- ## Testimonials
- > [Compared to the 32b] felt a noticeable increase in coherence
-
- \- ShotMisser64
-
- > Yeah ep2's great!! made me actually wanna write a reply by myself for the first time in a few days
-
- \- Maw
-
- > This is the best RP I've ever had
-
- \- 59smoke
-
- > this makes me want to get another 3090 to run 72b
-
- \- dysfunctional
-
- ## Dataset
- The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
-
- "this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
-
- Update: I have sent the (public datasets in the) data mix publicly already so here's that
- <details>
- <img src=https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/JtjUoKtbOfBZfSSKojTcj.png>
- </details>
-
- ## Quants
- [imatrix GGUFs by bartowski](https://huggingface.co/bartowski/Qwen2.5-72b-RP-Ink-GGUF)
- [exl2s by sleep deprived](https://huggingface.co/collections/ReadyArt/allura-org-qwen25-rp-ink-72b-exl2-6796a13fba1b09be7b12be9e)
-
- ## Recommended Settings
- Chat template: ChatML
- Recommended samplers (not the be-all-end-all, try some on your own!):
- - Temp 0.83 / Top P 0.8 / Top A 0.3 / Rep Pen 1.03
- - Your samplers can go here! :3
-
- ## Hyperparams
- ### General
- - Epochs = 2
- - LR = 6e-5
- - LR Scheduler = Cosine
- - Optimizer = Paged AdamW 8bit
- - Effective batch size = 16
- ### LoRA
- - Rank = 16
- - Alpha = 32
- - Dropout = 0.25 (Inspiration: [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush))
-
- ## Credits
- Humongous thanks to the people who created and curated the original data
- Big thanks to all Allura members, for testing and emotional support ilya /platonic
- especially to inflatebot who made the model card's image :3
+ ---
+ base_model:
+ - Qwen/Qwen2.5-72B-Instruct
+ tags:
+ - conversational
+ - roleplay
+ - chat
+ license: other
+ license_name: qwen
+ language:
+ - zho
+ - eng
+ - fra
+ - spa
+ - por
+ - deu
+ - ita
+ - rus
+ - jpn
+ - kor
+ - vie
+ - tha
+ - ara
+ ---
+ # Qwen 2.5 72b RP Ink
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/M9KSL64gppBVatmTdoQnG.png)
+ A roleplay-focused LoRA finetune of Qwen 2.5 72b Instruct. Methodology and hyperparams inspired by [SorcererLM](https://huggingface.co/rAIfle/SorcererLM-8x22b-bf16) and [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush).
+ Yet another model in the Ink series, following in the footsteps of [the 32b one](https://huggingface.co/allura-org/Qwen2.5-32b-RP-Ink) and [the Nemo one](https://huggingface.co/allura-org/MN-12b-RP-Ink)
+
+ ## Testimonials
+ > [Compared to the 32b] felt a noticeable increase in coherence
+
+ \- ShotMisser64
+
+ > Yeah ep2's great!! made me actually wanna write a reply by myself for the first time in a few days
+
+ \- Maw
+
+ > This is the best RP I've ever had
+
+ \- 59smoke
+
+ > this makes me want to get another 3090 to run 72b
+
+ \- dysfunctional
+
+ ## Dataset
+ The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
+
+ "this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
+
+ Update: I have sent the (public datasets in the) data mix publicly already so here's that
+ <details>
+ <img src=https://cdn-uploads.huggingface.co/production/uploads/634262af8d8089ebaefd410e/JtjUoKtbOfBZfSSKojTcj.png>
+ </details>
+
+ ## Quants
+ [imatrix GGUFs by bartowski](https://huggingface.co/bartowski/Qwen2.5-72b-RP-Ink-GGUF)
+ [exl2s by sleep deprived](https://huggingface.co/collections/ReadyArt/allura-org-qwen25-rp-ink-72b-exl2-6796a13fba1b09be7b12be9e)
+
+ ## Recommended Settings
+ Chat template: ChatML
+ Recommended samplers (not the be-all-end-all, try some on your own!):
+ - Temp 0.83 / Top P 0.8 / Top A 0.3 / Rep Pen 1.03
+ - Your samplers can go here! :3
+
+ ## Hyperparams
+ ### General
+ - Epochs = 2
+ - LR = 6e-5
+ - LR Scheduler = Cosine
+ - Optimizer = Paged AdamW 8bit
+ - Effective batch size = 16
+ ### LoRA
+ - Rank = 16
+ - Alpha = 32
+ - Dropout = 0.25 (Inspiration: [Slush](https://huggingface.co/crestf411/Q2.5-32B-Slush))
+
+ ## Credits
+ Humongous thanks to the people who created and curated the original data
+ Big thanks to all Allura members, for testing and emotional support ilya /platonic
+ especially to inflatebot who made the model card's image :3
  Another big thanks to all the members of the ArliAI and BeaverAI Discord servers for testing! All of the people featured in the testimonials are from there :3
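
The only substantive change in this diff is the new `language:` list in the YAML front matter; the rest of the README is untouched. For readers who want to apply the same kind of change to another model card, here is a minimal Python sketch of that edit. It is an illustration only, not the script (if any) used for this PR: the function name `add_language_block` is hypothetical, the 13 three-letter codes are copied from the diff above (they appear to be ISO 639-3 codes), and the sketch assumes the README starts with a `---` delimited front matter block.

```python
# Hypothetical helper, not taken from the PR: it only illustrates the metadata change.
# The codes below are copied verbatim from the added lines of the diff.
LANGUAGES = [
    "zho", "eng", "fra", "spa", "por", "deu", "ita",
    "rus", "jpn", "kor", "vie", "tha", "ara",
]


def add_language_block(readme: str, languages: list[str]) -> str:
    """Insert a `language:` list just before the closing `---` of the front matter."""
    lines = readme.split("\n")
    if not lines or lines[0].strip() != "---":
        raise ValueError("README has no YAML front matter")

    # The front matter ends at the first `---` after the opening one.
    try:
        end = next(i for i in range(1, len(lines)) if lines[i].strip() == "---")
    except StopIteration:
        raise ValueError("front matter is not closed") from None

    # Leave the card untouched if it already declares languages.
    if any(line.startswith("language:") for line in lines[1:end]):
        return readme

    block = ["language:"] + [f"- {code}" for code in languages]
    return "\n".join(lines[:end] + block + lines[end:])


if __name__ == "__main__":
    card = "---\nlicense: other\nlicense_name: qwen\n---\n# Qwen 2.5 72b RP Ink\n"
    print(add_language_block(card, LANGUAGES))
```

The sketch edits the text line by line instead of parsing and re-dumping the YAML, so existing keys keep their order and formatting, which matches how the diff above leaves every other front matter line unchanged.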