Text Generation
Transformers
PyTorch
English
olmo2
conversational
StefanHex commited on
Commit
2a1f1f8
·
verified ·
1 Parent(s): 0d85a3d

Remove note about checkpoints (only exist for RLVR)

Browse files

Intermediate checkpoints only (seem to) exist for the RLVR1 model (which makes sense given it says RL training checkpoints).

```python
olmo_model = AutoModelForCausalLM.from_pretrained("allenai/OLMo-2-0425-1B-SFT", revision="step_200")
```
will fail with
```
OSError: step_200 is not a valid git identifier (branch name, tag name or commit id) that exists for this model name. Check the model page at 'https://huggingface.co/allenai/OLMo-2-0425-1B-SFT' for available revisions.
```

Files changed (1) hide show
  1. README.md +0 -9
README.md CHANGED
@@ -77,15 +77,6 @@ I'm just a computer program, so I don't have feelings, but I'm functioning as ex
77
  ```
78
  It is embedded within the tokenizer as well, for `tokenizer.apply_chat_template`.
79
 
80
- ### Intermediate Checkpoints
81
-
82
- To facilitate research on RL finetuning, we have released our intermediate checkpoints during the model's RLVR training.
83
- The model weights are saved every 20 training steps, and can be accessible in the revisions of the HuggingFace repository.
84
- For example, you can load with:
85
- ```
86
- olmo_model = AutoModelForCausalLM.from_pretrained("allenai/OLMo-2-0425-1B-SFT", revision="step_200")
87
- ```
88
-
89
  ### Bias, Risks, and Limitations
90
 
91
  The OLMo-2 models have limited safety training, but are not deployed automatically with in-the-loop filtering of responses like ChatGPT, so the model can produce problematic outputs (especially when prompted to do so).
 
77
  ```
78
  It is embedded within the tokenizer as well, for `tokenizer.apply_chat_template`.
79
 
 
 
 
 
 
 
 
 
 
80
  ### Bias, Risks, and Limitations
81
 
82
  The OLMo-2 models have limited safety training, but are not deployed automatically with in-the-loop filtering of responses like ChatGPT, so the model can produce problematic outputs (especially when prompted to do so).