danielje/MetaMathQA
Viewer • Updated • 395k • 18
This is a Recurrent Adapter Model fine-tuned on MetaMathQA for mathematical reasoning, built on top of OLMo-3-1025-7B.
The model uses a recurrent adapter architecture where:
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained(
"hanseungwook/olmo3-recurrent-adapter-sft-cot-rec1-coda2-untied",
torch_dtype=torch.bfloat16,
device_map="auto",
trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained("allenai/OLMo-3-1025-7B")
prompt = "What is 25 * 37?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
outputs = model.generate(
**inputs,
max_new_tokens=512,
num_recurrence_steps=32,
temperature=0.7,
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Apache 2.0