CohereLabs/xP3x
Viewer • Updated • 434M • 286k • 93
How to use kcoopermiller/aya-101-GGUF with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("kcoopermiller/aya-101-GGUF")
model = AutoModelForSeq2SeqLM.from_pretrained("kcoopermiller/aya-101-GGUF")This repo contains GGUF format model files for Cohere's Aya-101 model
Quantized using Huggingface's candle framework
Visit the candle T5 example for more detailed instruction
git clone https://github.com/huggingface/candle.git
cd candle/candle-examples
cargo run --example quantized-t5 --release -- \
--model-id "kcoopermiller/aya-101-GGUF" \
--weight-file "aya-101.Q2_K.gguf" \
--config-file "config.json" \
--prompt "भारत में इतनी सारी भाषाएँ क्यों हैं?" \
--temperature 0
Available weight files: