Instructions to use jadael/comma-v0.1-2t-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use jadael/comma-v0.1-2t-GGUF with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="jadael/comma-v0.1-2t-GGUF",
	filename="comma-v0.1-2t-f16.gguf",
)

output = llm(
	"Once upon a time,",
	max_tokens=512,
	echo=True
)
print(output)

Notebooks
Google Colab
Kaggle
Local Apps

llama.cpp

How to use jadael/comma-v0.1-2t-GGUF with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf jadael/comma-v0.1-2t-GGUF:F16
# Run inference directly in the terminal:
llama-cli -hf jadael/comma-v0.1-2t-GGUF:F16

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf jadael/comma-v0.1-2t-GGUF:F16
# Run inference directly in the terminal:
llama-cli -hf jadael/comma-v0.1-2t-GGUF:F16

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf jadael/comma-v0.1-2t-GGUF:F16
# Run inference directly in the terminal:
./llama-cli -hf jadael/comma-v0.1-2t-GGUF:F16

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf jadael/comma-v0.1-2t-GGUF:F16
# Run inference directly in the terminal:
./build/bin/llama-cli -hf jadael/comma-v0.1-2t-GGUF:F16

Use Docker

docker model run hf.co/jadael/comma-v0.1-2t-GGUF:F16

LM Studio
Jan

vLLM

How to use jadael/comma-v0.1-2t-GGUF with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "jadael/comma-v0.1-2t-GGUF"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "jadael/comma-v0.1-2t-GGUF",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/jadael/comma-v0.1-2t-GGUF:F16

Ollama
How to use jadael/comma-v0.1-2t-GGUF with Ollama:
```
ollama run hf.co/jadael/comma-v0.1-2t-GGUF:F16
```

Unsloth Studio new

How to use jadael/comma-v0.1-2t-GGUF with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for jadael/comma-v0.1-2t-GGUF to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for jadael/comma-v0.1-2t-GGUF to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for jadael/comma-v0.1-2t-GGUF to start chatting

Docker Model Runner
How to use jadael/comma-v0.1-2t-GGUF with Docker Model Runner:
```
docker model run hf.co/jadael/comma-v0.1-2t-GGUF:F16
```

Lemonade

How to use jadael/comma-v0.1-2t-GGUF with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull jadael/comma-v0.1-2t-GGUF:F16

Run and chat with the model

lemonade run user.comma-v0.1-2t-GGUF-F16

List all available models

lemonade list

Comma v0.1-2T - GGUF

This is a GGUF conversion of common-pile/comma-v0.1-2t for use with llama.cpp and Ollama.

Model Details

Original Model: Comma v0.1-2T Architecture: Llama 3 (7B parameters) Training: 2 trillion tokens from the Common Pile v0.1 dataset License: Apache 2.0 Converted by: Community conversion

About Comma v0.1

Comma v0.1-2T is a 7 billion parameter language model trained exclusively on openly licensed and public domain text from the Common Pile v0.1 dataset. This model demonstrates that competitive performance can be achieved using only ethically sourced training data.

Performance is competitive with Llama 2 7B, OLMo, and DeepSeekLLM on knowledge-intensive and coding benchmarks.

GGUF Conversion Details

Format: GGUF
Quantization: F16 (non-quantized, full precision)
Size: 14GB
Converted with: llama.cpp (with custom patch for Comma tokenizer)
File: comma-v0.1-2t-f16.gguf

Conversion Notes

This conversion required patching llama.cpp to recognize Comma v0.1's tokenizer format. The tokenizer uses Llama 3 style BPE with a unique checksum. The patch has been included in this repository for others who want to convert similar models.

Usage with Ollama

1. Download the GGUF file

Download from the Files tab above, or use the command line:

huggingface-cli download jadael/comma-v0.1-2t-GGUF comma-v0.1-2t-f16.gguf --local-dir .

2. Create a Modelfile

FROM ./comma-v0.1-2t-f16.gguf

TEMPLATE """{{ .Prompt }}"""

PARAMETER stop "<|end_of_text|>"
PARAMETER temperature 0.7
PARAMETER top_p 0.9
PARAMETER repeat_penalty 1.1

Note: This is a base model (not instruction-tuned), so it will continue/complete text rather than follow chat instructions.

3. Import the model

ollama create comma-v0.1-2t -f Modelfile

4. Run the model

ollama run comma-v0.1-2t

Example usage (text completion):

>>> Once upon a time in a land far away
[Model will continue the story...]

Usage with llama.cpp

./llama-cli -m comma-v0.1-2t-f16.gguf -p "Your prompt here" -n 128

Important Notes

This is a base model (not instruction-tuned)
Trained on Common Pile v0.1: openly licensed and public domain text only
Suitable for further fine-tuning or use as a foundation model
For chat/instruct capabilities, fine-tuning is recommended

Citation

If you use this model, please cite the original Common Pile work:

@article{commonpile2025,
  title={The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text},
  author={[Original authors from Common Pile team]},
  year={2025}
}

License

This GGUF conversion is released under Apache 2.0 (same as the original model).

Apache 2.0 was chosen because:

It matches the original Comma v0.1-2T model license
It's compatible with llama.cpp's MIT license
It provides patent grants and protections
It requires attribution and license preservation
It's the most protective license compatible with all components

All conversions, tools, and documentation in this repository are Apache 2.0 licensed.

Downloads last month: 315

GGUF

Model size

7B params

Architecture

llama

Hardware compatibility

16-bit

Model tree for jadael/comma-v0.1-2t-GGUF

Base model

common-pile/comma-v0.1-2t

Quantized

(4)

this model

jadael
/

comma-v0.1-2t-GGUF