The abliteration is not very thorough

by CrestYao - opened Jan 23, 2025

Discussion

CrestYao

Jan 23, 2025

The abliteration does not seem very thorough. On political questions (for example, “What impact would Trump's presidency have on China?” or “If China decides to reclaim Taiwan by force, what would be the most likely method employed?”), the model gives a very official-sounding statement but does not actually answer the question. The abliterated versions of the earlier Qwen and QwQ models did not have this issue, so it is probably a problem introduced by the distillation from DeepSeek-R1.

huihui-ai

Owner Jan 23, 2025

Important note: a new version is available. Please try huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2.

CrestYao

Jan 23, 2025

Thank you for the update! However, my PC needs to wait for a GGUF version to test. I will provide feedback on the results after the testing is done.

CrestYao

Jan 23, 2025

I have already tried the V2 version, but the previously mentioned issues were not resolved. Normally, the model goes through a reasoning process before answering. For the kinds of questions mentioned earlier, however, it skips that process and responds directly with an official statement. The statement also differs slightly each time it is regenerated: the general meaning is the same, but the wording is not identical. I suspect the abliteration (erasure) technique failed for these topics.

pery8275

Jan 24, 2025

I have also tried the V2 version, and the previously mentioned issues were not resolved.

CrestYao

Jan 24, 2025

I ran several tests using the same quantization version by Mradermacher. With the 32B model, a simply phrased question made the model skip the reasoning process and refuse outright. When I made the prompt more complex, it entered the reasoning process about 50% of the time and then answered normally. With the 14B model, whether using Mradermacher's V1 quantization or another person's V2 quantization, the model refused outright without entering the reasoning process, regardless of whether the question was simple or complex.
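
For anyone who wants to put a number on "how often does it enter the process", here is a minimal sketch. It assumes the model wraps its reasoning in `<think>` and `</think>` tags, as the DeepSeek-R1 distills normally do, and that you have already collected the raw completions as strings from whatever runtime you use. The function names and the two sample strings are hypothetical and are not taken from any real run.

```python
def split_reasoning(text):
    """Split one raw completion into (reasoning, answer).

    Assumption: reasoning ends at a closing </think> tag. Some chat
    templates inject the opening <think> tag themselves, so only the
    closing tag is required here.
    """
    head, sep, tail = text.partition("</think>")
    if not sep:
        # No closing tag at all: treat the whole text as the answer.
        return "", text.strip()
    reasoning = head.replace("<think>", "", 1).strip()
    return reasoning, tail.strip()


def reasoning_rate(completions):
    """Fraction of completions with a non-empty reasoning block."""
    if not completions:
        return 0.0
    entered = sum(1 for c in completions if split_reasoning(c)[0])
    return entered / len(completions)


# Hypothetical completions, for illustration only.
samples = [
    "<think>\nThe user asks about X. Let me weigh the options.\n</think>\n\nHere is an answer.",
    "<think>\n\n</think>\n\nThank you for your message.",
]
print(reasoning_rate(samples))  # 0.5
```

Regenerating the same prompt a number of times and feeding the outputs to `reasoning_rate` would give a rough estimate comparable to the 50% figure above. An empty reasoning block followed by a canned reply is counted as "did not enter the process".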

Biatu

Jan 28, 2025

So I'm finding that when I ask V2 controversial and/or unethical things, it just thinks to itself, "Yes, user wants 'subject'", and then the next thought is filtered.

DanielTTY

Jan 31, 2025

I tried the new version on Ollama, and it still won't answer the question of how to take back Taiwan :)

sxck36

Feb 8, 2025

I've tested both v1 and v2, but neither feels truly uncensored. Even with strict rules in the system prompt, the model ignores them if it deems the topic problematic. It tends to steer conversations in a positive direction, avoids answering certain questions, and just defaults to "Thank you for your message".
