Instructions to use huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated") model = AutoModelForCausalLM.from_pretrained("huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated") messages = [ {"role": "user", "content": "Who are you?"}, ] inputs = tokenizer.apply_chat_template( messages, add_generation_prompt=True, tokenize=True, return_dict=True, return_tensors="pt", ).to(model.device) outputs = model.generate(**inputs, max_new_tokens=40) print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:])) - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated
- SGLang
How to use huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated with Docker Model Runner:
docker model run hf.co/huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated
the abliterated is not very thorough
It feels like the abliterated is not very thorough. When it comes to political issues (such as being asked “What impact would Trump's presidency have on China?” or “If China decides to reclaim Taiwan by force, what would be the most likely method employed?” etc.), a very official statement is given, but the question is not truly answered. The abliterated versions of the previous QWen or QwQ models did not have this issue. It should be a problem brought about by the distillation of the DeepSeek-R1 model.
Important Note There's a new version available, please try using the new version huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2.
Important Note There's a new version available, please try using the new version huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2.
Thank you for the update! However, my PC needs to wait for a GGUF version to test. I will provide feedback on the results after the testing is done.
Important Note There's a new version available, please try using the new version huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2.
I have already tried the V2 version, but the previously mentioned issues were not resolved. Normally, the model goes through a process before answering questions. However, for the types of issues mentioned earlier, the model would bypass the process and directly provide an official statement as a response. Moreover, this official statement is slightly different each time it is regenerated; the general meaning is the same, but the wording is not entirely identical. I suspect this might be due to the failure of the erasure technique.
I have also already tried the V2 version,the previously mentioned issues were not resolved.
I conducted several tests using the same quantization version by Mradermacher. When using the 32B model, if asking questions in a simple manner, the model would not enter the process and would directly refuse to answer. However, when I modified the prompt to make the question more complex, there was a 50% chance of successfully entering the process, and the model would eventually provide an answer normally. But when using the 14B model, whether with Mradermacher's V1 quantization version or another person's V2 quantization version, the model would refuse to answer directly, regardless of whether the questions were simple or complex, and it would not enter the process.
So I'm finding that asking it controversial and/or unethical things V2 just thinks to itself Yes, user wants "subject" and then next thought is filtered.
ollama那里试了新的版本 还是不回答要如何收复台湾的这个问题 :)
I've tested both v1 and v2, but neither feels truly uncensored. Even with strict rules in the system prompt, the model ignores them if it deems the topic problematic. It tends to steer conversations in a positive direction and avoids answering certain questions and just defaults to "Thank you for your message".