DeepSeek R1: Local Deployment and API Integration in Practice | 极客日志


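AI-generated summary: DeepSeek R1 is a high-performance reasoning model developed in China that is open source and licensed for commercial use. This article covers its core strengths and benchmark results, walks through local deployment with Ollama (installation, model download, and running the model), and provides code examples for access through Open Web UI and for integrating the local and cloud APIs into a Python project, so that developers can get started quickly.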

Author: 橘子海 | Published 2025/2/6 | Updated 2026/6/219 views

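1. Background

On January 20, 2025, Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd. (杭州深度求索人工智能基础技术研究有限公司) released DeepSeek R1, a high-performance AI reasoning model. The model performs strongly on math, code, and natural-language reasoning tasks, with performance comparable to the official release of OpenAI's o1. Regarded as a major advance for AI developed in China, DeepSeek R1 is available to developers worldwide as open source under the MIT License, which permits free commercial use.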

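2. Core Strengths of DeepSeek R1

Reasoning driven by reinforcement learning: R1 applies large-scale reinforcement learning (RL) in the post-training stage. This substantially improves reasoning ability without requiring large amounts of supervised fine-tuning data, which lowers training cost.

Long chain-of-thought reasoning and model distillation: R1 uses long Chain of Thought reasoning to break complex problems down step by step. It also supports model distillation, so R1's reasoning ability can be transferred to smaller models for scenarios that need low latency or low cost.

Open source with a flexible license: R1 is released under the MIT License, which allows free use, modification, and commercial use, and helps spread and advance AI technology.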

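3. Benchmark Comparison

3.1 DeepSeek-R1-Evaluation

For all models, the maximum generation length is set to 32,768 tokens. For benchmarks that require sampling, we use a temperature of 0.6, a top-p value of 0.95, and generate 64 responses per query to estimate pass@1.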
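To make the sampling-based estimate concrete, here is a minimal sketch of how pass@1 can be computed for a single query from k sampled responses: it is the fraction of samples judged correct. The grading results below are hypothetical and only illustrate the arithmetic; a benchmark score would then average this value over all queries.

```python
from statistics import mean


def pass_at_1(correct_flags):
    """Estimate pass@1 as the mean correctness over the sampled responses."""
    if not correct_flags:
        raise ValueError("need at least one sampled response")
    return mean(1.0 if flag else 0.0 for flag in correct_flags)


# Hypothetical grading results for one query: 64 samples, 48 judged correct.
flags = [True] * 48 + [False] * 16
score = pass_at_1(flags)
print(f"pass@1 = {score:.2f}")  # 48 / 64 = 0.75
```

Averaging over many samples rather than grading one greedy output reduces the variance of the estimate, which is presumably why 64 responses per query are generated.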

Category	Benchmark (Metric)	Claude-3.5-Sonnet-1022	GPT-4o 0513	DeepSeek V3	OpenAI o1-mini	OpenAI o1-1217	DeepSeek R1
	Architecture	-	-	MoE	-	-	MoE
	# Activated Params	-	-	37B	-	-	37B
	# Total Params	-	-	671B	-	-	671B
English	MMLU (Pass@1)	88.3	87.2	88.5	85.2	91.8	90.8
	MMLU-Redux (EM)	88.9	88.0	89.1	86.7	-	92.9
	MMLU-Pro (EM)	78.0	72.6	75.9	80.3	-	84.0
	DROP (3-shot F1)	88.3	83.7	91.6	83.9	90.2	92.2


3.2 Distilled Model Evaluation

Model	AIME 2024 pass@1	AIME 2024 cons@64	MATH-500 pass@1	GPQA Diamond pass@1	LiveCodeBench pass@1	CodeForces rating
GPT-4o-0513	9.3	13.4	74.6	49.9	32.9	759
Claude-3.5-Sonnet-1022	16.0	26.7	78.3	65.0	38.9	717
o1-mini	63.6	80.0	90.0	60.0	53.8	1820
QwQ-32B-Preview	44.0	60.0	90.6	54.5	41.9	1316
DeepSeek-R1-Distill-Qwen-1.5B	28.9	52.7	83.9	33.8	16.9	954
DeepSeek-R1-Distill-Qwen-7B	55.5	83.3	92.8	49.1	37.6	1189
DeepSeek-R1-Distill-Qwen-14B	69.7	80.0	93.9	59.1	53.1	1481
DeepSeek-R1-Distill-Qwen-32B	72.6	83.3	94.3	62.1	57.2	1691
DeepSeek-R1-Distill-Llama-8B	50.4	80.0	89.1	49.0	39.6	1205
DeepSeek-R1-Distill-Llama-70B	70.0	86.7	94.5	65.2	57.5	1633
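The cons@64 column reports a consensus (majority-vote) result over 64 samples. The sketch below shows one way such a metric can be computed for a single query; the function names and the sample answers are hypothetical, and real evaluation harnesses normalize answers before comparing them.

```python
from collections import Counter


def consensus_answer(answers):
    """Return the most frequent final answer among the samples."""
    if not answers:
        raise ValueError("need at least one sampled answer")
    return Counter(answers).most_common(1)[0][0]


def cons_at_k(answers, reference):
    """Score 1.0 if the majority-vote answer matches the reference, else 0.0."""
    return 1.0 if consensus_answer(answers) == reference else 0.0


# Hypothetical final answers extracted from 64 samples for one problem.
samples = ["204"] * 30 + ["210"] * 20 + ["198"] * 14
print(consensus_answer(samples))   # the most common answer
print(cons_at_k(samples, "204"))   # 1.0 when the consensus is correct
```

In this made-up case only 30 of 64 samples are correct, so pass@1 would be below 0.5, while the consensus answer is still right. That is consistent with cons@64 usually being higher than pass@1 in the table.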

4. Local Deployment of the DeepSeek R1 Model

4.3 Install Ollama

curl -fsSL https://ollama.com/install.sh | sh

4.4 Verify the Ollama Installation

ollama --version
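Beyond the CLI version check, it can be useful to confirm from Python that the Ollama server is running and to see which models are already downloaded. The sketch below assumes Ollama is listening on its default address, `http://localhost:11434`, and uses its `/api/tags` endpoint, which lists local models. The helper names are hypothetical.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434"  # assumed default; adjust if yours differs


def model_names(tags_payload: dict) -> list[str]:
    """Extract model names from the JSON body returned by /api/tags."""
    return [model["name"] for model in tags_payload.get("models", [])]


def list_local_models(base_url: str = OLLAMA_URL, timeout: float = 5.0) -> list[str]:
    """Query the local Ollama server; raises OSError if it is unreachable."""
    with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
        return model_names(json.load(resp))
```

Calling `list_local_models()` with the server running returns a list such as the tags you have pulled. If the call raises an `OSError` (for example, connection refused), the server is probably not started.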

4.5 Download DeepSeek R1

# Downloads the default model on first use, then starts an interactive session
ollama run deepseek-r1

4.6 Run DeepSeek R1

# Default 7B model (4.7GB - ideal for consumer GPUs)
ollama run deepseek-r1

# Larger 70B model (requires 24GB+ VRAM)
ollama run deepseek-r1:70b

# Full DeepSeek-R1 (requires 336GB+ VRAM for 4-bit quantization)
ollama run deepseek-r1:671b
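The 336GB figure for the full model follows from simple arithmetic: parameter count times bits per weight, divided by 8 to get bytes. The sketch below computes this lower bound for the weights alone. It is a rough estimate under the assumption of uniform 4-bit quantization, and it ignores the KV cache, activations, and runtime overhead, so real requirements are higher.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Parameter counts implied by the model tags above.
for tag, params in [("7b", 7e9), ("70b", 70e9), ("671b", 671e9)]:
    size = weight_memory_gb(params, 4)
    print(f"deepseek-r1:{tag}: about {size:.1f} GB of weights at 4-bit")
```

For the 671B model this gives 335.5 GB, which rounds to the 336GB quoted in the comment above.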

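4.7 Set Up Open Web UI (Private Interface)

# Serves Open WebUI on http://localhost:3000 and lets the container reach Ollama on the host
docker run -d -p 3000:8080 \
    --add-host=host.docker.internal:host-gateway \
    -v open-webui:/app/backend/data \
    --name open-webui \
    --restart always \
    ghcr.io/open-webui/open-webui:main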

6. Integrating DeepSeek-R1 into Your Project

6.1 Local Deployment (Privacy First)

import openai

# Connect to your local Ollama instance through its OpenAI-compatible endpoint
client = openai.OpenAI(
    base_url="http://localhost:11434/v1",
    api_key="ollama"  # Placeholder: the client requires a key, but local access needs no authentication
)

response = client.chat.completions.create(
    model="deepseek-r1:XXb",  # Replace "XX" with the size of the distilled model you chose
    messages=[{"role": "user", "content": "Explain blockchain security"}],
    temperature=0.7,  # Controls creativity vs precision
    stream=True
)

# Print the reply incrementally as chunks arrive
for chunk in response:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
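When the model is served locally, the reply text often contains the model's reasoning followed by the final answer. The sketch below separates the two, assuming the reasoning is wrapped in `<think>...</think>` tags, as R1-family models commonly emit. That format is an assumption to verify against your own output, and the helper name is hypothetical.

```python
import re

# Non-greedy match so only the first <think> block is captured.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (reasoning, answer); reasoning is empty if no <think> block exists."""
    match = THINK_RE.search(text)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


sample = "<think>\n2 + 2 is 4.\n</think>\n\nThe answer is 4."
reasoning, answer = split_reasoning(sample)
print(answer)
```

With a streamed response, collect the chunks into one string first and split once the stream has finished, since a tag may be cut across chunk boundaries.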

6.2 Using the Official DeepSeek-R1 Cloud API

import os

import openai
from dotenv import load_dotenv

# Load DEEPSEEK_API_KEY from a local .env file, if one exists
load_dotenv()

client = openai.OpenAI(
    base_url="https://api.deepseek.com/v1",
    api_key=os.getenv("DEEPSEEK_API_KEY")
)

response = client.chat.completions.create(
    model="deepseek-reasoner",
    messages=[{"role": "user", "content": "Write web scraping code with error handling"}],
    max_tokens=1000,  # Limit costs for long responses
    temperature=0.7
)

print(response.choices[0].message.content)
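With the cloud API, the reasoning trace may arrive in a separate message field rather than inline. The sketch below reads it defensively. It assumes the field is named `reasoning_content`, as DeepSeek's API documentation describes for `deepseek-reasoner`, so check this against the current docs. The helper name and the stand-in message object are hypothetical, used here so the example runs without a network call.

```python
from types import SimpleNamespace


def extract_reasoning_and_answer(message) -> tuple[str, str]:
    """Return (reasoning, answer), tolerating a missing reasoning field."""
    reasoning = getattr(message, "reasoning_content", None) or ""
    answer = getattr(message, "content", None) or ""
    return reasoning, answer


# Stand-in for response.choices[0].message, so no API key is needed here.
fake_message = SimpleNamespace(
    reasoning_content="The user wants retries and timeouts.",
    content="Use requests with a timeout and a retry loop.",
)
reasoning, answer = extract_reasoning_and_answer(fake_message)
print(answer)
```

In real use you would pass `response.choices[0].message` from the call above. Using `getattr` with a default keeps the same helper working against endpoints that return no reasoning field.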

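DeepSeek R1: Local Deployment and API Integration in Practice

1. Background

2. Core Strengths of DeepSeek R1

3. Benchmark Comparison

3.1 DeepSeek-R1-Evaluation

3.2 Distilled Model Evaluation

4. Local Deployment of the DeepSeek R1 Model

4.1 Hardware and Environment Requirements

4.2 Recommended Tools

4.3 Install Ollama

4.4 Verify the Ollama Installation

4.5 Download DeepSeek R1

4.6 Run DeepSeek R1

4.7 Set Up Open Web UI (Private Interface)

5. Trying Out DeepSeek R1

6. Integrating DeepSeek-R1 into Your Project

6.1 Local Deployment (Privacy First)

6.2 Using the Official DeepSeek-R1 Cloud API

7. Summary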
