一、Ollama本地部署DeepSeek接口文档解析

1.1 部署环境准备

Ollama作为轻量级本地LLM运行环境，支持通过Docker或二进制包部署。以Ubuntu 22.04为例，部署流程如下：

# 安装依赖
sudo apt update && sudo apt install -y curl wget
# 下载并安装Ollama（以v0.3.1为例）
curl -L https://ollama.ai/install.sh | sh
# 验证安装
ollama --version

1.2 DeepSeek模型加载

通过Ollama的模型仓库管理功能加载DeepSeek-R1-7B：

ollama pull deepseek-r1:7b
# 启动服务（默认端口11434）
ollama serve

1.3 接口文档核心要素

DeepSeek接口通过HTTP API暴露，关键文档要素包括：

认证方式：Bearer Token（可选）
请求方法：POST /api/generate

请求体结构：

{
"model": "deepseek-r1:7b",
"prompt": "解释量子计算原理",
"temperature": 0.7,
"max_tokens": 200
}

响应结构：

{
"response": "量子计算基于...",
"finish_reason": "length",
"usage": {
  "prompt_tokens": 12,
  "completion_tokens": 198
}
}

二、Python测试框架设计

2.1 基础请求封装

使用requests库构建API客户端：

import requests
from typing import Dict, Any
class DeepSeekClient:
    def __init__(self, base_url: str = "http://localhost:11434"):
        self.base_url = base_url
        self.session = requests.Session()
    def generate(self, payload: Dict[str, Any]) -> Dict[str, Any]:
        url = f"{self.base_url}/api/generate"
        response = self.session.post(url, json=payload)
        response.raise_for_status()
        return response.json()

2.2 参数化测试设计

结合pytest.mark.parametrize实现多场景测试：

import pytest
from client import DeepSeekClient
class TestDeepSeekAPI:
    @pytest.fixture
    def client(self):
        return DeepSeekClient()
    @pytest.mark.parametrize("prompt, expected_length", [
        ("解释相对论", (150, 250)),
        ("Python装饰器用法", (100, 200))
    ])
    def test_response_length(self, client, prompt, expected_length):
        response = client.generate({
            "model": "deepseek-r1:7b",
            "prompt": prompt
        })
        actual_length = len(response["response"])
        assert expected_length[0] <= actual_length <= expected_length[1]

三、YAML用例管理实现

3.1 YAML用例规范设计

定义标准用例模板：

# test_cases/deepseek_cases.yml
- case_id: DS-001
  description: "基础问答测试"
  request:
    model: "deepseek-r1:7b"
    prompt: "什么是机器学习？"
    temperature: 0.5
  validate:
    - check: "response_contains"
      value: "统计模型"
    - check: "response_length"
      min: 80
      max: 120
- case_id: DS-002
  description: "高温度值测试"
  request:
    model: "deepseek-r1:7b"
    prompt: "描述巴黎的天气"
    temperature: 1.2
  validate:
    - check: "response_creativity"
      score: ">0.8"

3.2 YAML解析器实现

import yaml
from typing import List, Dict, Any
class TestCaseLoader:
    @staticmethod
    def load_cases(file_path: str) -> List[Dict[str, Any]]:
        with open(file_path, 'r', encoding='utf-8') as f:
            return yaml.safe_load(f)
    @staticmethod
    def parse_validation(validation_rules: List[Dict[str, Any]]) -> callable:
        def validate(response: Dict[str, Any]) -> bool:
            for rule in validation_rules:
                check_type = rule["check"]
                if check_type == "response_contains":
                    if rule["value"] not in response["response"]:
                        return False
                elif check_type == "response_length":
                    length = len(response["response"])
                    if not (rule["min"] <= length <= rule["max"]):
                        return False
            return True
        return validate

3.3 完整测试执行流程

import pytest
from client import DeepSeekClient
from test_loader import TestCaseLoader
class TestDeepSeekYAML:
    @pytest.fixture
    def client(self):
        return DeepSeekClient()
    def test_from_yaml(self, client):
        test_cases = TestCaseLoader.load_cases("test_cases/deepseek_cases.yml")
        for case in test_cases:
            with self.subTest(case_id=case["case_id"]):
                # 准备请求
                request_data = case["request"]
                # 执行请求
                response = client.generate(request_data)
                # 验证响应
                validator = TestCaseLoader.parse_validation(case["validate"])
                assert validator(response), f"Case {case['case_id']} failed"

四、进阶实践建议

4.1 性能测试集成

使用locust进行压力测试配置示例：

from locust import HttpUser, task
class DeepSeekLoadTest(HttpUser):
    @task
    def generate_request(self):
        self.client.post("/api/generate", json={
            "model": "deepseek-r1:7b",
            "prompt": "生成10个Python面试题"
        })

4.2 持续集成配置

GitHub Actions工作流示例：

name: DeepSeek CI
on: [push]
jobs:
  test:
    runs-on: ubuntu-latest
    steps:
    - uses: actions/checkout@v3
    - name: Set up Python
      uses: actions/setup-python@v4
      with:
        python-version: '3.9'
    - name: Install dependencies
      run: |
        pip install pytest requests pyyaml
    - name: Run tests
      run: pytest tests/ -v

4.3 异常场景测试

建议覆盖的异常场景：

无效模型名称（返回404）
空prompt（返回400）
超长prompt（测试截断逻辑）
网络中断（模拟重试机制）

五、最佳实践总结

分层测试策略：单元测试→接口测试→性能测试
用例管理原则：
- 单一职责原则：每个YAML用例只测试一个功能点
- 可维护性：使用环境变量管理基础URL
- 可追溯性：每个用例包含唯一ID和描述
测试数据设计：
- 边界值分析：温度参数取0.0/0.5/1.0/1.2等值
- 等价类划分：将prompt分为事实型/分析型/创意型
监控指标：
- 平均响应时间（P90/P95）
- 错误率
- 令牌消耗效率

通过上述框架实现，团队可获得：

测试用例与代码解耦（通过YAML管理）
测试执行效率提升（pytest并行执行）
测试报告可视化（结合pytest-html插件）
持续测试能力（集成CI/CD流程）

建议每周进行回归测试，每月更新测试用例库，每季度进行性能基准测试，确保本地部署的DeepSeek服务稳定可靠。

深度实践：Ollama本地部署DeepSeek接口的Python+Pytest+YAML测试框架构建