How to Quickly Deploy PrivateGPT? Building an Enterprise-Grade Private LLM

git clone git@github.com:menloparklab/privateGPT-app.git
cd privateGPT-app/

wget https://gpt4all.io/models/ggml-gpt4all-j-v1.3-groovy.bin
mv ggml-gpt4all-j-v1.3-groovy.bin models/

pip install -r requirements.txt

pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/

ERROR: Failed building wheel for llama-cpp-python
...
An error occurred while building with CMake.

yum list installed | grep devtoolset-11

sudo yum install -y devtoolset-11

scl enable devtoolset-11 bash

scl enable devtoolset-11 -

# Start in the foreground so the logs are visible in the console
gunicorn app:app -k uvicorn.workers.UvicornWorker --timeout 1500

# Start as a background process for production deployment
nohup gunicorn app:app -k uvicorn.workers.UvicornWorker --timeout 1500 --bind=0.0.0.0:14800 > privateGPT-backend.log 2>&1 &
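
Once the backend has been started in the background, it may not answer requests immediately. The following is a minimal sketch of a readiness poll, not part of privateGPT-app: `http_ok` and `wait_until_ready` are hypothetical helper names, and the attempt count and delay are arbitrary assumptions. The port in the usage comment is taken from the `--bind` value of the background command above.

```python
import time
import urllib.error
import urllib.request


def http_ok(url, timeout=5):
    """Return True if a GET request to `url` answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or a non-2xx status: not ready yet.
        return False


def wait_until_ready(probe, attempts=30, delay=2.0):
    """Call `probe()` until it returns True.

    Returns the 1-based attempt number that succeeded, or None if every
    attempt failed.
    """
    for attempt in range(1, attempts + 1):
        if probe():
            return attempt
        if attempt < attempts:
            time.sleep(delay)
    return None


# Example usage (assumes the backend is bound to port 14800):
#   tries = wait_until_ready(lambda: http_ok("http://localhost:14800/"))
#   print("ready" if tries else "backend did not answer")
```

Passing the probe as a callable keeps the retry loop independent of how readiness is checked, so the same loop can be reused for the Streamlit frontend.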

File "ingest.py", line 52
    def load_single_document(file_path: str) -> Document:
                                      ^
SyntaxError: invalid syntax
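
The caret under the `file_path: str` annotation most likely means the file is being parsed by an interpreter that predates function annotations (Python 2), rather than the Python 3.11 this guide installs. A minimal sketch for checking which interpreter is actually running; `meets_minimum` is a hypothetical helper, and the `(3, 11)` default is an assumption based on the Python 3.11 requirement in this guide. It avoids annotations and f-strings on purpose so that it also runs on an old interpreter.

```python
import sys


def meets_minimum(version_info=None, minimum=(3, 11)):
    """Return True if the given (or running) interpreter is at least `minimum`."""
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor); the patch level does not matter here.
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    print("interpreter: %s" % sys.executable)
    print("version: %s" % sys.version.split()[0])
    print("meets 3.11 requirement: %s" % meets_minimum())
```

Running this with the same `python` that launches gunicorn shows whether the virtual environment's interpreter or the system one is being picked up.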

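# gunicorn's --pythonpath only adds directories to the import path; to select the interpreter, run gunicorn as a module with the virtual environment's Python
/opt/gptlabs/privateGPT/myenv/bin/python3.11 -m gunicorn app:app -k uvicorn.workers.UvicornWorker --timeout 1500 --bind=0.0.0.0:8000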

ModuleNotFoundError: No module named '_sqlite3'

python -c "import sqlite3; print(sqlite3.sqlite_version)"
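
When the `_sqlite3` extension is missing, the one-liner above fails with the same `ModuleNotFoundError`. A small sketch that reports the situation instead of raising; `sqlite_status` is a hypothetical helper name and is not part of privateGPT-app.

```python
import importlib.util


def sqlite_status():
    """Report whether the _sqlite3 C extension is available to this interpreter."""
    if importlib.util.find_spec("_sqlite3") is None:
        # The extension was not built. Python has to be recompiled with the
        # SQLite development headers installed (on yum-based systems this is
        # typically the sqlite-devel package).
        return {"available": False, "version": None}
    import sqlite3
    return {"available": True, "version": sqlite3.sqlite_version}


if __name__ == "__main__":
    print(sqlite_status())
```

Running the check again after rebuilding Python confirms that the rebuild picked up SQLite before the backend is restarted.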

tar -czvf python_backup.tar.gz /usr/local/bin/python3.11

LDFLAGS="${LDFLAGS} -Wl,-rpath=/usr/local/openssl/lib" ./configure --with-openssl=/usr/local/openssl --prefix=/usr/local/python3.11 --enable-loadable-sqlite-extensions
make
make install

# Start in the foreground so the logs are visible in the console
streamlit run streamlit_app.py --server.address 0.0.0.0 --logger.level=debug

# Start as a background process for production deployment
nohup streamlit run streamlit_app.py --server.address 0.0.0.0 > privateGPT-frontend.log 2>&1 &

curl -X GET http://localhost:8000/

import requests
response = requests.get("http://localhost:8000/")
print(response.json())

curl -X POST -F "files=@file1.txt" -F "files=@file2.txt" -F "collection_name=my_collection" http://localhost:8000/embed

import requests
# Open both files in a context manager so the handles are closed after the upload
with open("file1.txt", "rb") as f1, open("file2.txt", "rb") as f2:
    files = [("files", f1), ("files", f2)]
    data = {"collection_name": "my_collection"}
    response = requests.post("http://localhost:8000/embed", files=files, data=data)
print(response.json())

curl -X POST -H "Content-Type: application/json" -d '{"query": "sample query", "collection_name": "my_collection"}' http://localhost:8000/retrieve

import requests
data = {"query": "sample query", "collection_name": "my_collection"}
response = requests.post("http://localhost:8000/retrieve", json=data)
print(response.json())
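
The retrieval call can also be made without the third-party `requests` package. A minimal sketch using only the standard library, assuming the `/retrieve` endpoint and the JSON fields shown above; `build_retrieve_request` and `retrieve` are hypothetical helper names, the response is assumed to be JSON (as the `response.json()` call implies), and the default timeout simply mirrors the gunicorn `--timeout 1500` setting.

```python
import json
import urllib.request


def build_retrieve_request(query, collection_name, base_url="http://localhost:8000"):
    """Build (but do not send) the POST request for the /retrieve endpoint."""
    payload = json.dumps({"query": query, "collection_name": collection_name})
    return urllib.request.Request(
        base_url.rstrip("/") + "/retrieve",
        data=payload.encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def retrieve(query, collection_name, base_url="http://localhost:8000", timeout=1500):
    """Send the query and return the decoded JSON body."""
    request = build_retrieve_request(query, collection_name, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (requires a running backend):
#   print(retrieve("sample query", "my_collection"))
```

Separating request construction from sending makes it possible to inspect the URL, headers, and body without a running server.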

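Preface

1. Why Do We Need PrivateGPT?

1.1 Prerequisites

1.2 Use Cases

1.3 Flowchart of a Traditional Text Chat Application

1.4 Flowchart of a Decoupled Text Chat Application

2. How to Quickly Deploy PrivateGPT

3. Remote Deployment on Railway

3.1 Clone the privateGPT-app Project

3.2 Deploy the privateGPT-app Project

4. Local Server Deployment

4.1 Environment Requirements

4.2 Install Python

4.2.1 Install Python 3.11

4.2.2 Create a Python Virtual Environment

4.3 Install privateGPT-app

4.3.1 Download the Source Code

4.3.2 Configure Environment Variables

4.3.3 Download the LLM Model

4.3.4 Install Project Dependencies

4.3.5 Installation Error Analysis

1) A large package times out during installation because of network problems

2) Building the llama-cpp-python module fails

4.4 Run the FastAPI Backend

4.4.1 Startup fails with SyntaxError: invalid syntax

4.4.2 Startup fails with ModuleNotFoundError: No module named '_sqlite3'

4.5 Run the Streamlit Application

4.6 Access PrivateGPT

4.6.1 Upload Documents

4.6.2 Vector Computation

4.6.3 Text Retrieval

5. Important Notes

5.1 Supported Document Extensions

6. Backend API Endpoints

6.1 Root Route

6.2 Document Embedding Endpoint

6.3 Data Retrieval Endpoint

7. Security and Performance Recommendations

7.1 Network Security

7.2 Performance Tuning

7.3 Monitoring and Maintenance

8. Summary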
