A Bit of Chit-Chat
A roundup of some excellent projects.
Ollama: a language model service for local deployment
```yaml
version: '3.8'

services:
  ollama:
    image: ollama/ollama
    container_name: ollama
    ports:
      - "11434:11434"                  # Ollama HTTP API
    volumes:
      - /sshfs/ollama:/root/.ollama    # persist downloaded models on the host
    environment:
      - CUDA_VISIBLE_DEVICES=0,1       # expose GPUs 0 and 1 to the server
      # No quotes here: in list form they would become part of the value.
      - OLLAMA_KEEP_ALIVE=5m           # keep a model loaded for 5 minutes after its last request
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    restart: unless-stopped
```
Concurrency is a major weakness: https://github.com/ollama/ollama/issues/358
Improvements are in progress: https://github.com/ollama/ollama/pull/3418
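As a rough sketch of what tuning concurrency can look like, newer Ollama releases read settings such as `OLLAMA_NUM_PARALLEL` and `OLLAMA_MAX_LOADED_MODELS` from the environment. The fragment below assumes an Ollama version that already supports them; the values 4 and 2 are illustrative assumptions, not recommendations from the linked issue or pull request.

```yaml
services:
  ollama:
    environment:
      - OLLAMA_NUM_PARALLEL=4        # parallel requests per loaded model (assumed value)
      - OLLAMA_MAX_LOADED_MODELS=2   # models kept in memory at the same time (assumed value)
```

Higher values trade GPU memory for throughput, so they are best adjusted against the memory actually available on the cards.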
Open-WebUI: https://github.com/open-webui/open-webui
```yaml
version: '3.8'
services:
  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    container_name: open-webui
    restart: always
    ports:
      - "3000:8080"                    # web UI served on host port 3000
    environment:
      # Mapping form (KEY: value); a list of "- KEY: value" items is not valid Compose syntax.
      OLLAMA_BASE_URL: ${OLLAMA_BASE_URL}
      RAG_EMBEDDING_MODEL: "all-MiniLM-L6-v2"
      RAG_EMBEDDING_MODEL_DIR: "/app/backend/data/cache/embedding/models/"
      HF_ENDPOINT: https://hf-mirror.com   # Hugging Face mirror for model downloads
    volumes:
      - ./open-webui:/app/backend/data
```
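If both services live in the same Compose project, `OLLAMA_BASE_URL` can point at the Ollama container by its service name instead of coming from the shell or an `.env` file. This is a minimal sketch under that assumption; the `depends_on` entry and the trimmed-down service definitions are illustrative, not part of the original setup.

```yaml
services:
  ollama:
    image: ollama/ollama
    volumes:
      - /sshfs/ollama:/root/.ollama
  open-webui:
    image: ghcr.io/open-webui/open-webui:main
    depends_on:
      - ollama                               # start Ollama first (assumed ordering)
    ports:
      - "3000:8080"
    environment:
      OLLAMA_BASE_URL: http://ollama:11434   # service name resolves on the Compose network
    volumes:
      - ./open-webui:/app/backend/data
```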
LiteLLM: https://github.com/BerriAI/litellm
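To hint at how LiteLLM fits with the services above, here is a minimal sketch of a LiteLLM proxy config that routes requests to Ollama. The model name `llama3` and the `api_base` host are assumptions; use a model that has actually been pulled and an address at which Ollama is reachable from the proxy.

```yaml
model_list:
  - model_name: llama3                    # name clients will request (assumed)
    litellm_params:
      model: ollama/llama3                # "ollama/" prefix selects the Ollama provider
      api_base: http://localhost:11434    # Ollama API address (assumed)
```

The proxy can then be started with `litellm --config config.yaml`, after which clients talk to it through an OpenAI-compatible API.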