tabby/deployment/docker-compose.yml

version: '3.3'

services:
  init:
    image: tabbyml/tabby
    container_name: tabby-init
    command: python -m tabby.tools.model_preload --repo_id ${MODEL_NAME}
    volumes:
      - ${HF_VOLUME}

  server:
    image: tabbyml/tabby
    container_name: tabby-server
    command: uvicorn tabby.server:app --host 0.0.0.0 --port 5000
    environment:
      MODEL_NAME: ${MODEL_NAME}
      MODEL_BACKEND: triton
    ports:
      - "5000:5000"
    volumes:
      - ${HF_VOLUME}
    depends_on:
      init:
        condition: service_completed_successfully
      triton:
        condition: service_healthy

  admin:
    image: tabbyml/tabby
    container_name: tabby-admin
    command: streamlit run tabby/admin/Home.py
    ports:
      - "8501:8501"

  triton:
    image: tabbyml/fastertransformer_backend
    container_name: tabby-triton
    command: /scripts/triton.sh
    shm_size: 1gb
    volumes:
      - ./scripts:/scripts
      - ${HF_VOLUME}
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    environment:
      MODEL_NAME: ${MODEL_NAME}
    depends_on:
      init:
        condition: service_completed_successfully
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8002/metrics"]
      interval: 2s
      timeout: 2s
      start_period: 120s
Add admin panel (w/ streamlit) and logging (w/ vectordev) (#4) 2023-03-22 15:18:12 +00:00			`version: '3.3'`

			`services:`
use TabbyML/NeoX-70M for minimal e2e deployment (#10) * use TabbyML/NeoX-70M for minimal e2e deployment * Use python3 of triton image 2023-03-25 09:39:40 +00:00			`init:`
			`image: tabbyml/tabby`
			`container_name: tabby-init`
Extract environment variable 2023-03-27 05:41:22 +00:00			`command: python -m tabby.tools.model_preload --repo_id ${MODEL_NAME}`
use TabbyML/NeoX-70M for minimal e2e deployment (#10) * use TabbyML/NeoX-70M for minimal e2e deployment * Use python3 of triton image 2023-03-25 09:39:40 +00:00			`volumes:`
Extract environment variable 2023-03-27 05:41:22 +00:00			`- ${HF_VOLUME}`
use TabbyML/NeoX-70M for minimal e2e deployment (#10) * use TabbyML/NeoX-70M for minimal e2e deployment * Use python3 of triton image 2023-03-25 09:39:40 +00:00
Add admin panel (w/ streamlit) and logging (w/ vectordev) (#4) 2023-03-22 15:18:12 +00:00			`server:`
			`image: tabbyml/tabby`
			`container_name: tabby-server`
Move python code under tabby/ (#8) * Add tabby config file * Rename train.yaml to trainer.yaml * Change server to relative import * Move source files into tabby * Rename conf 2023-03-25 04:20:29 +00:00			`command: uvicorn tabby.server:app --host 0.0.0.0 --port 5000`
Add admin panel (w/ streamlit) and logging (w/ vectordev) (#4) 2023-03-22 15:18:12 +00:00			`environment:`
Extract environment variable 2023-03-27 05:41:22 +00:00			`MODEL_NAME: ${MODEL_NAME}`
			`MODEL_BACKEND: triton`
Add admin panel (w/ streamlit) and logging (w/ vectordev) (#4) 2023-03-22 15:18:12 +00:00			`ports:`
			`- "5000:5000"`
			`volumes:`
Extract environment variable 2023-03-27 05:41:22 +00:00			`- ${HF_VOLUME}`
use TabbyML/NeoX-70M for minimal e2e deployment (#10) * use TabbyML/NeoX-70M for minimal e2e deployment * Use python3 of triton image 2023-03-25 09:39:40 +00:00			`depends_on:`
			`init:`
			`condition: service_completed_successfully`
Prepare public release with a minimal deployment setup (#16) * Move deployment to deployment-next * Add deployment setup * Update deployment-next * Remove vector label * update README.md 2023-03-26 14:44:15 +00:00			`triton:`
			`condition: service_healthy`
Add admin panel (w/ streamlit) and logging (w/ vectordev) (#4) 2023-03-22 15:18:12 +00:00
			`admin:`
			`image: tabbyml/tabby`
			`container_name: tabby-admin`
Move python code under tabby/ (#8) * Add tabby config file * Rename train.yaml to trainer.yaml * Change server to relative import * Move source files into tabby * Rename conf 2023-03-25 04:20:29 +00:00			`command: streamlit run tabby/admin/Home.py`
Add admin panel (w/ streamlit) and logging (w/ vectordev) (#4) 2023-03-22 15:18:12 +00:00			`ports:`
			`- "8501:8501"`

Prepare public release with a minimal deployment setup (#16) * Move deployment to deployment-next * Add deployment setup * Update deployment-next * Remove vector label * update README.md 2023-03-26 14:44:15 +00:00			`triton:`
			`image: tabbyml/fastertransformer_backend`
			`container_name: tabby-triton`
			`command: /scripts/triton.sh`
			`shm_size: 1gb`
Add admin panel (w/ streamlit) and logging (w/ vectordev) (#4) 2023-03-22 15:18:12 +00:00			`volumes:`
Prepare public release with a minimal deployment setup (#16) * Move deployment to deployment-next * Add deployment setup * Update deployment-next * Remove vector label * update README.md 2023-03-26 14:44:15 +00:00			`- ./scripts:/scripts`
Extract environment variable 2023-03-27 05:41:22 +00:00			`- ${HF_VOLUME}`
Prepare public release with a minimal deployment setup (#16) * Move deployment to deployment-next * Add deployment setup * Update deployment-next * Remove vector label * update README.md 2023-03-26 14:44:15 +00:00			`deploy:`
			`resources:`
			`reservations:`
			`devices:`
			`- driver: nvidia`
			`count: all`
			`capabilities: [gpu]`
Add tabby.tools.repository.updater for easier git repository synchronization. (#9) * Move dags to tabby.tasks * Add repository syncer * Follow redirect for curl 2023-03-25 06:44:46 +00:00			`environment:`
Extract environment variable 2023-03-27 05:41:22 +00:00			`MODEL_NAME: ${MODEL_NAME}`
Prepare public release with a minimal deployment setup (#16) * Move deployment to deployment-next * Add deployment setup * Update deployment-next * Remove vector label * update README.md 2023-03-26 14:44:15 +00:00			`depends_on:`
			`init:`
			`condition: service_completed_successfully`
			`healthcheck:`
			`test: ["CMD", "curl", "-f", "http://localhost:8002/metrics"]`
			`interval: 2s`
			`timeout: 2s`
			`start_period: 120s`