Commit Graph

6 Commits (22fbaefbd461cf536b6c6b8848e7fffe4d665320)

Author SHA1 Message Date
Meng Zhang 22fbaefbd4
feat: add deployment script on lambda cloud with skypilot (#37)
* feat: add deployment script on lambda cloud with skypilot

* docs: adjust API documentation level

* fix: move docker-compose install before the docker-compose pull

* Fix documentation

* Update replica updating
2023-04-02 20:32:49 +08:00
Meng Zhang 0a66a9d498 feat: support MODEL_REPLICA to control number of model instances in triton 2023-04-02 13:19:09 +08:00
Meng Zhang 20801bbe8c
Cleanup environment variable (#30)
* Remove EVENTS_LOG_DIR

* Rename supervisord.sh -> tabby.sh
2023-03-29 16:33:00 +08:00
Meng Zhang bf7d149a27
Add supervisord to support a single docker run deployment (#29)
* Add suppervisord in dockerfile

* Create supervisord

* Update README.md

* Update README.md
2023-03-29 12:57:03 +08:00
Meng Zhang 9d92821cf5
Add gptj converter (#19)
* Rename deployment-next to development

* Add GPTJ converter
2023-03-27 11:12:52 +08:00
Meng Zhang b622bd6762
use TabbyML/NeoX-70M for minimal e2e deployment (#10)
* use TabbyML/NeoX-70M for minimal e2e deployment

* Use python3 of triton image
2023-03-25 17:39:40 +08:00