Commit Graph

4 Commits (7c5d1d26bde51a95fe757e6bb3b1d4780bfcce1d)

Author SHA1 Message Date
Meng Zhang 22fbaefbd4
feat: add deployment script on lambda cloud with skypilot (#37)
* feat: add deployment script on lambda cloud with skypilot

* docs: adjust API documentation level

* fix: move docker-compose install before the docker-compose pull

* Fix documentation

* Update replica updating
2023-04-02 20:32:49 +08:00
Meng Zhang 0a66a9d498 feat: support MODEL_REPLICA to control number of model instances in triton 2023-04-02 13:19:09 +08:00
Meng Zhang 9d92821cf5
Add gptj converter (#19)
* Rename deployment-next to development

* Add GPTJ converter
2023-03-27 11:12:52 +08:00
Meng Zhang b622bd6762
use TabbyML/NeoX-70M for minimal e2e deployment (#10)
* use TabbyML/NeoX-70M for minimal e2e deployment

* Use python3 of triton image
2023-03-25 17:39:40 +08:00