Commit Graph

3 Commits (0a66a9d498e232407850537f1d18142046bd66d4)

Author SHA1 Message Date
Meng Zhang 0a66a9d498 feat: support MODEL_REPLICA to control number of model instances in triton 2023-04-02 13:19:09 +08:00
Meng Zhang 9d92821cf5
Add gptj converter (#19)
* Rename deployment-next to development

* Add GPTJ converter
2023-03-27 11:12:52 +08:00
Meng Zhang b622bd6762
use TabbyML/NeoX-70M for minimal e2e deployment (#10)
* use TabbyML/NeoX-70M for minimal e2e deployment

* Use python3 of triton image
2023-03-25 17:39:40 +08:00