Meng Zhang
db77d7f267
feat: support single container (#46)
* docs: update readme
* fix: do not exclude peft
* Free disk space before docker building
* fix: fix docker-compose
* fix: dockercompose user to 1000
* fix dockerfile
* fix: cachedir ownership
2023-04-05 20:19:43 +08:00
Meng Zhang
22fbaefbd4
feat: add deployment script on lambda cloud with skypilot (#37)
* feat: add deployment script on lambda cloud with skypilot
* docs: adjust API documentation level
* fix: move docker-compose install before the docker-compose pull
* Fix documentation
* Update replica updating
2023-04-02 20:32:49 +08:00
Meng Zhang
0a66a9d498
feat: support MODEL_REPLICA to control number of model instances in triton
2023-04-02 13:19:09 +08:00
Meng Zhang
9d92821cf5
Add gptj converter (#19)
* Rename deployment-next to development
* Add GPTJ converter
2023-03-27 11:12:52 +08:00
Meng Zhang
b622bd6762
use TabbyML/NeoX-70M for minimal e2e deployment (#10)
* use TabbyML/NeoX-70M for minimal e2e deployment
* Use python3 of triton image
2023-03-25 17:39:40 +08:00