Meng Zhang
6718afbf67
fix: server should still support prompt only use case
2023-06-10 00:39:32 -07:00
Meng Zhang
eecdde0269
fix: vocabulary.txt / shared_vocabulary.txt should be optional ( #225 )
2023-06-09 06:49:09 +00:00
Meng Zhang
40460885b0
docs: improve website theming
2023-06-07 11:17:00 -07:00
Meng Zhang
1aaf29c968
docs: switch openapi docs ( #215 )
...
* update openapi
* update
* fix: shared_vocabulary is not a required file
* docs: improve docs
2023-06-07 01:58:05 -07:00
Meng Zhang
4cb672ec39
feat: improve error handling and messages [TAB-58] ( #213 )
...
* add fatal macro
* switch expect to fatal
* improve error handling of serve
* improve error handling on download module
* improve error handling in scheduler
* improve error handling
* fmt
* fmt
2023-06-07 02:02:58 +00:00
Meng Zhang
fd1baff8d5
feat: support stop sequences [TAB-52] ( #212 )
...
* refactor: pass step and string token to callback
* add token to callback
* add stop regexp
* implement stop words logic
* pass token_ids from inference
* improve effiency of regexp match with reversed regex
* fmt
* add typescript and javascript stop words
* add cache for stop words regexp
2023-06-06 23:28:58 +00:00
Meng Zhang
007a40c582
feat: support early stop [TAB-51] ( #208 )
...
* bump ctranslate2 to v3.15.0
* enable early stop
* support early stop
2023-06-06 12:46:17 +00:00
Meng Zhang
249d51d0f5
feat: add indexer [TAB-17] ( #199 )
...
* add basic indexer
* formatting
2023-06-05 22:18:10 +00:00
Meng Zhang
272dde9769
refactor: rust nightly format ( #197 )
...
* chore: turn on group format
* turn on nightly fmt
2023-06-05 14:17:07 -07:00
Meng Zhang
f4442b104f
docs: usage string for scheduler
2023-06-05 12:57:18 -07:00
Meng Zhang
e8a33312bb
refactor: extract download into tabby-download ( #195 )
...
* refactor: extract download into tabby-download
* remove unused deps
2023-06-05 18:40:24 +00:00
Meng Zhang
e8b1c10738
feat: add `tabby scheduler` command ( #194 )
...
* feat: add `tabby scheduler` command
* update test cases
* fix fmt
2023-06-05 18:29:38 +00:00
Meng Zhang
da02d471a9
chore: add tabby-scheduler [TAB-17] ( #192 )
...
* add scheduler
* update fmt
* add integration tests for scheduler
2023-06-05 03:08:43 +00:00
Meng Zhang
2bf5bcd0cf
refactor: extract TextInferenceEngineImpl to reduce duplications between EncoderDecoderImpl and DecoderImpl #189
2023-06-04 22:28:39 +00:00
Meng Zhang
6de61f45bb
chore: mark thread safety [TAB-52] ( #186 )
...
* mark thread safety
* use shared_ptr to ensure thread safety
* fmt
2023-06-04 06:23:31 +00:00
Meng Zhang
775576b53e
fix: only use prompt_template when suffix presents [TAB-46] ( #184 )
...
* fix: only use prompt_template when suffix presents
* lint
2023-06-03 17:29:04 +00:00
Meng Zhang
2779da3cba
feat: supports FIM inference [TAB-46] ( #183 )
...
* Add prefix / suffix
* update
* feat: support segments in inference
* chore: add tabby.json in model repository to store prompt_template
* make prompt_template optional.
* download tabby.json in downloader
2023-06-02 16:47:48 -07:00
Meng Zhang
950a7a795f
fix: when model_id is an local dir, don't try to download model from remote
2023-06-02 13:48:53 -07:00
Meng Zhang
3cac2607e7
refactor: improve error handlings, fix clippy warnings ( #181 )
...
* refactor: minor improvements on error handling
* refactor: cleanup error handlings
* update
* update
* fix
* add clippy / test workflow
* fix clippy
* fix clippy
* update
2023-06-01 17:23:05 -07:00
Meng Zhang
ca077a3403
feat: ensure model exist before serving ( #180 )
...
* chore: migrate completion to new metadata format
* feat: ensure model exist before serving
2023-06-01 07:26:21 +00:00
Meng Zhang
9131567257
chore: migrate completion to new metadata format ( #179 )
2023-06-01 07:08:09 +00:00
Meng Zhang
e8dbd36663
feat: improve download command - support local cache checking behavior ( #178 )
...
* move download.rs
* add metadata
* support prefer local args
* fix format
* replace errorchain with anyhow
2023-06-01 06:42:04 +00:00
Meng Zhang
b8d67770ee
fix: cors in /, stop words removal ( #171 )
2023-05-30 17:12:10 -07:00
Meng Zhang
f3b37b253b
feat: add events logger ( #170 )
...
* add common events
* log events in serve
* formatting
* extract serde_conv
* update
* update
* fix logging
* update
2023-05-30 15:44:29 -07:00
Meng Zhang
d8cee4adac
chore: extract tabby-common ( #169 )
...
* chore: extract tabby-common
* simplify
2023-05-29 23:39:02 -07:00
Meng Zhang
8956056120
fix: support local model path ( #165 )
2023-05-29 17:40:16 -07:00
Meng Zhang
418558c05d
chore: cargo fmt
2023-05-29 17:00:23 -07:00
Meng Zhang
0d11b0e832
Make language field optional in rust implementation ( #164 )
...
* remove download_model.py as we have tabby serve now
* Make language field optional
2023-05-29 16:58:02 -07:00
Meng Zhang
48796ecd77
feat: add `tabby download` command ( #157 )
...
* simplify fmt-display
* cleanup
* move tabby-admin to reduce nest
* add model downloader
* get rid of model-type
* improve commands
* fix fmt
2023-05-28 14:36:11 -07:00
Meng Zhang
80d1dd1ca6
Add admin panel (behand --experimental-admin-panel) ( #156 )
...
* add admin
* add feature:admin_panel
* update
* add dist directory
* update
2023-05-28 01:17:32 -07:00
Meng Zhang
b8309d98cc
Switch to sccache ( #154 )
...
* fix fmt
* fix
* fix test
* fix clippy
* switch to sc cache
* fix
* update
* update
* update
* fix
* add test
* remove clippy
* update
* disable incremental
* update
* simply
2023-05-27 16:20:17 -07:00
Meng Zhang
0000312460
Add download_model.py. Adjust ctranslate2 backend model structure ( #153 )
...
* adjust
* update
* update
2023-05-27 14:51:12 -07:00
Meng Zhang
552711a560
Support causal lm (decoder only model) ( #151 )
...
* support
* support causal lm
2023-05-27 01:26:33 -07:00
Meng Zhang
72ed30e9ff
Build link shared in docker for ctranslate2 ( #150 )
...
* Build link shared in docker
* update
* update
2023-05-27 00:05:56 -07:00
Meng Zhang
06cf34a007
support static linking of ctranslate2 ( #148 )
...
* support static linking of ctranslate2
* update
* remove submodule rust-cxx-cmake-bridge
* support alwayslink with whole-archive
* update
* move export_libs
* update docker config
* update ctranslate2
* remove
* update
* update build.rs
* parse external libs
* cleanup
* add cargo fmt
2023-05-26 21:34:31 -07:00
Meng Zhang
7b10340e67
feat: add --port to serve command
2023-05-26 00:32:11 -07:00
Meng Zhang
c296b83de9
chore: remove unused lock
2023-05-26 00:06:10 -07:00
Meng Zhang
8dfe49ec6c
feat: support cuda devices in rust tabby ( #149 )
2023-05-25 23:23:07 -07:00
Meng Zhang
0acc975618
Support linux ctranslate2 cuda build ( #147 )
...
* Support linux build
* add <memory> to fix build error in linux
* add Dockerfile.tabby
* update
* update
* add rust docker image pipeline
* add docker.rust.yml
2023-05-25 18:18:22 -07:00
Meng Zhang
80588ddd22
fix: remove wrongly added submodule
2023-05-25 15:08:34 -07:00
Meng Zhang
a2476af373
add ctranslate2-bindings / tabby rust packages ( #146 )
...
* add ctranslate2-bindings
* add fixme for linux build
* turn off shared lib
* add tabby-cli
2023-05-25 14:05:28 -07:00