Meng Zhang
4cb672ec39
feat: improve error handling and messages [TAB-58] ( #213 )
...
* add fatal macro
* switch expect to fatal
* improve error handling of serve
* improve error handling on download module
* improve error handling in scheduler
* improve error handling
* fmt
* fmt
2023-06-07 02:02:58 +00:00
Meng Zhang
fd1baff8d5
feat: support stop sequences [TAB-52] ( #212 )
...
* refactor: pass step and string token to callback
* add token to callback
* add stop regexp
* implement stop words logic
* pass token_ids from inference
* improve efficiency of regexp match with reversed regex
* fmt
* add typescript and javascript stop words
* add cache for stop words regexp
2023-06-06 23:28:58 +00:00
Meng Zhang
007a40c582
feat: support early stop [TAB-51] ( #208 )
...
* bump ctranslate2 to v3.15.0
* enable early stop
* support early stop
2023-06-06 12:46:17 +00:00
Meng Zhang
2bf5bcd0cf
refactor: extract TextInferenceEngineImpl to reduce duplication between EncoderDecoderImpl and DecoderImpl #189
2023-06-04 22:28:39 +00:00
Meng Zhang
6de61f45bb
chore: mark thread safety [TAB-52] ( #186 )
...
* mark thread safety
* use shared_ptr to ensure thread safety
* fmt
2023-06-04 06:23:31 +00:00
Meng Zhang
b8309d98cc
Switch to sccache ( #154 )
...
* fix fmt
* fix
* fix test
* fix clippy
* switch to sccache
* fix
* update
* update
* update
* fix
* add test
* remove clippy
* update
* disable incremental
* update
* simplify
2023-05-27 16:20:17 -07:00
Meng Zhang
552711a560
Support causal lm (decoder only model) ( #151 )
...
* support
* support causal lm
2023-05-27 01:26:33 -07:00
Meng Zhang
7b10340e67
feat: add --port to serve command
2023-05-26 00:32:11 -07:00
Meng Zhang
c296b83de9
chore: remove unused lock
2023-05-26 00:06:10 -07:00
Meng Zhang
8dfe49ec6c
feat: support cuda devices in rust tabby ( #149 )
2023-05-25 23:23:07 -07:00
Meng Zhang
a2476af373
add ctranslate2-bindings / tabby rust packages ( #146 )
...
* add ctranslate2-bindings
* add fixme for linux build
* turn off shared lib
* add tabby-cli
2023-05-25 14:05:28 -07:00