Commit Graph

8 Commits (fd1baff8d5f51d18eec6ba71bb49da811265b874)

Author SHA1 Message Date
Meng Zhang fd1baff8d5
feat: support stop sequences [TAB-52] (#212)
* refactor: pass step and string token to callback

* add token to callback

* add stop regexp

* implement stop words logic

* pass token_ids from inference

* improve effiency of regexp match with reversed regex

* fmt

* add typescript and javascript stop words

* add cache for stop words regexp
2023-06-06 23:28:58 +00:00
Meng Zhang 007a40c582
feat: support early stop [TAB-51] (#208)
* bump ctranslate2 to v3.15.0

* enable early stop

* support early stop
2023-06-06 12:46:17 +00:00
Meng Zhang 2bf5bcd0cf
refactor: extract TextInferenceEngineImpl to reduce duplications between EncoderDecoderImpl and DecoderImpl #189 2023-06-04 22:28:39 +00:00
Meng Zhang 6de61f45bb
chore: mark thread safety [TAB-52] (#186)
* mark thread safety

* use shared_ptr to ensure thread safety

* fmt
2023-06-04 06:23:31 +00:00
Meng Zhang 552711a560
Support causal lm (decoder only model) (#151)
* support

* support causal lm
2023-05-27 01:26:33 -07:00
Meng Zhang 8dfe49ec6c
feat: support cuda devices in rust tabby (#149) 2023-05-25 23:23:07 -07:00
Meng Zhang 0acc975618
Support linux ctranslate2 cuda build (#147)
* Support linux build

* add <memory> to fix build error in linux

* add Dockerfile.tabby

* update

* update

* add rust docker image pipeline

* add docker.rust.yml
2023-05-25 18:18:22 -07:00
Meng Zhang a2476af373
add ctranslate2-bindings / tabby rust packages (#146)
* add ctranslate2-bindings

* add fixme for linux build

* turn off shared lib

* add tabby-cli
2023-05-25 14:05:28 -07:00