Commit Graph

198 Commits (dcf82d45b58ca40ec6c357339457ea823a11fdb4)

Author SHA1 Message Date
Meng Zhang 5985d91782
fix: use int8_float16 to fix SantaCoder-1B (#237)
#236
2023-06-13 01:13:06 -07:00
Meng Zhang a8a145d9b3
feat: add metrics in dataset (#235) 2023-06-12 19:39:43 +00:00
Meng Zhang 2983f70d22
feat: build index from dataset (#234) 2023-06-12 19:21:27 +00:00
Meng Zhang 9af6607f85
feat: add dataset building (#231)
* feat: build dataset in scheduler

* add file rotate
2023-06-12 15:19:43 +00:00
Meng Zhang 3b7153ba23 docs: update website 2023-06-11 12:28:23 -07:00
Meng Zhang de546b03fe
feat: add otlp-endpoint for OpenTelemetry support [TAB-67] (#227)
* feat: add otlp-endpoint for OpenTelemetry support

* set default log level for axum tracing to INFO

* update build enviornment

* update
2023-06-10 22:46:25 -07:00
Meng Zhang 6180b32980
feat: add /v1/health (#226)
* feat: add /v1/health

* fix fmt
2023-06-10 22:37:42 -07:00
Meng Zhang 6718afbf67 fix: server should still support prompt only use case 2023-06-10 00:39:32 -07:00
Meng Zhang eecdde0269
fix: vocabulary.txt / shared_vocabulary.txt should be optional (#225) 2023-06-09 06:49:09 +00:00
Meng Zhang 40460885b0 docs: improve website theming 2023-06-07 11:17:00 -07:00
Meng Zhang 1aaf29c968
docs: switch openapi docs (#215)
* update openapi

* update

* fix: shared_vocabulary is not a required file

* docs: improve docs
2023-06-07 01:58:05 -07:00
Meng Zhang 4cb672ec39
feat: improve error handling and messages [TAB-58] (#213)
* add fatal macro

* switch expect to fatal

* improve error handling of serve

* improve error handling on download module

* improve error handling in scheduler

* improve error handling

* fmt

* fmt
2023-06-07 02:02:58 +00:00
Meng Zhang fd1baff8d5
feat: support stop sequences [TAB-52] (#212)
* refactor: pass step and string token to callback

* add token to callback

* add stop regexp

* implement stop words logic

* pass token_ids from inference

* improve effiency of regexp match with reversed regex

* fmt

* add typescript and javascript stop words

* add cache for stop words regexp
2023-06-06 23:28:58 +00:00
Meng Zhang 007a40c582
feat: support early stop [TAB-51] (#208)
* bump ctranslate2 to v3.15.0

* enable early stop

* support early stop
2023-06-06 12:46:17 +00:00
Meng Zhang 249d51d0f5
feat: add indexer [TAB-17] (#199)
* add basic indexer

* formatting
2023-06-05 22:18:10 +00:00
Meng Zhang 272dde9769
refactor: rust nightly format (#197)
* chore: turn on group format

* turn on nightly fmt
2023-06-05 14:17:07 -07:00
Meng Zhang f4442b104f docs: usage string for scheduler 2023-06-05 12:57:18 -07:00
Meng Zhang e8a33312bb
refactor: extract download into tabby-download (#195)
* refactor: extract download into tabby-download

* remove unused deps
2023-06-05 18:40:24 +00:00
Meng Zhang e8b1c10738
feat: add `tabby scheduler` command (#194)
* feat: add `tabby scheduler` command

* update test cases

* fix fmt
2023-06-05 18:29:38 +00:00
Meng Zhang da02d471a9
chore: add tabby-scheduler [TAB-17] (#192)
* add scheduler

* update fmt

* add integration tests for scheduler
2023-06-05 03:08:43 +00:00
Meng Zhang 2bf5bcd0cf
refactor: extract TextInferenceEngineImpl to reduce duplications between EncoderDecoderImpl and DecoderImpl #189 2023-06-04 22:28:39 +00:00
Meng Zhang 6de61f45bb
chore: mark thread safety [TAB-52] (#186)
* mark thread safety

* use shared_ptr to ensure thread safety

* fmt
2023-06-04 06:23:31 +00:00
Meng Zhang 775576b53e
fix: only use prompt_template when suffix presents [TAB-46] (#184)
* fix: only use prompt_template when suffix presents

* lint
2023-06-03 17:29:04 +00:00
Meng Zhang 2779da3cba
feat: supports FIM inference [TAB-46] (#183)
* Add prefix / suffix

* update

* feat: support segments in inference

* chore: add tabby.json in model repository to store prompt_template

* make prompt_template optional.

* download tabby.json in downloader
2023-06-02 16:47:48 -07:00
Meng Zhang 950a7a795f fix: when model_id is an local dir, don't try to download model from remote 2023-06-02 13:48:53 -07:00
Meng Zhang 3cac2607e7
refactor: improve error handlings, fix clippy warnings (#181)
* refactor: minor improvements on error handling

* refactor: cleanup error handlings

* update

* update

* fix

* add clippy / test workflow

* fix clippy

* fix clippy

* update
2023-06-01 17:23:05 -07:00
Meng Zhang ca077a3403
feat: ensure model exist before serving (#180)
* chore: migrate completion to new metadata format

* feat: ensure model exist before serving
2023-06-01 07:26:21 +00:00
Meng Zhang 9131567257
chore: migrate completion to new metadata format (#179) 2023-06-01 07:08:09 +00:00
Meng Zhang e8dbd36663
feat: improve download command - support local cache checking behavior (#178)
* move download.rs

* add metadata

* support prefer local args

* fix format

* replace errorchain with anyhow
2023-06-01 06:42:04 +00:00
Meng Zhang b8d67770ee
fix: cors in /, stop words removal (#171) 2023-05-30 17:12:10 -07:00
Meng Zhang f3b37b253b
feat: add events logger (#170)
* add common events

* log events in serve

* formatting

* extract serde_conv

* update

* update

* fix logging

* update
2023-05-30 15:44:29 -07:00
Meng Zhang d8cee4adac
chore: extract tabby-common (#169)
* chore: extract tabby-common

* simplify
2023-05-29 23:39:02 -07:00
Meng Zhang 8956056120
fix: support local model path (#165) 2023-05-29 17:40:16 -07:00
Meng Zhang 418558c05d chore: cargo fmt 2023-05-29 17:00:23 -07:00
Meng Zhang 0d11b0e832
Make language field optional in rust implementation (#164)
* remove download_model.py as we have tabby serve now

* Make language field optional
2023-05-29 16:58:02 -07:00
Meng Zhang 48796ecd77
feat: add `tabby download` command (#157)
* simplify fmt-display

* cleanup

* move tabby-admin to reduce nest

* add model downloader

* get rid of model-type

* improve commands

* fix fmt
2023-05-28 14:36:11 -07:00
Meng Zhang 80d1dd1ca6
Add admin panel (behand --experimental-admin-panel) (#156)
* add admin

* add feature:admin_panel

* update

* add dist directory

* update
2023-05-28 01:17:32 -07:00
Meng Zhang b8309d98cc
Switch to sccache (#154)
* fix fmt

* fix

* fix test

* fix clippy

* switch to sc cache

* fix

* update

* update

* update

* fix

* add test

* remove clippy

* update

* disable incremental

* update

* simply
2023-05-27 16:20:17 -07:00
Meng Zhang 0000312460
Add download_model.py. Adjust ctranslate2 backend model structure (#153)
* adjust

* update

* update
2023-05-27 14:51:12 -07:00
Meng Zhang 552711a560
Support causal lm (decoder only model) (#151)
* support

* support causal lm
2023-05-27 01:26:33 -07:00
Meng Zhang 72ed30e9ff
Build link shared in docker for ctranslate2 (#150)
* Build link shared in docker

* update

* update
2023-05-27 00:05:56 -07:00
Meng Zhang 06cf34a007
support static linking of ctranslate2 (#148)
* support static linking of ctranslate2

* update

* remove submodule rust-cxx-cmake-bridge

* support alwayslink with whole-archive

* update

* move export_libs

* update docker config

* update ctranslate2

* remove

* update

* update build.rs

* parse external libs

* cleanup

* add cargo fmt
2023-05-26 21:34:31 -07:00
Meng Zhang 7b10340e67 feat: add --port to serve command 2023-05-26 00:32:11 -07:00
Meng Zhang c296b83de9 chore: remove unused lock 2023-05-26 00:06:10 -07:00
Meng Zhang 8dfe49ec6c
feat: support cuda devices in rust tabby (#149) 2023-05-25 23:23:07 -07:00
Meng Zhang 0acc975618
Support linux ctranslate2 cuda build (#147)
* Support linux build

* add <memory> to fix build error in linux

* add Dockerfile.tabby

* update

* update

* add rust docker image pipeline

* add docker.rust.yml
2023-05-25 18:18:22 -07:00
Meng Zhang 80588ddd22 fix: remove wrongly added submodule 2023-05-25 15:08:34 -07:00
Meng Zhang a2476af373
add ctranslate2-bindings / tabby rust packages (#146)
* add ctranslate2-bindings

* add fixme for linux build

* turn off shared lib

* add tabby-cli
2023-05-25 14:05:28 -07:00