Meng Zhang
4388fd0050
feat: support prompt rewriting ( #295 )
...
* refactor: extract PromptBuilder
* feat: load tantivy index in prompt builder
* integrate with searcher
* add enable_prompt_rewrite to control rewrite behavior
* nit docs
* limit 1 snippet per identifier
* extract magic numbers
2023-07-13 09:05:41 +00:00
Meng Zhang
19586a4926
fix: fix bad indexing field of tag indexer ( #293 )
2023-07-13 07:19:46 +00:00
Meng Zhang
24d17dc1ed
feat: switching index to tag based ( #265 )
...
* add tag based index
* update for python
* fix lint
2023-07-13 06:25:39 +00:00
Meng Zhang
be90047477
fix: fix int8 compute type, fix auto compute type selection (include float32 into consideration for cuda compute capability <= 6.0) ( #291 )
2023-07-12 11:09:38 +08:00
Meng Zhang
e822d1857f
fix: add retry strategy in tabby download ( #285 )
2023-07-09 05:18:19 +00:00
Meng Zhang
9ca1f7e5f1
fix: add additional whitespace to match tokens that combining space and li… ( #270 )
...
* fix: add additional whitespace to match tokens that combining space and line break
* fix lint
2023-06-25 01:15:52 +00:00
Meng Zhang
631cff3aed
docs: update url of playground server
2023-06-23 18:55:23 -07:00
Meng Zhang
8879ed7655
feat: add experimental self host setup ( #261 )
2023-06-23 12:13:22 -07:00
Meng Zhang
59b03ae477
fix: use full clone in scheduler
2023-06-22 15:06:44 -07:00
Meng Zhang
fcbc5edc55
Revert "feat: add /experimental/search endpoint ( #258 )" ( #260 )
...
This reverts commit 04980160e5 .
2023-06-22 14:23:35 -07:00
Meng Zhang
04980160e5
feat: add /experimental/search endpoint ( #258 )
...
* feat: add /experimental/search endpoint
* fix format
2023-06-22 20:47:32 +00:00
Meng Zhang
8c1f6c2806
feat(scheduler): add java / python tags support ( #259 )
2023-06-22 20:25:56 +00:00
Meng Zhang
7ed5dd584d
feat: experiment ctags support in scheduler ( #207 )
...
* experiment ctags support
* add document.rs
* extract Document to common
* integrate tags into dataset builder
* skip if none
* do not add scheduler in client binary
* fix fmt
2023-06-21 19:48:13 -07:00
Meng Zhang
6eae16d475
fix: typo in openapi documentation
2023-06-16 20:22:38 -07:00
Meng Zhang
8ee700089f
feat: do not use fim template when suffix is empty string
2023-06-15 09:27:32 -07:00
Meng Zhang
9abf1a7521
fix: optional file should be put in cache key to avoid internet access when prefer_local_files = true ( #241 )
...
* feat: when file is 404, cache as NotFound
* explicitly mark optional file
* refactor
2023-06-15 04:13:52 +00:00
Meng Zhang
d572cf7d6d
api: add user field in completion api
2023-06-14 10:50:03 -07:00
Meng Zhang
93d5d8b297
docs: update website openapi.json
2023-06-13 13:13:03 -07:00
Meng Zhang
b2734aed59
feat: returns more information in /v1/health
2023-06-13 13:11:20 -07:00
Meng Zhang
9c9e46c6f4
feat: support set compute_type through commandline arguments
2023-06-13 12:04:07 -07:00
Meng Zhang
ba7e04d030
refactor: remove admin
2023-06-13 11:37:55 -07:00
Meng Zhang
5985d91782
fix: use int8_float16 to fix SantaCoder-1B ( #237 )
...
#236
2023-06-13 01:13:06 -07:00
Meng Zhang
a8a145d9b3
feat: add metrics in dataset ( #235 )
2023-06-12 19:39:43 +00:00
Meng Zhang
2983f70d22
feat: build index from dataset ( #234 )
2023-06-12 19:21:27 +00:00
Meng Zhang
9af6607f85
feat: add dataset building ( #231 )
...
* feat: build dataset in scheduler
* add file rotate
2023-06-12 15:19:43 +00:00
Meng Zhang
3b7153ba23
docs: update website
2023-06-11 12:28:23 -07:00
Meng Zhang
de546b03fe
feat: add otlp-endpoint for OpenTelemetry support [TAB-67] ( #227 )
...
* feat: add otlp-endpoint for OpenTelemetry support
* set default log level for axum tracing to INFO
* update build enviornment
* update
2023-06-10 22:46:25 -07:00
Meng Zhang
6180b32980
feat: add /v1/health ( #226 )
...
* feat: add /v1/health
* fix fmt
2023-06-10 22:37:42 -07:00
Meng Zhang
6718afbf67
fix: server should still support prompt only use case
2023-06-10 00:39:32 -07:00
Meng Zhang
eecdde0269
fix: vocabulary.txt / shared_vocabulary.txt should be optional ( #225 )
2023-06-09 06:49:09 +00:00
Meng Zhang
40460885b0
docs: improve website theming
2023-06-07 11:17:00 -07:00
Meng Zhang
1aaf29c968
docs: switch openapi docs ( #215 )
...
* update openapi
* update
* fix: shared_vocabulary is not a required file
* docs: improve docs
2023-06-07 01:58:05 -07:00
Meng Zhang
4cb672ec39
feat: improve error handling and messages [TAB-58] ( #213 )
...
* add fatal macro
* switch expect to fatal
* improve error handling of serve
* improve error handling on download module
* improve error handling in scheduler
* improve error handling
* fmt
* fmt
2023-06-07 02:02:58 +00:00
Meng Zhang
fd1baff8d5
feat: support stop sequences [TAB-52] ( #212 )
...
* refactor: pass step and string token to callback
* add token to callback
* add stop regexp
* implement stop words logic
* pass token_ids from inference
* improve effiency of regexp match with reversed regex
* fmt
* add typescript and javascript stop words
* add cache for stop words regexp
2023-06-06 23:28:58 +00:00
Meng Zhang
007a40c582
feat: support early stop [TAB-51] ( #208 )
...
* bump ctranslate2 to v3.15.0
* enable early stop
* support early stop
2023-06-06 12:46:17 +00:00
Meng Zhang
249d51d0f5
feat: add indexer [TAB-17] ( #199 )
...
* add basic indexer
* formatting
2023-06-05 22:18:10 +00:00
Meng Zhang
272dde9769
refactor: rust nightly format ( #197 )
...
* chore: turn on group format
* turn on nightly fmt
2023-06-05 14:17:07 -07:00
Meng Zhang
f4442b104f
docs: usage string for scheduler
2023-06-05 12:57:18 -07:00
Meng Zhang
e8a33312bb
refactor: extract download into tabby-download ( #195 )
...
* refactor: extract download into tabby-download
* remove unused deps
2023-06-05 18:40:24 +00:00
Meng Zhang
e8b1c10738
feat: add `tabby scheduler` command ( #194 )
...
* feat: add `tabby scheduler` command
* update test cases
* fix fmt
2023-06-05 18:29:38 +00:00
Meng Zhang
da02d471a9
chore: add tabby-scheduler [TAB-17] ( #192 )
...
* add scheduler
* update fmt
* add integration tests for scheduler
2023-06-05 03:08:43 +00:00
Meng Zhang
2bf5bcd0cf
refactor: extract TextInferenceEngineImpl to reduce duplications between EncoderDecoderImpl and DecoderImpl #189
2023-06-04 22:28:39 +00:00
Meng Zhang
6de61f45bb
chore: mark thread safety [TAB-52] ( #186 )
...
* mark thread safety
* use shared_ptr to ensure thread safety
* fmt
2023-06-04 06:23:31 +00:00
Meng Zhang
775576b53e
fix: only use prompt_template when suffix presents [TAB-46] ( #184 )
...
* fix: only use prompt_template when suffix presents
* lint
2023-06-03 17:29:04 +00:00
Meng Zhang
2779da3cba
feat: supports FIM inference [TAB-46] ( #183 )
...
* Add prefix / suffix
* update
* feat: support segments in inference
* chore: add tabby.json in model repository to store prompt_template
* make prompt_template optional.
* download tabby.json in downloader
2023-06-02 16:47:48 -07:00
Meng Zhang
950a7a795f
fix: when model_id is an local dir, don't try to download model from remote
2023-06-02 13:48:53 -07:00
Meng Zhang
3cac2607e7
refactor: improve error handlings, fix clippy warnings ( #181 )
...
* refactor: minor improvements on error handling
* refactor: cleanup error handlings
* update
* update
* fix
* add clippy / test workflow
* fix clippy
* fix clippy
* update
2023-06-01 17:23:05 -07:00
Meng Zhang
ca077a3403
feat: ensure model exist before serving ( #180 )
...
* chore: migrate completion to new metadata format
* feat: ensure model exist before serving
2023-06-01 07:26:21 +00:00
Meng Zhang
9131567257
chore: migrate completion to new metadata format ( #179 )
2023-06-01 07:08:09 +00:00
Meng Zhang
e8dbd36663
feat: improve download command - support local cache checking behavior ( #178 )
...
* move download.rs
* add metadata
* support prefer local args
* fix format
* replace errorchain with anyhow
2023-06-01 06:42:04 +00:00