Meng Zhang
57baecb370
fix: switch default running backend to openblas on x86 linux ( #380 )
2023-08-30 14:19:35 +00:00
Meng Zhang
054aefaf15
chore: add linux static build ( #379 )
...
* chore: add linux static build
* add touch
* update build env
* add sudo
* fix: protobuf ubuntu target
2023-08-30 18:45:05 +08:00
Meng Zhang
3526ca3164
chore: build with ruy (cpu only) on static mode for linux. ( #378 )
...
* chore: build with ruy (cpu only) on static mode for linux.
* update cmake min version
2023-08-30 18:04:40 +08:00
Meng Zhang
fc9a623e72
feat: add logging on server starting ( #372 )
2023-08-28 06:12:00 +00:00
Meng Zhang
65836ee199
feat: add stop words encoding offset for ctranslate model config ( #371 )
...
* feat: add stop words encoding offset for ctranslate model config
* feat: set default suffix to \n
* add special treatment for bytefallback tokens
2023-08-28 14:07:01 +08:00
vodkaslime
2a91a21787
feat: add gpu info to health state [TAB-162] ( #364 )
...
* feat: add gpu info to health response
* chore: error handling
* chore: refactor cpu manager code
* chore: typo
* chore: fix context mutability
* chore: fix context mutability
* feat: add link to NVML lib
* chore: refactor
* lint
* chore: resolve comments
* chore: fix typo
* chore: fix
* chore: resolve comments
* chore: fix
* chore: resolve comments
2023-08-21 18:06:38 +08:00
Meng Zhang
b1ad936033
feat: add version information in health state. ( #363 )
...
* feat: add git_hash in health state
* add more version information in health state
2023-08-20 15:21:12 +00:00
Meng Zhang
df45573501
feat: reduce ServeHealth event to every 300s to reduce event volume ( #362 )
2023-08-20 12:36:59 +00:00
vodkaslime
2026b4dd0e
feat: add architecture/cpu info to health api response [TAB-162] ( #355 )
...
* feat: add architecture, cpu and gpu info to health command
* chore: fix
* chore: fix
* chore: fix
* chore: fix lint
* chore: fix lint
* chore: remove gpu
* chore: resolve comments
* chore: resolve comments
* Update health.rs
---------
Co-authored-by: Meng Zhang <meng@tabbyml.com>
2023-08-15 15:22:03 +00:00
Meng Zhang
dbc89831b1
feat: add serve health heartbeat ( #343 )
...
* add serve health tracking
* fix lint
* fix
2023-08-09 08:08:42 +00:00
Meng Zhang
d0f6ad2d2a
feat: add anonymous usage tracker ( #342 )
...
* feat: add anonymous usage tracker
* improve deps
* update
* update
2023-08-09 07:31:13 +00:00
Meng Zhang
220fcc0d65
fix: make `config.experimental` optional ( #339 )
...
* fix: make config.experimental` optional
* add unit test for empty toml config
2023-08-07 09:53:00 +00:00
Meng Zhang
4eaae27ed3
Update Cargo.toml ( #331 )
2023-08-03 19:55:00 +08:00
Meng Zhang
6a50902ca7
fix: support ctranslate2 rev7 vocab files (.json) ( #327 )
2023-08-02 13:36:31 +00:00
Meng Zhang
57c811b30f
fix: improve download logging ( #325 )
...
* Suggest use `-it` so docker run generate progress bar of downloading information properly
* add info! log for model download
2023-08-02 06:30:35 +00:00
Meng Zhang
b8308b7118
refactor: extract TextGeneration trait ( #324 )
...
* add tabby-inference
* extract TextGeneration trait
* format
* Rename TextInferenceEngine to CTranslate2Engine
2023-08-02 06:12:51 +00:00
Meng Zhang
83e1cf76d8
feat: Upgrade ctranslate2 to v3.17.1 ( #323 )
2023-08-02 05:46:08 +00:00
Meng Zhang
95bd53ac9c
feat: add select kind param. Supported editors could log line select … ( #299 )
...
* feat: add select kind param. Supported editors could log line select or block select
* fix lint
2023-07-16 16:02:40 +08:00
Meng Zhang
be5fe0d737
feat: add rust prompt rewrite support ( #296 )
2023-07-13 09:31:44 +00:00
Meng Zhang
4388fd0050
feat: support prompt rewriting ( #295 )
...
* refactor: extract PromptBuilder
* feat: load tantivy index in prompt builder
* integrate with searcher
* add enable_prompt_rewrite to control rewrite behavior
* nit docs
* limit 1 snippet per identifier
* extract magic numbers
2023-07-13 09:05:41 +00:00
Meng Zhang
19586a4926
fix: fix bad indexing field of tag indexer ( #293 )
2023-07-13 07:19:46 +00:00
Meng Zhang
24d17dc1ed
feat: switching index to tag based ( #265 )
...
* add tag based index
* update for python
* fix lint
2023-07-13 06:25:39 +00:00
Meng Zhang
be90047477
fix: fix int8 compute type, fix auto compute type selection (include float32 into consideration for cuda compute capability <= 6.0) ( #291 )
2023-07-12 11:09:38 +08:00
Meng Zhang
e822d1857f
fix: add retry strategy in tabby download ( #285 )
2023-07-09 05:18:19 +00:00
Meng Zhang
9ca1f7e5f1
fix: add additional whitespace to match tokens that combining space and li… ( #270 )
...
* fix: add additional whitespace to match tokens that combining space and line break
* fix lint
2023-06-25 01:15:52 +00:00
Meng Zhang
631cff3aed
docs: update url of playground server
2023-06-23 18:55:23 -07:00
Meng Zhang
8879ed7655
feat: add experimental self host setup ( #261 )
2023-06-23 12:13:22 -07:00
Meng Zhang
59b03ae477
fix: use full clone in scheduler
2023-06-22 15:06:44 -07:00
Meng Zhang
fcbc5edc55
Revert "feat: add /experimental/search endpoint ( #258 )" ( #260 )
...
This reverts commit 04980160e5 .
2023-06-22 14:23:35 -07:00
Meng Zhang
04980160e5
feat: add /experimental/search endpoint ( #258 )
...
* feat: add /experimental/search endpoint
* fix format
2023-06-22 20:47:32 +00:00
Meng Zhang
8c1f6c2806
feat(scheduler): add java / python tags support ( #259 )
2023-06-22 20:25:56 +00:00
Meng Zhang
7ed5dd584d
feat: experiment ctags support in scheduler ( #207 )
...
* experiment ctags support
* add document.rs
* extract Document to common
* integrate tags into dataset builder
* skip if none
* do not add scheduler in client binary
* fix fmt
2023-06-21 19:48:13 -07:00
Meng Zhang
6eae16d475
fix: typo in openapi documentation
2023-06-16 20:22:38 -07:00
Meng Zhang
8ee700089f
feat: do not use fim template when suffix is empty string
2023-06-15 09:27:32 -07:00
Meng Zhang
9abf1a7521
fix: optional file should be put in cache key to avoid internet access when prefer_local_files = true ( #241 )
...
* feat: when file is 404, cache as NotFound
* explicitly mark optional file
* refactor
2023-06-15 04:13:52 +00:00
Meng Zhang
d572cf7d6d
api: add user field in completion api
2023-06-14 10:50:03 -07:00
Meng Zhang
93d5d8b297
docs: update website openapi.json
2023-06-13 13:13:03 -07:00
Meng Zhang
b2734aed59
feat: returns more information in /v1/health
2023-06-13 13:11:20 -07:00
Meng Zhang
9c9e46c6f4
feat: support set compute_type through commandline arguments
2023-06-13 12:04:07 -07:00
Meng Zhang
ba7e04d030
refactor: remove admin
2023-06-13 11:37:55 -07:00
Meng Zhang
5985d91782
fix: use int8_float16 to fix SantaCoder-1B ( #237 )
...
#236
2023-06-13 01:13:06 -07:00
Meng Zhang
a8a145d9b3
feat: add metrics in dataset ( #235 )
2023-06-12 19:39:43 +00:00
Meng Zhang
2983f70d22
feat: build index from dataset ( #234 )
2023-06-12 19:21:27 +00:00
Meng Zhang
9af6607f85
feat: add dataset building ( #231 )
...
* feat: build dataset in scheduler
* add file rotate
2023-06-12 15:19:43 +00:00
Meng Zhang
3b7153ba23
docs: update website
2023-06-11 12:28:23 -07:00
Meng Zhang
de546b03fe
feat: add otlp-endpoint for OpenTelemetry support [TAB-67] ( #227 )
...
* feat: add otlp-endpoint for OpenTelemetry support
* set default log level for axum tracing to INFO
* update build enviornment
* update
2023-06-10 22:46:25 -07:00
Meng Zhang
6180b32980
feat: add /v1/health ( #226 )
...
* feat: add /v1/health
* fix fmt
2023-06-10 22:37:42 -07:00
Meng Zhang
6718afbf67
fix: server should still support prompt only use case
2023-06-10 00:39:32 -07:00
Meng Zhang
eecdde0269
fix: vocabulary.txt / shared_vocabulary.txt should be optional ( #225 )
2023-06-09 06:49:09 +00:00
Meng Zhang
40460885b0
docs: improve website theming
2023-06-07 11:17:00 -07:00