Commit Graph

89 Commits (c76edeaa80c87afb4fe1f9bc953160fe76f6346d)

Author SHA1 Message Date
Meng Zhang c44a9c7195
fix: correct git_describe in /health (#383)
* fix: add missing Version component in OpenAPI definition

* fix: allow tag / dirty in git describe
2023-08-31 01:06:36 +00:00
Meng Zhang 57baecb370
fix: switch default running backend to openblas on x86 linux (#380) 2023-08-30 14:19:35 +00:00
Meng Zhang 054aefaf15
chore: add linux static build (#379)
* chore: add linux static build

* add touch

* update build env

* add sudo

* fix: protobuf ubuntu target
2023-08-30 18:45:05 +08:00
Meng Zhang 3526ca3164
chore: build with ruy (cpu only) on static mode for linux. (#378)
* chore: build with ruy (cpu only) on static mode for linux.

* update cmake min version
2023-08-30 18:04:40 +08:00
Meng Zhang fc9a623e72
feat: add logging on server starting (#372) 2023-08-28 06:12:00 +00:00
Meng Zhang 65836ee199
feat: add stop words encoding offset for ctranslate model config (#371)
* feat: add stop words encoding offset for ctranslate model config

* feat: set default suffix to \n

* add special treatment for bytefallback tokens
2023-08-28 14:07:01 +08:00
vodkaslime 2a91a21787
feat: add gpu info to health state [TAB-162] (#364)
* feat: add gpu info to health response

* chore: error handling

* chore: refactor cpu manager code

* chore: typo

* chore: fix context mutability

* chore: fix context mutability

* feat: add link to NVML lib

* chore: refactor

* lint

* chore: resolve comments

* chore: fix typo

* chore: fix

* chore: resolve comments

* chore: fix

* chore: resolve comments
2023-08-21 18:06:38 +08:00
Meng Zhang b1ad936033
feat: add version information in health state. (#363)
* feat: add git_hash in health state

* add more version information in health state
2023-08-20 15:21:12 +00:00
Meng Zhang df45573501
feat: reduce ServeHealth event to every 300s to reduce event volume (#362) 2023-08-20 12:36:59 +00:00
vodkaslime 2026b4dd0e
feat: add architecture/cpu info to health api response [TAB-162] (#355)
* feat: add architecture, cpu and gpu info to health command

* chore: fix

* chore: fix

* chore: fix

* chore: fix lint

* chore: fix lint

* chore: remove gpu

* chore: resolve comments

* chore: resolve comments

* Update health.rs

---------

Co-authored-by: Meng Zhang <meng@tabbyml.com>
2023-08-15 15:22:03 +00:00
Meng Zhang dbc89831b1
feat: add serve health heartbeat (#343)
* add serve health tracking

* fix lint

* fix
2023-08-09 08:08:42 +00:00
Meng Zhang d0f6ad2d2a
feat: add anonymous usage tracker (#342)
* feat: add anonymous usage tracker

* improve deps

* update

* update
2023-08-09 07:31:13 +00:00
Meng Zhang 220fcc0d65
fix: make `config.experimental` optional (#339)
* fix: make config.experimental` optional

* add unit test for empty toml config
2023-08-07 09:53:00 +00:00
Meng Zhang 4eaae27ed3
Update Cargo.toml (#331) 2023-08-03 19:55:00 +08:00
Meng Zhang 6a50902ca7
fix: support ctranslate2 rev7 vocab files (.json) (#327) 2023-08-02 13:36:31 +00:00
Meng Zhang 57c811b30f
fix: improve download logging (#325)
* Suggest use `-it` so docker run generate progress bar of downloading information properly

* add info! log for model download
2023-08-02 06:30:35 +00:00
Meng Zhang b8308b7118
refactor: extract TextGeneration trait (#324)
* add tabby-inference

* extract TextGeneration trait

* format

* Rename TextInferenceEngine to CTranslate2Engine
2023-08-02 06:12:51 +00:00
Meng Zhang 83e1cf76d8
feat: Upgrade ctranslate2 to v3.17.1 (#323) 2023-08-02 05:46:08 +00:00
Meng Zhang 95bd53ac9c
feat: add select kind param. Supported editors could log line select … (#299)
* feat: add select kind param. Supported editors could log line select or block select

* fix lint
2023-07-16 16:02:40 +08:00
Meng Zhang be5fe0d737
feat: add rust prompt rewrite support (#296) 2023-07-13 09:31:44 +00:00
Meng Zhang 4388fd0050
feat: support prompt rewriting (#295)
* refactor: extract PromptBuilder

* feat: load tantivy index in prompt builder

* integrate with searcher

* add enable_prompt_rewrite to control rewrite behavior

* nit docs

* limit 1 snippet per identifier

* extract magic numbers
2023-07-13 09:05:41 +00:00
Meng Zhang 19586a4926
fix: fix bad indexing field of tag indexer (#293) 2023-07-13 07:19:46 +00:00
Meng Zhang 24d17dc1ed
feat: switching index to tag based (#265)
* add tag based index

* update for python

* fix lint
2023-07-13 06:25:39 +00:00
Meng Zhang be90047477
fix: fix int8 compute type, fix auto compute type selection (include float32 into consideration for cuda compute capability <= 6.0) (#291) 2023-07-12 11:09:38 +08:00
Meng Zhang e822d1857f
fix: add retry strategy in tabby download (#285) 2023-07-09 05:18:19 +00:00
Meng Zhang 9ca1f7e5f1
fix: add additional whitespace to match tokens that combining space and li… (#270)
* fix: add additional whitespace to match tokens that combining space and line break

* fix lint
2023-06-25 01:15:52 +00:00
Meng Zhang 631cff3aed docs: update url of playground server 2023-06-23 18:55:23 -07:00
Meng Zhang 8879ed7655
feat: add experimental self host setup (#261) 2023-06-23 12:13:22 -07:00
Meng Zhang 59b03ae477 fix: use full clone in scheduler 2023-06-22 15:06:44 -07:00
Meng Zhang fcbc5edc55
Revert "feat: add /experimental/search endpoint (#258)" (#260)
This reverts commit 04980160e5.
2023-06-22 14:23:35 -07:00
Meng Zhang 04980160e5
feat: add /experimental/search endpoint (#258)
* feat: add /experimental/search endpoint

* fix format
2023-06-22 20:47:32 +00:00
Meng Zhang 8c1f6c2806
feat(scheduler): add java / python tags support (#259) 2023-06-22 20:25:56 +00:00
Meng Zhang 7ed5dd584d
feat: experiment ctags support in scheduler (#207)
* experiment ctags support

* add document.rs

* extract Document to common

* integrate tags into dataset builder

* skip if none

* do not add scheduler in client binary

* fix fmt
2023-06-21 19:48:13 -07:00
Meng Zhang 6eae16d475 fix: typo in openapi documentation 2023-06-16 20:22:38 -07:00
Meng Zhang 8ee700089f feat: do not use fim template when suffix is empty string 2023-06-15 09:27:32 -07:00
Meng Zhang 9abf1a7521
fix: optional file should be put in cache key to avoid internet access when prefer_local_files = true (#241)
* feat: when file is 404, cache as NotFound

* explicitly mark optional file

* refactor
2023-06-15 04:13:52 +00:00
Meng Zhang d572cf7d6d api: add user field in completion api 2023-06-14 10:50:03 -07:00
Meng Zhang 93d5d8b297 docs: update website openapi.json 2023-06-13 13:13:03 -07:00
Meng Zhang b2734aed59 feat: returns more information in /v1/health 2023-06-13 13:11:20 -07:00
Meng Zhang 9c9e46c6f4 feat: support set compute_type through commandline arguments 2023-06-13 12:04:07 -07:00
Meng Zhang ba7e04d030 refactor: remove admin 2023-06-13 11:37:55 -07:00
Meng Zhang 5985d91782
fix: use int8_float16 to fix SantaCoder-1B (#237)
#236
2023-06-13 01:13:06 -07:00
Meng Zhang a8a145d9b3
feat: add metrics in dataset (#235) 2023-06-12 19:39:43 +00:00
Meng Zhang 2983f70d22
feat: build index from dataset (#234) 2023-06-12 19:21:27 +00:00
Meng Zhang 9af6607f85
feat: add dataset building (#231)
* feat: build dataset in scheduler

* add file rotate
2023-06-12 15:19:43 +00:00
Meng Zhang 3b7153ba23 docs: update website 2023-06-11 12:28:23 -07:00
Meng Zhang de546b03fe
feat: add otlp-endpoint for OpenTelemetry support [TAB-67] (#227)
* feat: add otlp-endpoint for OpenTelemetry support

* set default log level for axum tracing to INFO

* update build enviornment

* update
2023-06-10 22:46:25 -07:00
Meng Zhang 6180b32980
feat: add /v1/health (#226)
* feat: add /v1/health

* fix fmt
2023-06-10 22:37:42 -07:00
Meng Zhang 6718afbf67 fix: server should still support prompt only use case 2023-06-10 00:39:32 -07:00
Meng Zhang eecdde0269
fix: vocabulary.txt / shared_vocabulary.txt should be optional (#225) 2023-06-09 06:49:09 +00:00