Commit Graph

185 Commits (144dceae4198aa8dd55b3046493e893391817edb)

Author SHA1 Message Date
Meng Zhang 144dceae41 fix: cap parallelisim to 4 for cuda to avoid oom 2023-10-20 00:36:35 -07:00
Meng Zhang a9f1829a52 Revert "Revert "refactor: deprecate --compute-type (#505)""
This reverts commit aa6f39985c.
2023-10-20 00:36:17 -07:00
Meng Zhang e125ab82fc Revert "Revert "feat: deprecate num_replicas_per_thread, generate default value for it""
This reverts commit 6ae42edeb9.
2023-10-20 00:36:09 -07:00
Meng Zhang 0f63d239ad Release 0.3.1-rc.0
ctranslate2-bindings@0.3.1-rc.0
http-api-bindings@0.3.1-rc.0
llama-cpp-bindings@0.3.1-rc.0
rust-cxx-cmake-bridge@0.3.1-rc.0
tabby@0.3.1-rc.0
tabby-common@0.3.1-rc.0
tabby-download@0.3.1-rc.0
tabby-inference@0.3.1-rc.0
tabby-scheduler@0.3.1-rc.0

Generated by cargo-workspaces
2023-10-19 00:14:42 -07:00
Meng Zhang 6ae42edeb9 Revert "feat: deprecate num_replicas_per_thread, generate default value for it"
This reverts commit b3b498624c.
2023-10-19 00:12:28 -07:00
Meng Zhang aa6f39985c Revert "refactor: deprecate --compute-type (#505)"
This reverts commit f7ebce2514.
2023-10-19 00:12:17 -07:00
Meng Zhang 00c9185488 fix(prompt): revert back the behavior, enabling fill-in-the-middle if suffix is empty (use an linebreak as suffix) 2023-10-14 11:42:38 -07:00
Meng Zhang 1a43048ee9 fix: ignore NotReady error for IndexServer 2023-10-14 01:52:40 -07:00
Meng Zhang 4dbaf4f312 Release 0.3.0
ctranslate2-bindings@0.3.0
http-api-bindings@0.3.0
llama-cpp-bindings@0.3.0
rust-cxx-cmake-bridge@0.3.0
tabby@0.3.0
tabby-common@0.3.0
tabby-download@0.3.0
tabby-inference@0.3.0
tabby-scheduler@0.3.0

Generated by cargo-workspaces
2023-10-13 17:45:07 -07:00
Meng Zhang ac73cbb6f9 fix: revisit language stop words support 2023-10-13 13:40:55 -07:00
Meng Zhang eb463ba496 Release 0.3.0-rc.1
ctranslate2-bindings@0.3.0-rc.1
http-api-bindings@0.3.0-rc.1
llama-cpp-bindings@0.3.0-rc.1
rust-cxx-cmake-bridge@0.3.0-rc.1
tabby@0.3.0-rc.1
tabby-common@0.3.0-rc.1
tabby-download@0.3.0-rc.1
tabby-inference@0.3.0-rc.1
tabby-scheduler@0.3.0-rc.1

Generated by cargo-workspaces
2023-10-13 11:43:34 -07:00
Meng Zhang f9b9a1a174 Merge remote-tracking branch 'origin/main' into r0.3 2023-10-13 11:39:51 -07:00
Meng Zhang f8be71d27d
refactor: improve swagger docs (#549)
* refactor: remove server url in swagger config (as we set "/" for default
server path)

* refactor: polish field naming

* refactor: remove useless modifier for ApiDoc
2023-10-13 18:38:05 +00:00
Meng Zhang 182aceed41 Release 0.3.0-rc.0
ctranslate2-bindings@0.3.0-rc.0
http-api-bindings@0.3.0-rc.0
llama-cpp-bindings@0.3.0-rc.0
tabby@0.3.0-rc.0
tabby-common@0.3.0-rc.0
tabby-download@0.3.0-rc.0
tabby-inference@0.3.0-rc.0
tabby-scheduler@0.3.0-rc.0

Generated by cargo-workspaces
2023-10-13 11:24:36 -07:00
Meng Zhang 1a87c99488
feat: add debug flag disable_prompt_rewrite (#545) 2023-10-13 06:55:41 +00:00
Meng Zhang 1ad871e1ff
feat: add debug request / response to visualize prompting with source code index (#544)
* feat: logs segments in completion log

* feat: tune prompt format and improve testing

* add debug options for easier of visualizing the prompt

* update
2023-10-12 19:27:52 -07:00
Meng Zhang 888fbca7e1
feat: support indexing typescript / javascript / tsx / jsx (#539) 2023-10-12 00:42:58 +00:00
Meng Zhang 92c1f5a8c0
feat: turn on index server if avaiable (#536) 2023-10-11 23:27:41 +00:00
Meng Zhang 1f9e248dfa
refactor: support multiple page in playground (#537) 2023-10-11 23:27:20 +00:00
Meng Zhang 99c39375fd
feat: set default empty screen questions for tabby-playground (#535) 2023-10-11 22:31:35 +00:00
Meng Zhang 75d2944fb6
feat: support loading the source code index whenever it's ready in file system (#530)
* feat: support loading index whenever it's ready

* fix test
2023-10-10 21:35:20 -07:00
Meng Zhang 6dbb712918 Release 0.3.0-dev
ctranslate2-bindings@0.3.0-dev
http-api-bindings@0.3.0-dev
llama-cpp-bindings@0.3.0-dev
tabby@0.3.0-dev
tabby-common@0.3.0-dev
tabby-download@0.3.0-dev
tabby-inference@0.3.0-dev
tabby-scheduler@0.3.0-dev

Generated by cargo-workspaces
2023-10-09 19:39:27 -07:00
Meng Zhang d7a8f50de8 fix: use spawn_blocking for sub task without await 2023-10-09 19:38:28 -07:00
Meng Zhang d21a4de79c
chore: set max timeout for /v1/completions handler (#526)
* chore: set max timeout for /v1/completions handler

* refactor: extract sub routers

* fix
2023-10-09 18:44:55 -07:00
Meng Zhang 24eadf0de8
refactor: make /v1/health accept GET requests (#527) 2023-10-09 18:34:56 +00:00
Meng Zhang 3eb5f4132c chore: add 'v*' match pattern to restrict git describe only compare against a actual release 2023-10-09 11:13:37 -07:00
Meng Zhang 0f8ee7f589
refactor: move http engine creation to its sub crates (#524) 2023-10-09 17:37:04 +00:00
Meng Zhang 8c09f75360
refactor: extract language related data into languages.rs (#518)
* refactor: extract language related data into languages.rs

* fix

* cleanup index

* fix

* further sanitize

* add a score threshold
2023-10-07 01:40:21 +00:00
Meng Zhang d85a7892d1
feat: connect prompt rewriting part (#517)
* feat: enable /v1beta/search if index is available

* make prompt rewriting work

* update

* fix test

* fix api doc
2023-10-07 00:29:24 +00:00
Meng Zhang 8497fb1372
feat: implement /v1beta/search interface (#516)
* feat: implement /v1beta/search interface

* update

* update

* improve debugger
2023-10-06 18:54:12 +00:00
Meng Zhang fd2a1ab865
fix: switch back to regex based implementation for stop words (#513) 2023-10-06 09:04:37 +00:00
Meng Zhang 4c00ac06fb
fix(download): mark ggml model downloading should be optional, as ggml is only used for metal backend for now (#512) 2023-10-05 16:54:56 +00:00
Meng Zhang 9cd2accbaa
feat: adjust code indexing logic (#510) 2023-10-05 05:29:41 +00:00
Meng Zhang 1babc38902
feat: add datset viewer, remove treesitter languages that hasn't been verified (#509)
* refactor: remove not verified tree sitter queries

* feat(experimental): add dataset viewer

update
2023-10-04 21:49:42 -07:00
Meng Zhang 55f68d4224
test: unit test for indexing job (#508)
* test: unit test for indexing job

* update

* reduce test fixture length
2023-10-04 23:27:19 +00:00
Meng Zhang f7ebce2514
refactor: deprecate --compute-type (#505) 2023-10-04 18:45:34 +00:00
Meng Zhang 8a03c9bf17 refactor: use / as server url
Swagger access to the page, regardless of host / port, will just work.
2023-10-03 18:33:56 -07:00
Meng Zhang 2d5b3e4ff5 chore: release v0.2.1 2023-10-03 17:13:39 -07:00
Meng Zhang 503c44e7c5 fix: playground environment misconfig 2023-10-03 17:10:02 -07:00
Meng Zhang b3b498624c feat: deprecate num_replicas_per_thread, generate default value for it 2023-10-03 17:02:37 -07:00
Meng Zhang 1afba47059 feat: allow set num_replicas_per_device for CUDA to increase throughput 2023-10-03 15:52:25 -07:00
Meng Zhang ceaa7ab012 chore: update main branch to v0.3.0-dev 2023-10-03 13:38:27 -07:00
Meng Zhang 1731c3075e chore: Update version to 0.2.0 2023-10-03 13:32:21 -07:00
Meng Zhang 0e5128e8fb feat: add chat_template field in tabby.json 2023-10-03 11:46:05 -07:00
Meng Zhang 7fc76228f7 chore: add debug log for /chat interface 2023-10-03 11:38:58 -07:00
Meng Zhang 692c2fe0fd Release 0.2.0-rc.0
ctranslate2-bindings@0.2.0-rc.0
http-api-bindings@0.2.0-rc.0
llama-cpp-bindings@0.2.0-rc.0
tabby@0.2.0-rc.0
tabby-common@0.2.0-rc.0
tabby-download@0.2.0-rc.0
tabby-inference@0.2.0-rc.0
tabby-scheduler@0.2.0-rc.0

Generated by cargo-workspaces
2023-10-02 19:14:12 -07:00
Meng Zhang 6306bb3f01
fix: if local file doens't exist, local_cache_key should be cleared (#501)
* fix: if local file doens't exist, local_cache_key should be cleared

* fix
2023-10-02 23:48:35 +00:00
Meng Zhang ce20bd6154
refactor: use RegexSet for cleaer stop regex construction (#499)
* fix: add a regression test cased for stop words regex matching

* refactor: use RegexSet for cleaer stop regex construction
2023-10-02 23:21:51 +00:00
Meng Zhang 63612d5a67
fix(tabby-download): even when prefer_local_file is set to true, we should still check for remote (if network is avaialble), to see if a file should be upgraded (#500) 2023-10-02 23:09:57 +00:00
Meng Zhang 80a17aea37
feat: only show /v1/chat api if --chat-model is set (#498) 2023-10-02 17:17:27 +00:00