Meng Zhang
1a87c99488
feat: add debug flag disable_prompt_rewrite ( #545 )
2023-10-13 06:55:41 +00:00
Meng Zhang
1ad871e1ff
feat: add debug request / response to visualize prompting with source code index ( #544 )
...
* feat: logs segments in completion log
* feat: tune prompt format and improve testing
* add debug options for easier of visualizing the prompt
* update
2023-10-12 19:27:52 -07:00
Meng Zhang
888fbca7e1
feat: support indexing typescript / javascript / tsx / jsx ( #539 )
2023-10-12 00:42:58 +00:00
Meng Zhang
92c1f5a8c0
feat: turn on index server if avaiable ( #536 )
2023-10-11 23:27:41 +00:00
Meng Zhang
1f9e248dfa
refactor: support multiple page in playground ( #537 )
2023-10-11 23:27:20 +00:00
Meng Zhang
99c39375fd
feat: set default empty screen questions for tabby-playground ( #535 )
2023-10-11 22:31:35 +00:00
Meng Zhang
75d2944fb6
feat: support loading the source code index whenever it's ready in file system ( #530 )
...
* feat: support loading index whenever it's ready
* fix test
2023-10-10 21:35:20 -07:00
Meng Zhang
6dbb712918
Release 0.3.0-dev
...
ctranslate2-bindings@0.3.0-dev
http-api-bindings@0.3.0-dev
llama-cpp-bindings@0.3.0-dev
tabby@0.3.0-dev
tabby-common@0.3.0-dev
tabby-download@0.3.0-dev
tabby-inference@0.3.0-dev
tabby-scheduler@0.3.0-dev
Generated by cargo-workspaces
2023-10-09 19:39:27 -07:00
Meng Zhang
d7a8f50de8
fix: use spawn_blocking for sub task without await
2023-10-09 19:38:28 -07:00
Meng Zhang
d21a4de79c
chore: set max timeout for /v1/completions handler ( #526 )
...
* chore: set max timeout for /v1/completions handler
* refactor: extract sub routers
* fix
2023-10-09 18:44:55 -07:00
Meng Zhang
24eadf0de8
refactor: make /v1/health accept GET requests ( #527 )
2023-10-09 18:34:56 +00:00
Meng Zhang
3eb5f4132c
chore: add 'v*' match pattern to restrict git describe only compare against a actual release
2023-10-09 11:13:37 -07:00
Meng Zhang
0f8ee7f589
refactor: move http engine creation to its sub crates ( #524 )
2023-10-09 17:37:04 +00:00
Meng Zhang
8c09f75360
refactor: extract language related data into languages.rs ( #518 )
...
* refactor: extract language related data into languages.rs
* fix
* cleanup index
* fix
* further sanitize
* add a score threshold
2023-10-07 01:40:21 +00:00
Meng Zhang
d85a7892d1
feat: connect prompt rewriting part ( #517 )
...
* feat: enable /v1beta/search if index is available
* make prompt rewriting work
* update
* fix test
* fix api doc
2023-10-07 00:29:24 +00:00
Meng Zhang
8497fb1372
feat: implement /v1beta/search interface ( #516 )
...
* feat: implement /v1beta/search interface
* update
* update
* improve debugger
2023-10-06 18:54:12 +00:00
Meng Zhang
fd2a1ab865
fix: switch back to regex based implementation for stop words ( #513 )
2023-10-06 09:04:37 +00:00
Meng Zhang
4c00ac06fb
fix(download): mark ggml model downloading should be optional, as ggml is only used for metal backend for now ( #512 )
2023-10-05 16:54:56 +00:00
Meng Zhang
9cd2accbaa
feat: adjust code indexing logic ( #510 )
2023-10-05 05:29:41 +00:00
Meng Zhang
1babc38902
feat: add datset viewer, remove treesitter languages that hasn't been verified ( #509 )
...
* refactor: remove not verified tree sitter queries
* feat(experimental): add dataset viewer
update
2023-10-04 21:49:42 -07:00
Meng Zhang
55f68d4224
test: unit test for indexing job ( #508 )
...
* test: unit test for indexing job
* update
* reduce test fixture length
2023-10-04 23:27:19 +00:00
Meng Zhang
f7ebce2514
refactor: deprecate --compute-type ( #505 )
2023-10-04 18:45:34 +00:00
Meng Zhang
8a03c9bf17
refactor: use / as server url
...
Swagger access to the page, regardless of host / port, will just work.
2023-10-03 18:33:56 -07:00
Meng Zhang
2d5b3e4ff5
chore: release v0.2.1
2023-10-03 17:13:39 -07:00
Meng Zhang
503c44e7c5
fix: playground environment misconfig
2023-10-03 17:10:02 -07:00
Meng Zhang
b3b498624c
feat: deprecate num_replicas_per_thread, generate default value for it
2023-10-03 17:02:37 -07:00
Meng Zhang
1afba47059
feat: allow set num_replicas_per_device for CUDA to increase throughput
2023-10-03 15:52:25 -07:00
Meng Zhang
ceaa7ab012
chore: update main branch to v0.3.0-dev
2023-10-03 13:38:27 -07:00
Meng Zhang
1731c3075e
chore: Update version to 0.2.0
2023-10-03 13:32:21 -07:00
Meng Zhang
0e5128e8fb
feat: add chat_template field in tabby.json
2023-10-03 11:46:05 -07:00
Meng Zhang
7fc76228f7
chore: add debug log for /chat interface
2023-10-03 11:38:58 -07:00
Meng Zhang
692c2fe0fd
Release 0.2.0-rc.0
...
ctranslate2-bindings@0.2.0-rc.0
http-api-bindings@0.2.0-rc.0
llama-cpp-bindings@0.2.0-rc.0
tabby@0.2.0-rc.0
tabby-common@0.2.0-rc.0
tabby-download@0.2.0-rc.0
tabby-inference@0.2.0-rc.0
tabby-scheduler@0.2.0-rc.0
Generated by cargo-workspaces
2023-10-02 19:14:12 -07:00
Meng Zhang
6306bb3f01
fix: if local file doens't exist, local_cache_key should be cleared ( #501 )
...
* fix: if local file doens't exist, local_cache_key should be cleared
* fix
2023-10-02 23:48:35 +00:00
Meng Zhang
ce20bd6154
refactor: use RegexSet for cleaer stop regex construction ( #499 )
...
* fix: add a regression test cased for stop words regex matching
* refactor: use RegexSet for cleaer stop regex construction
2023-10-02 23:21:51 +00:00
Meng Zhang
63612d5a67
fix(tabby-download): even when prefer_local_file is set to true, we should still check for remote (if network is avaialble), to see if a file should be upgraded ( #500 )
2023-10-02 23:09:57 +00:00
Meng Zhang
80a17aea37
feat: only show /v1/chat api if --chat-model is set ( #498 )
2023-10-02 17:17:27 +00:00
Meng Zhang
f05dd3a2f6
refactor: cleanup chat api make it message oriented ( #497 )
...
* refactor: refactor into /chat/completions api
* Revert "feat: support request level stop words (#492 )"
This reverts commit 0d6840e372 .
* feat: adjust interface
* switch interface in tabby-playground
* move to chat/prompt, add unit test
* update interface
2023-10-02 15:39:15 +00:00
Meng Zhang
dfdd0373a6
fix: when llama model loads failed, panic in rust stack
2023-10-01 22:25:25 -07:00
Meng Zhang
2171ba72ff
refactor: cleanup llama cpp implementations to fix warnings ( #495 )
2023-09-30 08:37:36 -07:00
Meng Zhang
aea8c74bdc
feat: add OpenAPI link to playground
2023-09-29 18:20:38 -07:00
Meng Zhang
10bf2d6c0c
feat: add param --instruct-model, allowing specify different model for q&a use cases. ( #494 )
2023-09-29 23:44:53 +00:00
Meng Zhang
eb15933255
feat: add tabby playground for q&a use case ( #493 )
...
* init commit
* support chat
* add theme toggle
* limit message to 2 lines
* update
* update formatting
* update
* update
* update
* fix formatting
* update
2023-09-29 15:51:54 -07:00
Meng Zhang
0d6840e372
feat: support request level stop words ( #492 )
2023-09-29 18:21:57 +00:00
Meng Zhang
486e507079
fix: correct Decoding behavior in incremental manner ( #491 )
...
* feat: implement IncrementalDecoding
* refactor: use IncrementalDecoding for ctranslate2
* refactor: rename StopWords to DecodingFactory
* refactor: move decoding logic to tabby-inference
* feat: optimize decoding range
* cleanup
2023-09-29 13:06:47 +00:00
Meng Zhang
5d9ca6928c
feat: update llama.cpp ( #488 )
...
* feat: update llama.cpp
* remove useless include
2023-09-28 23:59:59 +00:00
Meng Zhang
a159c2358d
refactor: move generate / generate_stream to /v1beta ( #487 )
2023-09-28 23:58:17 +00:00
Meng Zhang
56b7b850af
fix: Linkage issue on latest xcode commandline tools clang ( #486 )
2023-09-28 17:46:02 +00:00
Meng Zhang
44f013f26e
feat: add /generate and /generate_streaming ( #482 )
...
* feat: add generate_stream interface
* extract engine::create_engine
* feat add generate::generate
* support streaming in llama.cpp
* support streaming in ctranslate2
* update
* fix formatting
* refactor: extract helpers functions
2023-09-28 17:20:50 +00:00
Meng Zhang
d42942c379
feat: support ModelScope for model registry downloading ( #477 )
...
* feat: update cache info file after each file got downloaded
* refactor: extract Downloader for model downloading logic
* refactor: extract HuggingFaceRegistry
* refactor: extract serde_json to workspace dependency
* feat: add ModelScopeRegistry
* refactor: extract registry to its sub dir.
* feat: add scripts to mirror hf model to modelscope
2023-09-26 11:52:11 -07:00
胡锋
fb5a5971d3
feat: proxy server address mapping to the model server ( #461 )
...
* feat: proxy server address mapping to the model server
* fix: add swagger in Config
* refactor: add_proxy_server
* fix: missing semicolo
2023-09-21 07:06:51 +00:00