Commit Graph

687 Commits (fd2a1ab86509fdec297d5023ee419a277e86c973)

Author SHA1 Message Date
Meng Zhang fd2a1ab865
fix: switch back to regex based implementation for stop words (#513) 2023-10-06 09:04:37 +00:00
Meng Zhang 4c00ac06fb
fix(download): mark ggml model downloading should be optional, as ggml is only used for metal backend for now (#512) 2023-10-05 16:54:56 +00:00
Meng Zhang 9cd2accbaa
feat: adjust code indexing logic (#510) 2023-10-05 05:29:41 +00:00
Meng Zhang 1babc38902
feat: add datset viewer, remove treesitter languages that hasn't been verified (#509)
* refactor: remove not verified tree sitter queries

* feat(experimental): add dataset viewer

update
2023-10-04 21:49:42 -07:00
Meng Zhang 55f68d4224
test: unit test for indexing job (#508)
* test: unit test for indexing job

* update

* reduce test fixture length
2023-10-04 23:27:19 +00:00
Meng Zhang e0b2a775d8 docs: add m1/m2 support status to chat model in model directory 2023-10-04 15:04:45 -07:00
Meng Zhang 6b38b32117 docs: add chat_template to model spec 2023-10-04 15:04:45 -07:00
Meng Zhang f7ebce2514
refactor: deprecate --compute-type (#505) 2023-10-04 18:45:34 +00:00
Meng Zhang b16aeaebc0 docs: add HuggingFace link on spaces tutorial 2023-10-04 08:45:57 -07:00
Meng Zhang 93b6766fca
docs: Update MODEL_SPEC.md (#504)
Fix #503
2023-10-04 08:33:51 -07:00
Meng Zhang f44e87ff0e
docs(fix): broken links in model directory 2023-10-03 22:28:07 -07:00
Meng Zhang 0a949aa6b6
docs: add modelscope registry information to model directory 2023-10-03 22:23:47 -07:00
Meng Zhang 3600d9e399 chore: update sync.sh models 2023-10-03 20:28:58 -07:00
Meng Zhang e881008112
docs: remove WizardCoder-15B from model directory 2023-10-03 20:27:51 -07:00
Meng Zhang 1d364fd628 feat: add sync.sh for copy-to-modelscope 2023-10-03 20:03:13 -07:00
Meng Zhang 8a03c9bf17 refactor: use / as server url
Swagger access to the page, regardless of host / port, will just work.
2023-10-03 18:33:56 -07:00
Meng Zhang 5a6fa6defd
docs: Update CHANGELOG.md 2023-10-03 17:37:11 -07:00
Meng Zhang 2d5b3e4ff5 chore: release v0.2.1 2023-10-03 17:13:39 -07:00
Meng Zhang 503c44e7c5 fix: playground environment misconfig 2023-10-03 17:10:02 -07:00
Meng Zhang b3b498624c feat: deprecate num_replicas_per_thread, generate default value for it 2023-10-03 17:02:37 -07:00
Meng Zhang 1afba47059 feat: allow set num_replicas_per_device for CUDA to increase throughput 2023-10-03 15:52:25 -07:00
Meng Zhang 375675c6b4
docs: Update CHANGELOG.md 2023-10-03 13:44:17 -07:00
Meng Zhang ceaa7ab012 chore: update main branch to v0.3.0-dev 2023-10-03 13:38:27 -07:00
Meng Zhang 1731c3075e chore: Update version to 0.2.0 2023-10-03 13:32:21 -07:00
Meng Zhang b6ccf3df66 docs: update changelog for v0.2.0 2023-10-03 13:32:21 -07:00
Meng Zhang 6ad99cb23d
docs: update documentation to prepare for 0.2 release (#502)
* docs: fix installation emoji

* docs: set StarCoder-1B to be default model for docker install

* docs: add `--chat-model` in model directory
2023-10-03 13:11:07 -07:00
Meng Zhang 0e5128e8fb feat: add chat_template field in tabby.json 2023-10-03 11:46:05 -07:00
Meng Zhang 7fc76228f7 chore: add debug log for /chat interface 2023-10-03 11:38:58 -07:00
Meng Zhang 692c2fe0fd Release 0.2.0-rc.0
ctranslate2-bindings@0.2.0-rc.0
http-api-bindings@0.2.0-rc.0
llama-cpp-bindings@0.2.0-rc.0
tabby@0.2.0-rc.0
tabby-common@0.2.0-rc.0
tabby-download@0.2.0-rc.0
tabby-inference@0.2.0-rc.0
tabby-scheduler@0.2.0-rc.0

Generated by cargo-workspaces
2023-10-02 19:14:12 -07:00
Meng Zhang 6306bb3f01
fix: if local file doens't exist, local_cache_key should be cleared (#501)
* fix: if local file doens't exist, local_cache_key should be cleared

* fix
2023-10-02 23:48:35 +00:00
Meng Zhang ce20bd6154
refactor: use RegexSet for cleaer stop regex construction (#499)
* fix: add a regression test cased for stop words regex matching

* refactor: use RegexSet for cleaer stop regex construction
2023-10-02 23:21:51 +00:00
Meng Zhang 63612d5a67
fix(tabby-download): even when prefer_local_file is set to true, we should still check for remote (if network is avaialble), to see if a file should be upgraded (#500) 2023-10-02 23:09:57 +00:00
Meng Zhang 80a17aea37
feat: only show /v1/chat api if --chat-model is set (#498) 2023-10-02 17:17:27 +00:00
Meng Zhang f05dd3a2f6
refactor: cleanup chat api make it message oriented (#497)
* refactor: refactor into /chat/completions api

* Revert "feat: support request level stop words (#492)"

This reverts commit 0d6840e372.

* feat: adjust interface

* switch interface in tabby-playground

* move to chat/prompt, add unit test

* update interface
2023-10-02 15:39:15 +00:00
Meng Zhang dfdd0373a6 fix: when llama model loads failed, panic in rust stack 2023-10-01 22:25:25 -07:00
Meng Zhang 1fd3adbe0c
docs: add a snippet explaining the streaming example 2023-09-30 18:05:12 -07:00
Meng Zhang f7ecab5bca
docs: change `consumer` to `client` 2023-09-30 17:42:24 -07:00
Lucy Gao 6348018d38
docs: add back-pressure and cancellation blog (#479)
* docs: add back-pressure and cancellation blog

* fix(doc): format and content improvement for the back-pressure blog (#496)

* Minor editorial changes on cancellation blog.

* adjust blog structure

* rename blog title

---------

Co-authored-by: Wang Zixiao <wayne.wang0821@gmail.com>
Co-authored-by: Meng Zhang <meng@tabbyml.com>
2023-09-30 10:47:09 -07:00
Meng Zhang 2171ba72ff
refactor: cleanup llama cpp implementations to fix warnings (#495) 2023-09-30 08:37:36 -07:00
Meng Zhang aea8c74bdc feat: add OpenAPI link to playground 2023-09-29 18:20:38 -07:00
Meng Zhang 10bf2d6c0c
feat: add param --instruct-model, allowing specify different model for q&a use cases. (#494) 2023-09-29 23:44:53 +00:00
Meng Zhang 892aa61a53
docs: Update tabby-playground/README.md 2023-09-29 15:54:08 -07:00
Meng Zhang eb15933255
feat: add tabby playground for q&a use case (#493)
* init commit

* support chat

* add theme toggle

* limit message to 2 lines

* update

* update formatting

* update

* update

* update

* fix formatting

* update
2023-09-29 15:51:54 -07:00
Meng Zhang 0d6840e372
feat: support request level stop words (#492) 2023-09-29 18:21:57 +00:00
Meng Zhang 486e507079
fix: correct Decoding behavior in incremental manner (#491)
* feat: implement IncrementalDecoding

* refactor: use IncrementalDecoding for ctranslate2

* refactor: rename StopWords to DecodingFactory

* refactor: move decoding logic to tabby-inference

* feat: optimize decoding range

* cleanup
2023-09-29 13:06:47 +00:00
Zhiming Ma 52c4ef38d3
feat(intellij): update tabby-agent to 0.3.1. (#490) 2023-09-29 03:06:47 -07:00
Zhiming Ma 4ebad71805
fix(agent): hotfix vscode extension 0.6.1. Filter out no data AgentStats event. (#489) 2023-09-28 18:58:44 -07:00
Meng Zhang 5d9ca6928c
feat: update llama.cpp (#488)
* feat: update llama.cpp

* remove useless include
2023-09-28 23:59:59 +00:00
Meng Zhang a159c2358d
refactor: move generate / generate_stream to /v1beta (#487) 2023-09-28 23:58:17 +00:00
Meng Zhang 56b7b850af
fix: Linkage issue on latest xcode commandline tools clang (#486) 2023-09-28 17:46:02 +00:00