Meng Zhang
d0c9b56467
fix: chat model loading is wrongly distributed to --model ( #801 )
2023-11-16 03:35:12 +00:00
Meng Zhang
abe0411f03
feat: add ee feature flag to distinguish the OSS and EE offering ( #789 )
...
* feat: add ee feature flag to distinguish the OSS offering and EE augmented offering
* [autofix.ci] apply automated fixes
* fix lint
---------
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2023-11-14 16:15:15 -08:00
Meng Zhang
bd072d8fb9
feat(ui): always display chat api endpoints in swagger doc ( #784 )
2023-11-14 21:10:41 +00:00
Meng Zhang
618009373b
refactor: use tarpc for easier worker <-> hub communication ( #781 )
...
* temp
* generic
* adapt client
* rename to api
* Revert "rename to api"
This reverts commit 8a51b24fecd76a78e6df576ec51605b8d8418975.
* refactor: remove useless mutation
* remove useless connection
* cleanup api structure
* restructure
* add webserver api error
* webserver.rs -> server.rs
* rename service to Hub
* update schema
* update naming
* shrink features
* update
* mv worker.rs -> server/worker.rs
2023-11-14 12:48:20 -08:00
Meng Zhang
e521f0637c
feat: add worker command worker::completion and worker::chat ( #778 )
2023-11-13 23:21:57 +00:00
Meng Zhang
e0017cadec
refactor: extract routes/ to share routes between commands ( #774 )
...
* refactor: extract routes/ to share routes between commands
* refactor: extract events api
* extract EventLogger service
* lift api into sub packages
* services completions -> completion
* remove useless code
* fix test
2023-11-13 06:24:20 +00:00
Meng Zhang
bad87a99a2
refactor: serve/engine.rs => services/model.rs
2023-11-12 20:41:15 -08:00
Meng Zhang
4359b0cc4b
refactor: extract CompletionState -> CompletionService ( #773 )
...
* refactor: extract CompletionState -> CompletionService
* fix comment
* Update README.md
cmake is preinstalled in ubuntu / debian
* fix compile error
* format files
* format files
---------
Co-authored-by: darknight <illuminating.me@gmail.com>
2023-11-12 16:14:58 -08:00
Meng Zhang
febfa18e4a
refactor: move code api to tabby/serve ( #771 )
2023-11-12 22:58:15 +00:00
Meng Zhang
17db8a2950
refactor: use unwrap_or_default as suggested by clippy nightly
2023-11-12 13:48:55 -08:00
Meng Zhang
15f768a971
feat: add tabby-webserver as distribution layer of tabby #769
2023-11-12 13:07:07 -08:00
Meng Zhang
d7180ec7b9
fix: remove useless hyper-reverse-proxy
2023-11-11 14:42:57 -08:00
Meng Zhang
8e58ff4003
refactor: move chat.rs to services/ rename search.rs -> code.rs
2023-11-11 14:42:57 -08:00
Meng Zhang
4069fe735f
refactor: remove useless route check for playground
2023-11-11 14:42:57 -08:00
Meng Zhang
7695a667bb
refactor: move tabby/src/search.rs -> tabby/src/services/search.rs
2023-11-11 14:42:57 -08:00
Meng Zhang
fce94f622b
refactor: use Arc<dyn TextGeneration> and Arc<dyn CodeSearch>
2023-11-11 14:42:57 -08:00
Eric
22592374c1
refactor: add experimental-http feature ( #750 )
...
* add experimental-http feature, update code
* refactor: add experimental-http feature
2023-11-11 17:50:16 +00:00
Meng Zhang
41f60d3204
refactor(code): extract `search_in_language` ( #762 )
...
* chore: init tabby-webserver
* add code search worker registry
* add webserver command
* add graphql
* extract schema
* refactor: extract registry.rs
* refactor
* update
* update
* update
* update
* update
* fix lint
2023-11-10 17:29:50 -08:00
Meng Zhang
9380da130e
fix: fix tests
2023-11-10 14:57:15 -08:00
Meng Zhang
4068d6e81d
refactor: extract BoxCodeSearch as interface to CodeSearch ( #756 )
2023-11-10 22:55:51 +00:00
Meng Zhang
b510f61aca
refactor: extract tabby_common::api::code / tabby_common::index::CodeSearchSchema ( #743 )
...
* refactor: extract tabby_common::api::code
mark CodeSearch being Send + Sync
* extract CodeSearchSchema
2023-11-10 10:11:13 -08:00
Meng Zhang
03ff80efdb
feat: update tabby-ui
2023-11-08 16:07:39 -08:00
Meng Zhang
b51520062a
refactor: extract ChatState -> ChatService ( #730 )
2023-11-08 22:12:29 +00:00
Meng Zhang
72d1d9f0bb
refactor: extract IndexServer into CodeSearchService ( #728 )
...
* refactor: extract IndexServer into CodeSearchService
* refactor: make CodeSearchService interface to be async
2023-11-08 21:42:03 +00:00
Meng Zhang
8ab35b2639
feat: add --parallelism to control throughput and vram usage ( #727 )
...
* feat: add --parallelism to control throughput and vram usage
* update default
* Revert "update default"
This reverts commit 349792c0d48d913dcd8be4ce1c9d7ce887918f29.
* cargo fmt
2023-11-08 18:31:22 +00:00
Meng Zhang
1ad0d39903
fix: deadlock between background job and requests ( #720 )
...
* fix: deadlock between background job and requests
* refactor: extract LlamaService
2023-11-07 13:11:28 -08:00
Meng Zhang
3c3b14c9f5
fix: cuda serialization
2023-11-07 00:55:38 -08:00
Meng Zhang
00e0c4fddc
chore: add machete check to ensure no unused dependencies ( #701 )
...
* refactor: remove useless dependencies
* add machete
2023-11-05 02:48:05 +00:00
Meng Zhang
fc9c9f644b
Release 0.6.0-dev
...
http-api-bindings@0.6.0-dev
llama-cpp-bindings@0.6.0-dev
tabby@0.6.0-dev
tabby-common@0.6.0-dev
tabby-download@0.6.0-dev
tabby-inference@0.6.0-dev
tabby-scheduler@0.6.0-dev
Generated by cargo-workspaces
2023-11-03 18:04:12 -07:00
Meng Zhang
ec8d88de0d
chore: release 0.5.0 ( #697 )
...
* Release 0.5.0-rc.0
http-api-bindings@0.5.0-rc.0
llama-cpp-bindings@0.5.0-rc.0
tabby@0.5.0-rc.0
tabby-common@0.5.0-rc.0
tabby-download@0.5.0-rc.0
tabby-inference@0.5.0-rc.0
tabby-scheduler@0.5.0-rc.0
Generated by cargo-workspaces
* fix: docker branch tag should only generate when not empty
* Release 0.5.0-rc.1
http-api-bindings@0.5.0-rc.1
llama-cpp-bindings@0.5.0-rc.1
tabby@0.5.0-rc.1
tabby-common@0.5.0-rc.1
tabby-download@0.5.0-rc.1
tabby-inference@0.5.0-rc.1
tabby-scheduler@0.5.0-rc.1
Generated by cargo-workspaces
* fix: handlebar syntax in meta action
* Release 0.5.0-rc.2
http-api-bindings@0.5.0-rc.2
llama-cpp-bindings@0.5.0-rc.2
tabby@0.5.0-rc.2
tabby-common@0.5.0-rc.2
tabby-download@0.5.0-rc.2
tabby-inference@0.5.0-rc.2
tabby-scheduler@0.5.0-rc.2
Generated by cargo-workspaces
* fix: handlebar syntax in meta action
* Release 0.5.0-rc.3
http-api-bindings@0.5.0-rc.3
llama-cpp-bindings@0.5.0-rc.3
tabby@0.5.0-rc.3
tabby-common@0.5.0-rc.3
tabby-download@0.5.0-rc.3
tabby-inference@0.5.0-rc.3
tabby-scheduler@0.5.0-rc.3
Generated by cargo-workspaces
* docs: update change log and docs
* fix: collect_snippet should handle NotReady error
* Release 0.5.0-rc.4
http-api-bindings@0.5.0-rc.4
llama-cpp-bindings@0.5.0-rc.4
tabby@0.5.0-rc.4
tabby-common@0.5.0-rc.4
tabby-download@0.5.0-rc.4
tabby-inference@0.5.0-rc.4
tabby-scheduler@0.5.0-rc.4
Generated by cargo-workspaces
* Release 0.5.0
http-api-bindings@0.5.0
llama-cpp-bindings@0.5.0
tabby@0.5.0
tabby-common@0.5.0
tabby-download@0.5.0
tabby-inference@0.5.0
tabby-scheduler@0.5.0
Generated by cargo-workspaces
2023-11-03 18:02:03 -07:00
Meng Zhang
e4efcc4091
fix: avoid special keywords (e.g. AND) failing the query parsing ( #695 )
2023-11-03 01:13:28 +00:00
Meng Zhang
0e4a2d2a12
feat: simplify download management, model file should be able to indi… ( #690 )
...
* feat: simplify download management, model file should be able to individually introduced
* fix typo
* update local model support
* update spec back
* update spec
* update
* update
2023-11-02 16:01:04 -07:00
Meng Zhang
296342efd8
refactor: use llama.cpp tokenizer ( #683 )
...
* refactor: switch to llama.cpp tokenizer to simplify implementation
* refactor: remove tokenizer dependency from tabby
* refactor: renaming decoding to stop condition
* refactor: remove tokenizer dependency
* refactor: remove submodule
* chore: update formatting
* move tokenization to c++
2023-10-31 22:16:09 +00:00
Meng Zhang
f15926f233
chore(ui): update crates/tabby/ui
2023-10-31 08:51:22 -07:00
Meng Zhang
3f1f8bfd30
fix: add /swagger to tabby ui handler
2023-10-30 17:36:22 -07:00
Meng Zhang
73758e207d
feat: improve dashboard UI ( #677 )
2023-10-30 21:47:38 +00:00
Meng Zhang
b4772fbcd0
feat(ui): add dashboard ( #674 )
...
* feat(ui): add dashboard
* handle path
2023-10-30 07:29:50 +00:00
Meng Zhang
de827b1e74
Revert "feat: make --model optional ( #668 )" ( #672 )
...
This reverts commit c55e4481ba.
2023-10-29 21:44:11 -07:00
Meng Zhang
c55e4481ba
feat: make --model optional ( #668 )
2023-10-30 00:04:42 +00:00
Meng Zhang
2ee5dbfd4f
chore: move tabby-ui under ee license. ( #667 )
...
* chore: introduce tabby-ui EE license.
* update
2023-10-29 15:56:57 -07:00
Meng Zhang
8c680a73fb
feat(ui): add /api page ( #665 )
...
* refactor(tabby-ui): extract tabby-fetcher
* feat(tabby-ui): add /api page
* feat(tabby-ui): add chat model badge
* fix: add components.json for shadcn
* chore: release tabby-ui
2023-10-29 14:55:50 -07:00
Meng Zhang
7bd99d14c0
feat: support continuous batching in llama.cpp backend ( #659 )
...
* refactor: switch back to llama batch interface
* feat: support cont batching
2023-10-28 23:37:05 -07:00
Meng Zhang
14d03b6826
fix(ui): handle invalid semver error
2023-10-28 23:30:35 -07:00
Meng Zhang
43cc5f38cc
feat: do not download ctranslate2 files in downloader
2023-10-28 02:27:09 -07:00
Meng Zhang
23bd542cec
feat: switch cuda backend to llama.cpp ( #656 )
...
* feat: switch cuda backend to llama.cpp
* fix
* fix
2023-10-27 13:41:22 -07:00
Meng Zhang
f37840566b
feat: upgrade llama.cpp ( #645 )
...
* feat: upgrade llama.cpp
* update download files
* update changelog
* Update CHANGELOG.md
* Update CHANGELOG.md
2023-10-27 12:18:46 -07:00
Meng Zhang
62054cb4f1
feat: add new version notice in tabby-ui ( #640 )
...
* refactor: extract useHealth hook
* feat: add new version notice
2023-10-26 00:48:18 +00:00
Meng Zhang
9ecbf9031f
refactor: rename tabby-playground to tabby-ui ( #639 )
2023-10-25 23:05:11 +00:00
Meng Zhang
1a4c2aa71f
feat: switch cpu backend to llama.cpp ( #638 )
...
* feat: switch Cpu backend to llama.cpp
* feat: switch cpu serving to ggml
* fix cargo.toml
* use optional dependency
* fix compilation
* update ci target
2023-10-25 15:40:11 -07:00
Meng Zhang
21ec60eddf
feat: add server.completion_timeout to control timeout of /v1/completion ( #637 )
...
* feat: add server.completion_timeout to control timeout of /v1/completion
* Update config.rs
2023-10-25 22:05:23 +00:00