Meng Zhang
b1481b0e2e
chore: release 0.6.0 ( #882 )
...
* add loadtest
* release 0.6.0
* Release 0.6.0-rc.0
http-api-bindings@0.6.0-rc.0
juniper-axum@0.6.0-rc.0
llama-cpp-bindings@0.6.0-rc.0
tabby@0.6.0-rc.0
tabby-common@0.6.0-rc.0
tabby-download@0.6.0-rc.0
tabby-inference@0.6.0-rc.0
tabby-scheduler@0.6.0-rc.0
tabby-webserver@0.6.0-rc.0
Generated by cargo-workspaces
* Release 0.6.0-rc.1
http-api-bindings@0.6.0-rc.1
juniper-axum@0.6.0-rc.1
llama-cpp-bindings@0.6.0-rc.1
tabby@0.6.0-rc.1
tabby-common@0.6.0-rc.1
tabby-download@0.6.0-rc.1
tabby-inference@0.6.0-rc.1
tabby-scheduler@0.6.0-rc.1
tabby-webserver@0.6.0-rc.1
Generated by cargo-workspaces
* Release 0.6.0-rc.2
http-api-bindings@0.6.0-rc.2
juniper-axum@0.6.0-rc.2
llama-cpp-bindings@0.6.0-rc.2
tabby@0.6.0-rc.2
tabby-common@0.6.0-rc.2
tabby-download@0.6.0-rc.2
tabby-inference@0.6.0-rc.2
tabby-scheduler@0.6.0-rc.2
tabby-webserver@0.6.0-rc.2
Generated by cargo-workspaces
* Release 0.6.0
http-api-bindings@0.6.0
juniper-axum@0.6.0
llama-cpp-bindings@0.6.0
tabby@0.6.0
tabby-common@0.6.0
tabby-download@0.6.0
tabby-inference@0.6.0
tabby-scheduler@0.6.0
tabby-webserver@0.6.0
Generated by cargo-workspaces
2023-11-27 14:57:45 +08:00
Meng Zhang
8654116126
docs: add date to previous releases
2023-11-19 17:13:38 -08:00
Meng Zhang
d8e83d0610
refactor: extract run_app function ( #843 )
...
* refactor: extract run_app function
* lint
* change route to /metrics
* refactor
2023-11-20 01:00:35 +00:00
Jonathan Poisson
6dabecc3ee
feat: add Prometheus support to Tabby. ( #838 )
...
* Added Prometheus support to Tabby.
1) Added `axum-prometheus` to Cargo.toml
2) Added `metrics.rs` as a simple route to produce a metrics endpoint with prometheus-formatted events
3) Added `/v1/metrics` endpoint with API doc entry
4) Added the `PrometheusLayer` to the root layers.
This change effectively allows for external Prometheus to scrape metrics from Tabby during execution in order to monitor operations (resources, timings and overall usage).
* changed application/text to text/plain to adhere to rfc1341
* Update Makefile
* Update Makefile
* Update Makefile
* Update Makefile
* Update Makefile
* Update Makefile
* - Reworked code to fit upstream changes
- Added Prometheus layer and handle to worker.rs
* Update CHANGELOG.md
* [autofix.ci] apply automated fixes
* Simplified worker metrics route definition
* [autofix.ci] apply automated fixes
* [autofix.ci] apply automated fixes (attempt 2/3)
---------
Co-authored-by: Meng Zhang <meng@tabbyml.com>
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2023-11-19 15:40:57 -08:00
Meng Zhang
7dbbfc39c2
docs: update CHANGElog.md for 0.6.0
2023-11-19 15:35:38 -08:00
Meng Zhang
25621547a7
feat: make --model optional, so user can start a chat only instance ( #817 )
2023-11-17 22:35:58 +00:00
Meng Zhang
73a76a3d8e
feat(scheduler): add a tqdm bar for scheduler job to better present the remaining time. ( #754 )
...
* feat(scheduler): add a tqdm bar for scheduler job to better present the
remaining time.
* update
* add changelog
2023-11-10 19:52:07 +00:00
Meng Zhang
e6fb1b6ac0
docs: add v0.5.5 CHANGELOG.md
2023-11-09 00:36:35 -08:00
Meng Zhang
6bea2a4a78
docs: add v0.5.4 to changelog
2023-11-07 13:27:52 -08:00
Meng Zhang
8d7724b30a
chore: Update CHANGELOG.md
2023-11-07 00:58:25 -08:00
Meng Zhang
cd8edc2fef
Update CHANGELOG.md
2023-11-07 00:35:25 -08:00
Meng Zhang
ec8d88de0d
chore: release 0.5.0 ( #697 )
...
* Release 0.5.0-rc.0
http-api-bindings@0.5.0-rc.0
llama-cpp-bindings@0.5.0-rc.0
tabby@0.5.0-rc.0
tabby-common@0.5.0-rc.0
tabby-download@0.5.0-rc.0
tabby-inference@0.5.0-rc.0
tabby-scheduler@0.5.0-rc.0
Generated by cargo-workspaces
* fix: docker branch tag should only generate when not empty
* Release 0.5.0-rc.1
http-api-bindings@0.5.0-rc.1
llama-cpp-bindings@0.5.0-rc.1
tabby@0.5.0-rc.1
tabby-common@0.5.0-rc.1
tabby-download@0.5.0-rc.1
tabby-inference@0.5.0-rc.1
tabby-scheduler@0.5.0-rc.1
Generated by cargo-workspaces
* fix: handlebar syntax in meta action
* Release 0.5.0-rc.2
http-api-bindings@0.5.0-rc.2
llama-cpp-bindings@0.5.0-rc.2
tabby@0.5.0-rc.2
tabby-common@0.5.0-rc.2
tabby-download@0.5.0-rc.2
tabby-inference@0.5.0-rc.2
tabby-scheduler@0.5.0-rc.2
Generated by cargo-workspaces
* fix: handlebar syntax in meta action
* Release 0.5.0-rc.3
http-api-bindings@0.5.0-rc.3
llama-cpp-bindings@0.5.0-rc.3
tabby@0.5.0-rc.3
tabby-common@0.5.0-rc.3
tabby-download@0.5.0-rc.3
tabby-inference@0.5.0-rc.3
tabby-scheduler@0.5.0-rc.3
Generated by cargo-workspaces
* docs: update change log and docs
* fix: collect_snippet should handle NotReady error
* Release 0.5.0-rc.4
http-api-bindings@0.5.0-rc.4
llama-cpp-bindings@0.5.0-rc.4
tabby@0.5.0-rc.4
tabby-common@0.5.0-rc.4
tabby-download@0.5.0-rc.4
tabby-inference@0.5.0-rc.4
tabby-scheduler@0.5.0-rc.4
Generated by cargo-workspaces
* Release 0.5.0
http-api-bindings@0.5.0
llama-cpp-bindings@0.5.0
tabby@0.5.0
tabby-common@0.5.0
tabby-download@0.5.0
tabby-inference@0.5.0
tabby-scheduler@0.5.0
Generated by cargo-workspaces
2023-11-03 18:02:03 -07:00
Meng Zhang
2adcc0726c
feat: support prefix query on name field ( #694 )
...
* feat: support prefix phase query on name field
* update changelog
2023-11-03 01:04:33 +00:00
Meng Zhang
296342efd8
refactor: use llama.cpp tokenizer ( #683 )
...
* refactor: switch to llama.cpp tokenizer to simplify implementation
* refactor: remove tokenizer dependency from tabby
* refactor: renaming decoding to stop condition
* refactor: remove tokenizer dependency
* refactor: remove submodule
* chore: update formatting
* move tokenization to c++
2023-10-31 22:16:09 +00:00
Meng Zhang
de827b1e74
Revert "feat: make --model optional ( #668 )" ( #672 )
...
This reverts commit c55e4481ba .
2023-10-29 21:44:11 -07:00
Meng Zhang
c55e4481ba
feat: make --model optional ( #668 )
2023-10-30 00:04:42 +00:00
Meng Zhang
23bd542cec
feat: switch cuda backend to llama.cpp ( #656 )
...
* feat: switch cuda backend to llama.cpp
* fix
* fix
2023-10-27 13:41:22 -07:00
Meng Zhang
f37840566b
feat: upgrade llama.cpp ( #645 )
...
* feat: upgrade llama.cpp
* update download files
* update changelog
* Update CHANGELOG.md
* Update CHANGELOG.md
2023-10-27 12:18:46 -07:00
Meng Zhang
1a4c2aa71f
feat: swtich cpu backend to llama.cpp ( #638 )
...
* feat: swtich Cpu backend to llama.cpp
* feat: switch cpu serving to ggml
* fix cargo.toml
* use optional dependency
* fix compliation
* update ci target
2023-10-25 15:40:11 -07:00
Meng Zhang
cd14ad723b
docs: update changelog for 0.4.0 ( #629 )
2023-10-24 16:26:23 -07:00
Meng Zhang
422b27d2ed
feat: support local directory for Repository.git_url ( #622 )
2023-10-24 01:29:38 +00:00
Meng Zhang
3087f38bb4
docs: Update CHANGELOG.md
2023-10-21 12:36:48 -07:00
Meng Zhang
12686f9d6c
Update CHANGELOG.md
2023-10-21 12:35:12 -07:00
Meng Zhang
933506d2ce
docs: Update CHANGELOG.md
2023-10-21 12:33:05 -07:00
Meng Zhang
7ae1a97d57
docs: update changelog and documentation to indicate golang support ( #557 )
...
* docs: update changelog and documentation to indicate golang support
* Update programming-languages.md
2023-10-14 00:59:07 -07:00
Meng Zhang
d45d22e159
docs: update CHANGELOG for v0.3.0
2023-10-13 17:44:19 -07:00
Meng Zhang
dfa6ed76b1
docs: update changelog for v0.3.0
2023-10-13 17:34:19 -07:00
Meng Zhang
9db4fa7ffd
docs: update CHANGELOG.md for v0.2.2
2023-10-13 13:19:57 -07:00
Meng Zhang
5a6fa6defd
docs: Update CHANGELOG.md
2023-10-03 17:37:11 -07:00
Meng Zhang
2d5b3e4ff5
chore: release v0.2.1
2023-10-03 17:13:39 -07:00
Meng Zhang
375675c6b4
docs: Update CHANGELOG.md
2023-10-03 13:44:17 -07:00
Meng Zhang
ceaa7ab012
chore: update main branch to v0.3.0-dev
2023-10-03 13:38:27 -07:00
Meng Zhang
b6ccf3df66
docs: update changelog for v0.2.0
2023-10-03 13:32:21 -07:00