Meng Zhang
ceaa7ab012
chore: update main branch to v0.3.0-dev
2023-10-03 13:38:27 -07:00
Meng Zhang
1731c3075e
chore: Update version to 0.2.0
2023-10-03 13:32:21 -07:00
Meng Zhang
692c2fe0fd
Release 0.2.0-rc.0
...
ctranslate2-bindings@0.2.0-rc.0
http-api-bindings@0.2.0-rc.0
llama-cpp-bindings@0.2.0-rc.0
tabby@0.2.0-rc.0
tabby-common@0.2.0-rc.0
tabby-download@0.2.0-rc.0
tabby-inference@0.2.0-rc.0
tabby-scheduler@0.2.0-rc.0
Generated by cargo-workspaces
2023-10-02 19:14:12 -07:00
Meng Zhang
f05dd3a2f6
refactor: cleanup chat api make it message oriented ( #497 )
...
* refactor: refactor into /chat/completions api
* Revert "feat: support request level stop words (#492 )"
This reverts commit 0d6840e372 .
* feat: adjust interface
* switch interface in tabby-playground
* move to chat/prompt, add unit test
* update interface
2023-10-02 15:39:15 +00:00
Meng Zhang
eb15933255
feat: add tabby playground for q&a use case ( #493 )
...
* init commit
* support chat
* add theme toggle
* limit message to 2 lines
* update
* update formatting
* update
* update
* update
* fix formatting
* update
2023-09-29 15:51:54 -07:00
Meng Zhang
486e507079
fix: correct Decoding behavior in incremental manner ( #491 )
...
* feat: implement IncrementalDecoding
* refactor: use IncrementalDecoding for ctranslate2
* refactor: rename StopWords to DecodingFactory
* refactor: move decoding logic to tabby-inference
* feat: optimize decoding range
* cleanup
2023-09-29 13:06:47 +00:00
Meng Zhang
44f013f26e
feat: add /generate and /generate_streaming ( #482 )
...
* feat: add generate_stream interface
* extract engine::create_engine
* feat add generate::generate
* support streaming in llama.cpp
* support streaming in ctranslate2
* update
* fix formatting
* refactor: extract helpers functions
2023-09-28 17:20:50 +00:00
Meng Zhang
d42942c379
feat: support ModelScope for model registry downloading ( #477 )
...
* feat: update cache info file after each file got downloaded
* refactor: extract Downloader for model downloading logic
* refactor: extract HuggingFaceRegistry
* refactor: extract serde_json to workspace dependency
* feat: add ModelScopeRegistry
* refactor: extract registry to its sub dir.
* feat: add scripts to mirror hf model to modelscope
2023-09-26 11:52:11 -07:00
Meng Zhang
c107c991ff
chore: bump tabby version to 0.1.1
2023-09-17 17:09:56 +08:00
Meng Zhang
f0ed366420
feat: add support vertex-ai http bindings ( #419 )
...
* feat: add support vertex-ai http bindings
* support prefix / suffix
2023-09-09 11:22:58 +00:00
Meng Zhang
17397c8c8c
feat: add http api bindings ( #410 )
...
* feat: add http-api-bindings
* feat: add http-api-bindings
* hand max_input_length
* rename
* update
* update
* add examples/simple.rs
* update
* add default value for stop words
* update
* fix lint
* update
2023-09-09 03:59:42 +00:00
vodkaslime
3c7c8d9293
feat: add cargo test to github actions and run only unit tests in ci [TAB-185] ( #390 )
...
* feat: add cargo test to github actions
* chore: fix lint
* chore: add openblas dependency
* chore: update build dependency
* chore: resolve comments
* chore: fix lint
* chore: fix lint
* chore: test installing dependencies
* chore: refactor integ test
* update ci
* cleanup
---------
Co-authored-by: Meng Zhang <meng@tabbyml.com>
2023-09-03 05:04:52 +00:00
Meng Zhang
b0074d7e30
feat: support cancellation in llama backend [TAB-146] ( #392 )
...
* feat: support cancellation in llama backend
* fix lint
2023-09-03 02:15:54 +00:00
Meng Zhang
3573d4378e
feat: llama.cpp for metal support [TAB-146] ( #391 )
...
* feat: init commit adding llama-cpp-bindings
* add llama.cpp submodule
* add LlamaEngine to hold llama context / llama model
* add cxxbridge
* add basic greedy sampling
* move files
* make compile success
* connect TextGeneration with LlamaEngine
* experimental support llama.cpp
* add metal device
* add Accelerate
* fix namespace for llama-cpp-bindings
* fix lint
* move stepping logic to rust
* add stop words package
* use stop-words in ctranslate2-bindings
* use raw string for regex
* use Arc<Tokenizer> for sharing tokenizers
* refactor: remove useless stop_words_encoding_offset
* switch to tokenizers 0.13.4-rc.3
* fix lints in cpp
* simplify implementation of greedy decoding
* feat: split metal feature for llama backend
* add ci
* update ci
* build tabby bin in ci build
2023-09-03 09:59:07 +08:00
vodkaslime
e5598e63f2
feat: extend language [TAB-181] ( #385 )
...
* feat: extend indexer's language support
* feat: extend language support
* chore: add support for mjs and mts
* chore: fix lint
2023-08-31 07:36:57 +00:00
vodkaslime
2a91a21787
feat: add gpu info to health state [TAB-162] ( #364 )
...
* feat: add gpu info to health response
* chore: error handling
* chore: refactor cpu manager code
* chore: typo
* chore: fix context mutability
* chore: fix context mutability
* feat: add link to NVML lib
* chore: refactor
* lint
* chore: resolve comments
* chore: fix typo
* chore: fix
* chore: resolve comments
* chore: fix
* chore: resolve comments
2023-08-21 18:06:38 +08:00
Meng Zhang
b1ad936033
feat: add version information in health state. ( #363 )
...
* feat: add git_hash in health state
* add more version information in health state
2023-08-20 15:21:12 +00:00
vodkaslime
2026b4dd0e
feat: add architecture/cpu info to health api response [TAB-162] ( #355 )
...
* feat: add architecture, cpu and gpu info to health command
* chore: fix
* chore: fix
* chore: fix
* chore: fix lint
* chore: fix lint
* chore: remove gpu
* chore: resolve comments
* chore: resolve comments
* Update health.rs
---------
Co-authored-by: Meng Zhang <meng@tabbyml.com>
2023-08-15 15:22:03 +00:00
Meng Zhang
d0f6ad2d2a
feat: add anonymous usage tracker ( #342 )
...
* feat: add anonymous usage tracker
* improve deps
* update
* update
2023-08-09 07:31:13 +00:00
Meng Zhang
57c811b30f
fix: improve download logging ( #325 )
...
* Suggest use `-it` so docker run generate progress bar of downloading information properly
* add info! log for model download
2023-08-02 06:30:35 +00:00
Meng Zhang
b8308b7118
refactor: extract TextGeneration trait ( #324 )
...
* add tabby-inference
* extract TextGeneration trait
* format
* Rename TextInferenceEngine to CTranslate2Engine
2023-08-02 06:12:51 +00:00
Meng Zhang
be5fe0d737
feat: add rust prompt rewrite support ( #296 )
2023-07-13 09:31:44 +00:00
Meng Zhang
4388fd0050
feat: support prompt rewriting ( #295 )
...
* refactor: extract PromptBuilder
* feat: load tantivy index in prompt builder
* integrate with searcher
* add enable_prompt_rewrite to control rewrite behavior
* nit docs
* limit 1 snippet per identifier
* extract magic numbers
2023-07-13 09:05:41 +00:00
Meng Zhang
24d17dc1ed
feat: switching index to tag based ( #265 )
...
* add tag based index
* update for python
* fix lint
2023-07-13 06:25:39 +00:00
Meng Zhang
e822d1857f
fix: add retry strategy in tabby download ( #285 )
2023-07-09 05:18:19 +00:00
Meng Zhang
fcbc5edc55
Revert "feat: add /experimental/search endpoint ( #258 )" ( #260 )
...
This reverts commit 04980160e5 .
2023-06-22 14:23:35 -07:00
Meng Zhang
04980160e5
feat: add /experimental/search endpoint ( #258 )
...
* feat: add /experimental/search endpoint
* fix format
2023-06-22 20:47:32 +00:00
Meng Zhang
8c1f6c2806
feat(scheduler): add java / python tags support ( #259 )
2023-06-22 20:25:56 +00:00
Meng Zhang
7ed5dd584d
feat: experiment ctags support in scheduler ( #207 )
...
* experiment ctags support
* add document.rs
* extract Document to common
* integrate tags into dataset builder
* skip if none
* do not add scheduler in client binary
* fix fmt
2023-06-21 19:48:13 -07:00
Meng Zhang
9af6607f85
feat: add dataset building ( #231 )
...
* feat: build dataset in scheduler
* add file rotate
2023-06-12 15:19:43 +00:00
Meng Zhang
de546b03fe
feat: add otlp-endpoint for OpenTelemetry support [TAB-67] ( #227 )
...
* feat: add otlp-endpoint for OpenTelemetry support
* set default log level for axum tracing to INFO
* update build enviornment
* update
2023-06-10 22:46:25 -07:00
Meng Zhang
fd1baff8d5
feat: support stop sequences [TAB-52] ( #212 )
...
* refactor: pass step and string token to callback
* add token to callback
* add stop regexp
* implement stop words logic
* pass token_ids from inference
* improve effiency of regexp match with reversed regex
* fmt
* add typescript and javascript stop words
* add cache for stop words regexp
2023-06-06 23:28:58 +00:00
Meng Zhang
249d51d0f5
feat: add indexer [TAB-17] ( #199 )
...
* add basic indexer
* formatting
2023-06-05 22:18:10 +00:00
Meng Zhang
e8a33312bb
refactor: extract download into tabby-download ( #195 )
...
* refactor: extract download into tabby-download
* remove unused deps
2023-06-05 18:40:24 +00:00
Meng Zhang
e8b1c10738
feat: add `tabby scheduler` command ( #194 )
...
* feat: add `tabby scheduler` command
* update test cases
* fix fmt
2023-06-05 18:29:38 +00:00
Meng Zhang
da02d471a9
chore: add tabby-scheduler [TAB-17] ( #192 )
...
* add scheduler
* update fmt
* add integration tests for scheduler
2023-06-05 03:08:43 +00:00
Meng Zhang
2bf5bcd0cf
refactor: extract TextInferenceEngineImpl to reduce duplications between EncoderDecoderImpl and DecoderImpl #189
2023-06-04 22:28:39 +00:00
Meng Zhang
6de61f45bb
chore: mark thread safety [TAB-52] ( #186 )
...
* mark thread safety
* use shared_ptr to ensure thread safety
* fmt
2023-06-04 06:23:31 +00:00
Meng Zhang
2779da3cba
feat: supports FIM inference [TAB-46] ( #183 )
...
* Add prefix / suffix
* update
* feat: support segments in inference
* chore: add tabby.json in model repository to store prompt_template
* make prompt_template optional.
* download tabby.json in downloader
2023-06-02 16:47:48 -07:00
Meng Zhang
e8dbd36663
feat: improve download command - support local cache checking behavior ( #178 )
...
* move download.rs
* add metadata
* support prefer local args
* fix format
* replace errorchain with anyhow
2023-06-01 06:42:04 +00:00
Meng Zhang
f3b37b253b
feat: add events logger ( #170 )
...
* add common events
* log events in serve
* formatting
* extract serde_conv
* update
* update
* fix logging
* update
2023-05-30 15:44:29 -07:00
Meng Zhang
d8cee4adac
chore: extract tabby-common ( #169 )
...
* chore: extract tabby-common
* simplify
2023-05-29 23:39:02 -07:00
Meng Zhang
48796ecd77
feat: add `tabby download` command ( #157 )
...
* simplify fmt-display
* cleanup
* move tabby-admin to reduce nest
* add model downloader
* get rid of model-type
* improve commands
* fix fmt
2023-05-28 14:36:11 -07:00
Meng Zhang
80d1dd1ca6
Add admin panel (behand --experimental-admin-panel) ( #156 )
...
* add admin
* add feature:admin_panel
* update
* add dist directory
* update
2023-05-28 01:17:32 -07:00
Meng Zhang
734957d1de
build mac binary in ci ( #152 )
...
* build mac binary in ci
* update
* update
* turn on verbose
* fix cross compile
* update
* update
* update
* update
* update
* update
* update
2023-05-27 14:31:27 -07:00