tabby/crates/llama-cpp-bindings
Meng Zhang ad3b974d5c
feat: implement input truncation for llama-cpp-bindings (#416)
* feat: implement input truncation for llama-cpp-bindings

* set max input length to 1024

* fix: batching tokens with n_batches

* fix batching
2023-09-09 00:20:51 +08:00
..
include feat: implement input truncation for llama-cpp-bindings (#416) 2023-09-09 00:20:51 +08:00
llama.cpp@06fc4020de feat: tune llama metal backend performance (#393) 2023-09-05 10:14:29 +08:00
src feat: implement input truncation for llama-cpp-bindings (#416) 2023-09-09 00:20:51 +08:00
Cargo.toml feat: support cancellation in llama backend 2023-09-03 09:59:40 +08:00
build.rs feat: add cargo test to github actions and run only unit tests in ci [TAB-185] (#390) 2023-09-03 05:04:52 +00:00