* feat: implement input truncation for llama-cpp-bindings * set max input length to 1024 * fix: batching tokens with n_batches * fix batching |
||
|---|---|---|
| .. | ||
| engine.h | ||
* feat: implement input truncation for llama-cpp-bindings * set max input length to 1024 * fix: batching tokens with n_batches * fix batching |
||
|---|---|---|
| .. | ||
| engine.h | ||