Meng Zhang
|
8d1303d6e4
|
fix: properly recycle request id (#920)
|
2023-11-30 17:01:52 +08:00 |
Meng Zhang
|
2b131ad1d2
|
refactor: handle max output length in StopCondition (#910)
* refactor: handle max output length in StopCondition
* trim stop words
* [autofix.ci] apply automated fixes
---------
Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
|
2023-11-28 16:57:16 +08:00 |
Meng Zhang
|
8c669dee8e
|
fix: llama.cpp queuing logic (#741)
|
2023-11-09 08:29:54 +00:00 |
Meng Zhang
|
1ad0d39903
|
fix: deadlock between background job and requests (#720)
* fix: deadlock between background job and requests
* refactor: extract LlamaService
|
2023-11-07 13:11:28 -08:00 |