4 Commits (b2a92f1cf77a93ac3747550ad56b79ab8ca8ed51)

Author SHA1 Message Date
Meng Zhang 8d1303d6e4
fix: properly recycle request id (#920)
2023-11-30 17:01:52 +08:00
Meng Zhang 2b131ad1d2
refactor: handle max output length in StopCondition (#910)
* refactor: handle max output length in StopCondition

* trim stop words

* [autofix.ci] apply automated fixes

---------

Co-authored-by: autofix-ci[bot] <114827586+autofix-ci[bot]@users.noreply.github.com>
2023-11-28 16:57:16 +08:00
Meng Zhang 8c669dee8e
fix: llama.cpp queuing logic (#741)
2023-11-09 08:29:54 +00:00
Meng Zhang 1ad0d39903
fix: deadlock between background job and requests (#720)
* fix: deadlock between background job and requests

* refactor: extract LlamaService
2023-11-07 13:11:28 -08:00