---
sidebar_position: 4
---
# 🧑‍🔬 Models Directory
## Completion models (For `--model`)
We recommend:

- Small models (under 400M parameters) for CPU devices.
- At least an NVIDIA T4, 10 Series, or 20 Series GPU for 1B to 7B models.
- An NVIDIA V100, A100, 30 Series, or 40 Series GPU for 7B to 13B models.
| Model ID | License | Infilling Support | Apple M1/M2 Support |
|---|---|---|---|
| TabbyML/CodeLlama-13B | Llama2 | ✅ | ✅ |
| TabbyML/CodeLlama-7B | Llama2 | ✅ | ✅ |
| TabbyML/StarCoder-7B | BigCode-OpenRAIL-M | ✅ | ✅ |
| TabbyML/StarCoder-3B | BigCode-OpenRAIL-M | ✅ | ✅ |
| TabbyML/StarCoder-1B | BigCode-OpenRAIL-M | ✅ | ✅ |
| TabbyML/J-350M | BSD-3 | ❌ | ❌ |
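As a sketch of how a completion model from the table is selected, the `--model` flag takes one of the Model IDs above. The exact set of other flags depends on your Tabby version and deployment; the `--device` value shown is an assumption for a CUDA-capable GPU.

```shell
# Serve Tabby with a small completion model (suits the GPU tiers above);
# swap in any Model ID from the table, e.g. TabbyML/StarCoder-7B.
tabby serve --model TabbyML/StarCoder-1B --device cuda
```

On CPU-only machines, omit `--device cuda` (or use `--device cpu`) and prefer the sub-400M models per the recommendations above.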
## Chat models (For `--chat-model`)
To ensure optimal response quality, and given that latency requirements are not stringent in this scenario, we recommend using a model with at least 3B parameters.
| Model ID | License |
|---|---|
| TabbyML/Mistral-7B | Apache 2.0 |
| TabbyML/WizardCoder-3B | OpenRAIL-M |
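The chat model can be served alongside a completion model in the same invocation. This is a minimal sketch assuming a CUDA device; the model choices are just examples from the tables above.

```shell
# Serve a completion model and a chat model together;
# --chat-model picks from the chat models table.
tabby serve \
  --model TabbyML/StarCoder-1B \
  --chat-model TabbyML/Mistral-7B \
  --device cuda
```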