2023-03-27 04:09:07 +00:00
< div align = "center" >
2023-03-27 04:45:59 +00:00
2023-03-27 04:09:07 +00:00
# 🐾 Tabby
2023-03-16 09:28:20 +00:00
[](https://opensource.org/licenses/Apache-2.0)
2023-03-16 09:26:43 +00:00
[](https://github.com/psf/black)
2023-03-22 15:20:24 +00:00

2023-03-27 04:45:59 +00:00
2023-03-27 04:47:40 +00:00

2023-03-27 04:45:59 +00:00
2023-03-27 04:09:07 +00:00
< / div >
2023-03-16 09:26:43 +00:00
> **Warning**
2023-03-27 04:09:07 +00:00
> Tabby is still in the alpha phrase
An opensource / on-prem alternative to GitHub Copilot.
## Features
* Self-contained, with no need for a DBMS or cloud service
* Web UI for visualizing and configuration models and MLOps.
* OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE).
* Consumer level GPU supports (FP-16 weight loading with various optimization).
2023-03-16 10:23:45 +00:00
2023-03-27 04:54:37 +00:00
## Get started
The easiest way of getting started is using the `deployment/docker-compose.yml` :
```bash
docker-compose up
```
Note: To use GPUs, you need to install the [NVIDIA Container Toolkit ](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html ). We also recommend using NVIDIA drivers with CUDA version 11.8 or higher.
You can then query the server using `/v1/completions` endpoint:
```bash
curl -X POST http://localhost:5000/v1/completions -H 'Content-Type: application/json' --data '{
"prompt": "def binarySearch(arr, left, right, x):\n mid = (left +"
}'
```
We also provides an interactive playground in admin panel [localhost:8501 ](http://localhost:8501 )

2023-03-22 16:21:07 +00:00
2023-03-27 04:45:59 +00:00
## TODOs
2023-03-26 16:13:19 +00:00
2023-03-27 04:09:07 +00:00
* [ ] Fine-tuning models on private code repository.
* [ ] Plot metrics in admin panel (e.g acceptance rate).
* [ ] Production ready (Open Telemetry, Prometheus metrics).
2023-03-27 04:48:58 +00:00
* [ ] Token streaming using Server-Sent Events (SSE)