To determine the mapping between the GPU card type and its compute capability, please visit this page
Tabby supports replicating models on multiple GPUs to increase throughput. You can specify the devices for model replication by using the --device-indices option.