Switch transcription comparison to Voxtral
This commit is contained in:
@@ -103,15 +103,15 @@ services:
|
||||
- "--port"
|
||||
- "8000"
|
||||
- "--max-model-len"
|
||||
- "32768"
|
||||
- "16384"
|
||||
- "--gpu-memory-utilization"
|
||||
- "0.62"
|
||||
- "0.55"
|
||||
- "--api-key"
|
||||
- "${VLLM_API_KEY}"
|
||||
- "--max-num-seqs"
|
||||
- "2"
|
||||
- "1"
|
||||
- "--max-num-batched-tokens"
|
||||
- "8192"
|
||||
- "4096"
|
||||
healthcheck:
|
||||
test: ["CMD", "python3", "-c", "import urllib.request; urllib.request.urlopen('http://localhost:8000/health')"]
|
||||
interval: 30s
|
||||
|
||||
Reference in New Issue
Block a user