Tune audio model GPU profiles
This commit is contained in:
@@ -36,7 +36,7 @@ services:
|
||||
- "--max-model-len"
|
||||
- "8192"
|
||||
- "--gpu-memory-utilization"
|
||||
- "0.45"
|
||||
- "0.25"
|
||||
- "--api-key"
|
||||
- "${VLLM_API_KEY}"
|
||||
- "--max-num-seqs"
|
||||
@@ -95,7 +95,7 @@ services:
|
||||
- "--max-model-len"
|
||||
- "32768"
|
||||
- "--gpu-memory-utilization"
|
||||
- "0.72"
|
||||
- "0.62"
|
||||
- "--api-key"
|
||||
- "${VLLM_API_KEY}"
|
||||
- "--max-num-seqs"
|
||||
|
||||
Reference in New Issue
Block a user