Document vLLM audio URL payloads
This commit is contained in:
@@ -59,8 +59,8 @@ Recommended comparison order:
|
|||||||
3. `voxtral-small` (`mistralai/Voxtral-Small-24B-2507`)
|
3. `voxtral-small` (`mistralai/Voxtral-Small-24B-2507`)
|
||||||
|
|
||||||
Qwen2-Audio and Voxtral are called through an OpenAI-compatible
|
Qwen2-Audio and Voxtral are called through an OpenAI-compatible
|
||||||
`/v1/chat/completions` endpoint with `input_audio`; set their endpoint URLs only
|
`/v1/chat/completions` endpoint with vLLM-style `audio_url` data URLs; set
|
||||||
after the models are actually exposed on the AI server.
|
their endpoint URLs only after the models are actually exposed on the AI server.
|
||||||
|
|
||||||
AI-server compose snippets for these temporary comparison endpoints live in
|
AI-server compose snippets for these temporary comparison endpoints live in
|
||||||
`deploy/ai-server/docker-compose.audio.yml`. They are profile-gated because the
|
`deploy/ai-server/docker-compose.audio.yml`. They are profile-gated because the
|
||||||
|
|||||||
Reference in New Issue
Block a user