Model details
View repositoryExample usage
Qwen 3 ASR 1.7B is a SOTA transcription model developed by Alibaba.
It supports the following languages: Chinese (zh), English (en), Cantonese (yue), Arabic (ar), German (de), French (fr), Spanish (es), Portuguese (pt), Indonesian (id), Italian (it), Korean (ko), Russian (ru), Thai (th), Vietnamese (vi), Japanese (ja), Turkish (tr), Hindi (hi), Malay (ms), Dutch (nl), Swedish (sv), Danish (da), Finnish (fi), Polish (pl), Czech (cs), Filipino (fil), Persian (fa), Greek (el), Hungarian (hu), Macedonian (mk), Romanian (ro)
Input
1from openai import OpenAI
2
3model_id = "" # place model ID here
4
5client = OpenAI(
6 api_key="BASETEN-API-KEY",
7 base_url=f"https://model-{model_id}.api.baseten.co/environments/production/sync/v1"
8)
9
10response = client.chat.completions.create(
11 model="Qwen/Qwen3-ASR-1.7B",
12 stream=False,
13 messages=[
14 {
15 "role": "user",
16 "content": [
17 {
18 "type": "audio_url",
19 "audio_url":
20 {"url": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-ASR-Repo/asr_en.wav"}
21
22 }
23 ]
24 }
25 ],
26)
27
28print(response.choices[0].message.content)JSON output
1null