"Inference Engineering" is now available. Get your copy here
transcription

Qwen LogoQwen 3 ASR 1.7B

SOTA ASR model that is part of the Qwen family

Model details

View repository

Example usage

Qwen 3 ASR 1.7B is a SOTA transcription model developed by Alibaba.

It supports the following languages:
Chinese (zh), English (en), Cantonese (yue), Arabic (ar), German (de), French (fr), Spanish (es), Portuguese (pt), Indonesian (id), Italian (it), Korean (ko), Russian (ru), Thai (th), Vietnamese (vi), Japanese (ja), Turkish (tr), Hindi (hi), Malay (ms), Dutch (nl), Swedish (sv), Danish (da), Finnish (fi), Polish (pl), Czech (cs), Filipino (fil), Persian (fa), Greek (el), Hungarian (hu), Macedonian (mk), Romanian (ro)

Input
1from openai import OpenAI
2
3model_id = ""  # place model ID here
4
5client = OpenAI(
6    api_key="BASETEN-API-KEY",
7    base_url=f"https://model-{model_id}.api.baseten.co/environments/production/sync/v1"
8)
9
10response = client.chat.completions.create(
11    model="Qwen/Qwen3-ASR-1.7B",
12    stream=False,
13    messages=[
14        {
15            "role": "user",
16            "content": [
17                {
18                    "type": "audio_url",
19                    "audio_url":
20                        {"url": "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-ASR-Repo/asr_en.wav"}
21
22                }
23            ]
24        }
25    ],
26)
27
28print(response.choices[0].message.content)
JSON output
1null

🔥 Trending models