
Qwen 2.5 72B Math Instruct

The largest model in the Qwen 2.5 Math family of LLMs, instruction-tuned for math


Example usage

Qwen 2.5 72B Math Instruct is served through an OpenAI-compatible API, so it can be called with the OpenAI SDK in any supported language.

Input
from openai import OpenAI
import os

model_url = ""  # Copy in from API pane in Baseten model dashboard

# Point the OpenAI client at the Baseten deployment instead of the default endpoint
client = OpenAI(
    api_key=os.environ["BASETEN_API_KEY"],
    base_url=model_url,
)

# Chat completion
response_chat = client.chat.completions.create(
    model="",
    messages=[
        {"role": "user", "content": "Tell me a fun fact about cats."}
    ],
    temperature=0.3,
    max_tokens=100,
)
print(response_chat)
JSON output
{
    "id": "143",
    "choices": [
        {
            "finish_reason": "stop",
            "index": 0,
            "logprobs": null,
            "message": {
                "content": "[Model output here]",
                "role": "assistant",
                "audio": null,
                "function_call": null,
                "tool_calls": null
            }
        }
    ],
    "created": 1741224586,
    "model": "",
    "object": "chat.completion",
    "service_tier": null,
    "system_fingerprint": null,
    "usage": {
        "completion_tokens": 145,
        "prompt_tokens": 38,
        "total_tokens": 183,
        "completion_tokens_details": null,
        "prompt_tokens_details": null
    }
}
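
Most callers need only a few fields from this response. The sketch below shows one way to pull them out of a payload shaped like the JSON output above. `summarize_response` and `SAMPLE_RESPONSE` are hypothetical names introduced for illustration, and the sample payload is a trimmed copy of the placeholder output, not a real model reply.

```python
import json

# Trimmed copy of the placeholder JSON output above (an assumption for
# illustration; a real response carries the model's actual text).
SAMPLE_RESPONSE = json.loads("""
{
    "id": "143",
    "choices": [
        {
            "finish_reason": "stop",
            "index": 0,
            "message": {"content": "[Model output here]", "role": "assistant"}
        }
    ],
    "model": "",
    "object": "chat.completion",
    "usage": {"completion_tokens": 145, "prompt_tokens": 38, "total_tokens": 183}
}
""")


def summarize_response(payload: dict) -> dict:
    """Hypothetical helper: extract the commonly used chat completion fields."""
    first_choice = payload["choices"][0]
    usage = payload.get("usage") or {}
    return {
        "text": first_choice["message"]["content"],
        "finish_reason": first_choice["finish_reason"],
        # "length" means generation stopped because max_tokens was reached
        "truncated": first_choice["finish_reason"] == "length",
        "total_tokens": usage.get("total_tokens"),
    }


summary = summarize_response(SAMPLE_RESPONSE)
print(summary["text"])
print(summary["total_tokens"])
```

When working with the SDK object directly, the same fields are available as attributes, for example `response_chat.choices[0].message.content` and `response_chat.usage.total_tokens`.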

large language models

Llama 4 Maverick (Model API, LLM): V4.0 - Instruct - vLLM - B200
DeepSeek-V3 (Model API, LLM): V3 - SGLang - B200
DeepSeek-R1 (Model API, LLM): R1 - SGLang - B200

Qwen models

Qwen 3 235B (LLM): V3 - SGLang - H100
Qwen 3 4B (LLM): V3 - TRT-LLM - H100
Qwen 3 32B (LLM): V3 - TRT-LLM - H100

🔥 Trending models