Model rankings

Production-ready model rankings.

Compare hosted models by scenario, speed, context, pricing, and availability. Start with a shortlist, then open the registry to test or request a route.


Overall model ranking

#1 scx/llama-3.3-70b (Meta) · Global apps and multilingual assistant routes

SCX score: 88

Catalog ready

Live and preview models are grouped so teams can find a usable route faster.

Compare in one table

Context, speed, pricing, status, and best-fit scenarios stay side by side.

Route confidence

Use score bands and dimension leaders to shortlist models before evaluation.

Model intelligence

Use these charts to see leading routes, coverage balance, and the tradeoffs between cost, latency, and context length.

Top production routes (by score)

01 scx/llama-3.3-70b
02 scx/qwen2.5-72b-instruct
03 scx/qwen3-32b
04 scx/glm-4-32b
05 scx/qwen2.5-coder-32b
06 scx/deepseek-r1-32b
07 scx/deepseek-v3
08 scx/llama-3.1-8b-instruct

Score distribution (25 models)

90-100: 0 models
85-89: 13 models
80-84: 6 models
70-79: 6 models
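The band counts in the score distribution can be reproduced from the Score column of the ranking table on this page. The following is a minimal sketch, not part of any registry tooling: `BANDS`, `band_for`, and `score_distribution` are hypothetical names, and the band edges are assumed to be inclusive as the chart labels suggest.

```python
from collections import Counter

# Scores copied from the Score column of the ranking table on this page.
SCORES = [88, 88, 88, 87, 87, 86, 86, 86, 85, 85, 85, 85, 85,
          84, 83, 82, 81, 81, 81, 79, 78, 78, 76, 73, 71]

# Band labels and edges as shown in the chart (assumed inclusive on both ends).
BANDS = [("90-100", 90, 100), ("85-89", 85, 89), ("80-84", 80, 84), ("70-79", 70, 79)]


def band_for(score):
    """Return the label of the band a score falls into."""
    for label, low, high in BANDS:
        if low <= score <= high:
            return label
    return "below 70"


def score_distribution(scores):
    """Count how many scores land in each band, keeping empty bands at 0."""
    counts = Counter(band_for(s) for s in scores)
    return {label: counts.get(label, 0) for label, _, _ in BANDS}


distribution = score_distribution(SCORES)
print(distribution)
# {'90-100': 0, '85-89': 13, '80-84': 6, '70-79': 6}
```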

Provider coverage (7)

Qwen

DeepSeek

Mistral

BAAI

Google

Overall model ranking

A practical starting point for choosing model aliases across common production workloads.

Rank	Model	Category	Provider	Best for	Context	Speed	Price	Status	Score
01	scx/llama-3.3-70b	Multilingual	Meta	Global apps and multilingual assistant routes	128K	Fast	0.88 Credit / 1K output tokens	Live	88
02	scx/qwen2.5-72b-instruct	General chat	Qwen	Copilots, workflows, customer support	128K	Fast	0.95 Credit / 1K output tokens	Live	88
03	scx/qwen3-32b	Reasoning	Qwen	Fast assistants, routing fallback, live UX	128K	Fast	0.78 Credit / 1K output tokens	Live	88
04	scx/glm-4-32b	General chat	Zhipu	Copilots, workflows, customer support	128K	Fast	0.74 Credit / 1K output tokens	Live	87
05	scx/qwen2.5-coder-32b	Coding	Qwen	Code generation, review, migration plans	128K	Fast	0.82 Credit / 1K output tokens	Live	87
06	scx/deepseek-r1-32b	Reasoning	DeepSeek	Agent planning, tool use, structured reasoning	128K	Balanced	1.45 Credit / 1K output tokens	Live	86
07	scx/deepseek-v3	General chat	DeepSeek	Copilots, workflows, customer support	128K	Fast	0.82 Credit / 1K output tokens	Live	86
08	scx/llama-3.1-8b-instruct	General chat	Meta	Low-cost fallback routes and high-volume traffic	128K	Very fast	0.22 Credit / 1K output tokens	Live	86
09	scx/gemma-3-27b-it	Vision	Google	Multimodal apps, image understanding, assistants	128K	Fast	0.58 Credit / 1K output tokens	Live	85
10	scx/internlm2.5-20b-chat	General chat	InternLM	Fast assistants, routing fallback, live UX	32K	Very fast	0.36 Credit / 1K output tokens	Live	85
11	scx/mixtral-8x22b	General chat	Mistral	Copilots, workflows, customer support	64K	Fast	0.78 Credit / 1K output tokens	Live	85
12	scx/qwen2.5-vl-72b	Vision	Qwen	Document vision, screenshots, visual reasoning	128K	Balanced	1.05 Credit / 1K output tokens	Live	85
13	scx/yi-large	Multilingual	01.AI	Global apps and multilingual assistant routes	200K	Balanced	0.92 Credit / 1K output tokens	Live	85
14	scx/internvl3-38b	Vision	OpenGVLab	Document vision, screenshots, visual reasoning	128K	Balanced	0.96 Credit / 1K output tokens	Live	84
15	scx/mistral-large-2	Coding	Mistral	Code generation, review, migration plans	128K	Balanced	1.10 Credit / 1K output tokens	Live	83
16	scx/deepseek-r1-70b	Reasoning	DeepSeek	Hard reasoning, math, multi-step analysis	128K	Deep	1.95 Credit / 1K output tokens	Preview	82
17	scx/gemma-2-9b-it	General chat	Google	Low-cost fallback routes and high-volume traffic	8K	Very fast	0.16 Credit / 1K output tokens	Live	81
18	scx/minimax-text-01	Long context	MiniMax	Long documents, research packs, policy analysis	1M	Balanced	1.25 Credit / 1K output tokens	Preview	81
19	scx/qwen3-235b-a22b	Reasoning	Qwen	Hard reasoning, math, multi-step analysis	128K	Deep	2.20 Credit / 1K output tokens	Preview	81
20	scx/jina-embeddings-v3	Embeddings	Jina AI	Memory layers, RAG pipelines, long retrieval	8K	Instant	0.10 Credit / 1K input tokens	Live	79
21	scx/bge-m3	Embeddings	BAAI	Semantic search, retrieval, knowledge bases	8K	Instant	0.12 Credit / 1K input tokens	Live	78
22	scx/e5-mistral-7b-instruct	Embeddings	Mistral	Memory layers, RAG pipelines, long retrieval	32K	Very fast	0.18 Credit / 1K input tokens	Live	78
23	scx/bge-reranker-v2-m3	Rerank	BAAI	Search quality, retrieval reranking, answer grounding	8K	Very fast	0.18 Credit / 1K input tokens	Live	76
24	scx/whisper-large-v3	Audio	Whisper	Speech transcription and meeting intelligence	30 min	Balanced	0.20 Credit / minute	Preview	73
25	scx/stable-diffusion-xl	Image	Stability	Image generation, creative workflows, drafts	Image	Balanced	0.65 Credit / image	Preview	71
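
One way to turn the table into a shortlist is to filter on category, context, price, and status, then rank what remains by score. The sketch below is illustrative only: `Route`, `shortlist`, and `output_cost` are hypothetical helpers and not a registry API. The rows are a small subset copied from the table, and the cost estimate assumes only the listed output-token price applies, since the table does not state input-token pricing for chat models.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    model: str
    category: str
    context_k: int   # context window, in thousands of tokens
    price: float     # credits per 1K output tokens, as listed in the table
    status: str      # "Live" or "Preview"
    score: int


# A few rows copied from the ranking table (not the full catalog).
ROUTES = [
    Route("scx/llama-3.3-70b", "Multilingual", 128, 0.88, "Live", 88),
    Route("scx/qwen2.5-72b-instruct", "General chat", 128, 0.95, "Live", 88),
    Route("scx/qwen3-32b", "Reasoning", 128, 0.78, "Live", 88),
    Route("scx/deepseek-r1-32b", "Reasoning", 128, 1.45, "Live", 86),
    Route("scx/llama-3.1-8b-instruct", "General chat", 128, 0.22, "Live", 86),
    Route("scx/deepseek-r1-70b", "Reasoning", 128, 1.95, "Preview", 82),
    Route("scx/gemma-2-9b-it", "General chat", 8, 0.16, "Live", 81),
]


def shortlist(routes, *, category=None, min_context_k=0,
              max_price=float("inf"), live_only=True):
    """Filter routes, then sort by score (highest first) and price (lowest first)."""
    picked = [
        r for r in routes
        if (category is None or r.category == category)
        and r.context_k >= min_context_k
        and r.price <= max_price
        and (not live_only or r.status == "Live")
    ]
    return sorted(picked, key=lambda r: (-r.score, r.price))


def output_cost(route, output_tokens):
    """Estimated credits for a given number of output tokens."""
    return route.price * output_tokens / 1000


reasoning = shortlist(ROUTES, category="Reasoning")
cheap_chat = shortlist(ROUTES, category="General chat",
                       min_context_k=32, max_price=0.5)

for r in reasoning:
    print(r.model, r.score, round(output_cost(r, 2_000_000), 2))
```

With `live_only=True` the Preview route `scx/deepseek-r1-70b` drops out of the reasoning shortlist, and the 32K context floor excludes `scx/gemma-2-9b-it` from the low-cost chat shortlist despite its lower price.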