Catalog ready
Live and preview models are grouped so teams can find a usable route faster.Model rankings
Production-ready model rankings.
Compare hosted models by scenario, speed, context, pricing, and availability. Start with a shortlist, then open the registry to test or request a route.
Open model registryCatalog ready
Overall model ranking
#1 scx/llama-3.3-70bMeta · Global apps and multilingual assistant routes88SCX score
Compare in one table
Context, speed, pricing, status, and best-fit scenarios stay side by side.Route confidence
Use score bands and dimension leaders to shortlist models before evaluation.Model intelligence
Model intelligence
Use these charts to see leading routes, coverage balance, and the tradeoffs between cost, latency, and context length.
Top production routesscore
Score distribution25 models
090-100
1385-89
680-84
670-79
Provider coverage7
Capability coverage25 models
Context envelope25 models
8K4
32K2
64K1
128K14
200K1
1M1
Audio1
Image1
Cost and speed shortlistLow cost / Fast response
Low costscore
01scx/llama-3.1-8b-instruct0.22 Credit / 1K output tokens93
02scx/jina-embeddings-v30.10 Credit / 1K input tokens92
03scx/bge-m30.12 Credit / 1K input tokens91
04scx/e5-mistral-7b-instruct0.18 Credit / 1K input tokens91
05scx/bge-reranker-v2-m30.18 Credit / 1K input tokens89
Fast responsescore
01scx/bge-m3Instant97
02scx/jina-embeddings-v3Instant97
03scx/bge-reranker-v2-m3Very fast93
04scx/e5-mistral-7b-instructVery fast93
05scx/gemma-2-9b-itVery fast93
OverallBest all-around aliases for production trials.ReasoningMulti-step reasoning, agent plans, and tool use.CodeCode generation, review, migration, and copilots.VisionScreenshots, documents, images, and multimodal apps.RetrievalEmbeddings, rerank, RAG, and search quality.ValueLower Credit cost with usable production posture.Low latencyFast response routes for real-time UX.
Ranking view
Overall model ranking
A practical starting point for choosing model aliases across common production workloads.| Rank | Model | Provider | Best for | Context | Speed | Price | Status | Score |
|---|---|---|---|---|---|---|---|---|
| 01 | ME scx/llama-3.3-70bMultilingual | Meta | Global apps and multilingual assistant routes | 128K | Fast | 0.88 Credit / 1K output tokens | Live | 88 |
| 02 | QW scx/qwen2.5-72b-instructGeneral chat | Qwen | Copilots, workflows, customer support | 128K | Fast | 0.95 Credit / 1K output tokens | Live | 88 |
| 03 | QW scx/qwen3-32bReasoning | Qwen | Fast assistants, routing fallback, live UX | 128K | Fast | 0.78 Credit / 1K output tokens | Live | 88 |
| 04 | ZH scx/glm-4-32bGeneral chat | Zhipu | Copilots, workflows, customer support | 128K | Fast | 0.74 Credit / 1K output tokens | Live | 87 |
| 05 | QW scx/qwen2.5-coder-32bCoding | Qwen | Code generation, review, migration plans | 128K | Fast | 0.82 Credit / 1K output tokens | Live | 87 |
| 06 | DE scx/deepseek-r1-32bReasoning | DeepSeek | Agent planning, tool use, structured reasoning | 128K | Balanced | 1.45 Credit / 1K output tokens | Live | 86 |
| 07 | DE scx/deepseek-v3General chat | DeepSeek | Copilots, workflows, customer support | 128K | Fast | 0.82 Credit / 1K output tokens | Live | 86 |
| 08 | ME scx/llama-3.1-8b-instructGeneral chat | Meta | Low-cost fallback routes and high-volume traffic | 128K | Very fast | 0.22 Credit / 1K output tokens | Live | 86 |
| 09 | GO scx/gemma-3-27b-itVision | Multimodal apps, image understanding, assistants | 128K | Fast | 0.58 Credit / 1K output tokens | Live | 85 | |
| 10 | IN scx/internlm2.5-20b-chatGeneral chat | InternLM | Fast assistants, routing fallback, live UX | 32K | Very fast | 0.36 Credit / 1K output tokens | Live | 85 |
| 11 | MI scx/mixtral-8x22bGeneral chat | Mistral | Copilots, workflows, customer support | 64K | Fast | 0.78 Credit / 1K output tokens | Live | 85 |
| 12 | QW scx/qwen2.5-vl-72bVision | Qwen | Document vision, screenshots, visual reasoning | 128K | Balanced | 1.05 Credit / 1K output tokens | Live | 85 |
| 13 | 01 scx/yi-largeMultilingual | 01.AI | Global apps and multilingual assistant routes | 200K | Balanced | 0.92 Credit / 1K output tokens | Live | 85 |
| 14 | OP scx/internvl3-38bVision | OpenGVLab | Document vision, screenshots, visual reasoning | 128K | Balanced | 0.96 Credit / 1K output tokens | Live | 84 |
| 15 | MI scx/mistral-large-2Coding | Mistral | Code generation, review, migration plans | 128K | Balanced | 1.10 Credit / 1K output tokens | Live | 83 |
| 16 | DE scx/deepseek-r1-70bReasoning | DeepSeek | Hard reasoning, math, multi-step analysis | 128K | Deep | 1.95 Credit / 1K output tokens | Preview | 82 |
| 17 | GO scx/gemma-2-9b-itGeneral chat | Low-cost fallback routes and high-volume traffic | 8K | Very fast | 0.16 Credit / 1K output tokens | Live | 81 | |
| 18 | MI scx/minimax-text-01Long context | MiniMax | Long documents, research packs, policy analysis | 1M | Balanced | 1.25 Credit / 1K output tokens | Preview | 81 |
| 19 | QW scx/qwen3-235b-a22bReasoning | Qwen | Hard reasoning, math, multi-step analysis | 128K | Deep | 2.20 Credit / 1K output tokens | Preview | 81 |
| 20 | JI scx/jina-embeddings-v3Embeddings | Jina AI | Memory layers, RAG pipelines, long retrieval | 8K | Instant | 0.10 Credit / 1K input tokens | Live | 79 |
| 21 | BA scx/bge-m3Embeddings | BAAI | Semantic search, retrieval, knowledge bases | 8K | Instant | 0.12 Credit / 1K input tokens | Live | 78 |
| 22 | MI scx/e5-mistral-7b-instructEmbeddings | Mistral | Memory layers, RAG pipelines, long retrieval | 32K | Very fast | 0.18 Credit / 1K input tokens | Live | 78 |
| 23 | BA scx/bge-reranker-v2-m3Rerank | BAAI | Search quality, retrieval reranking, answer grounding | 8K | Very fast | 0.18 Credit / 1K input tokens | Live | 76 |
| 24 | WH scx/whisper-large-v3Audio | Whisper | Speech transcription and meeting intelligence | 30 min | Balanced | 0.20 Credit / minute | Preview | 73 |
| 25 | ST scx/stable-diffusion-xlImage | Stability | Image generation, creative workflows, drafts | Image | Balanced | 0.65 Credit / image | Preview | 71 |
