General Intelligence model ranking

See which AI models perform best on General Intelligence, which remain reliable, and where the largest differences appear. Sorted by: Metric ↑ (ascending).

Models shown

Average General Intelligence score

6.1

Best model

Qwen3.5-35B-A3B 2.8

Failure reasons

Did not follow instructions: 78
Incorrect answer: 59
API error: 12
Timeout: 4

210/210

Rank	Model	Company	General Intelligence score	Score	Total cost	Tests passed	Response time (avg)
#194	GLM 4.7 Flash medium	Z.ai	3.6	4.3	$0.166	0/1	18.1s
#51	Nemotron 3 Ultra medium	NVIDIA	3.7	7.5	$0.774	0/1	2.52s
#21	GPT-5.2 medium	OpenAI	3.7	8.4	$0.951	0/1	4.32s
#180	GPT-5.4 Nano none	OpenAI	3.8	4.8	$0.041	0/1	1.31s
#190	MiniMax M2.5 medium	Minimax	3.8	4.6	$0.340	0/1	6.63s
#171	North Mini Code none	Cohere	3.9	5.1	$0.000	0/1	34.8s
#172	MiniMax M2.7 medium	Minimax	3.9	5.0	$0.163	0/1	38.7s
#75	Grok 4.20 medium	X AI	3.9	7.1	$0.777	0/1	24.5s
#29	Step 3.7 Flash medium	Stepfun	4.0	8.0	$0.515	0/1	6.85s
#104	Gemini 3.1 Flash Lite Preview low	Google	4.0	6.5	$0.646	0/1	1.54s
#105	Gemini 3.1 Flash Lite low	Google	4.0	6.5	$0.621	0/1	1.37s
#106	Gemini 3.1 Flash Lite Preview none	Google	4.0	6.4	$0.052	0/1	741ms
#113	MiMo-V2-Flash medium	Xiaomi	4.0	6.3	$0.043	0/1	4.20s
#120	Gemini 3.1 Flash Lite minimal	Google	4.0	6.1	$0.047	0/1	791ms
#122	Gemini 3.1 Flash Lite none	Google	4.0	6.1	$0.046	0/1	992ms

General Intelligence ranking

Top models by General Intelligence score

General Intelligence score vs. total cost

Top models by response time (avg)