Metrics

AI LLM (Large Language Model) benchmark metrics platforms 大语言模型各种任务评价指标 评价平台 评价标准 语文写作,编程,数学 等等

2025-04-19. Category & Tags: AIGC, LLM, Large Language Model, 大语言模型, 语言模型, Benchmark, Metrics

See also the main item: /llm. Price, Sizes # Unit: /M tokens Prices are from OpenRouter.ai & SiliconFlow.cn. Model Context Input Output Model Size qwen-2.5-coder-32b-instruct 33K $0.07/¥ 1.26 $0.15 32B qwen/qwen-2.5-72b-instruct 33K $0.12 $0.39 72B Qwen3 235B A22B 41K $0.2 $0.6 235B A22B DeepSeek V3 0324 64K $0.27/¥ 8 $1.1 685B DeepSeek V3 164K $0.38 $0.89 671B DeepSeek R1 164K $0.5/¥ 16 $2.18 671B Gemini 2.5 Pro Prev. 1M $1.25 $10 ? ...

AI LLM (Large Language Model) benchmark metrics platforms 大语言模型各种任务评价指标 评价平台 评价标准 语文写作,编程,数学 等等

2025-04-19. Category & Tags: AIGC, LLM, Large Language Model, 大语言模型, 语言模型, Benchmark, Metrics

See also the main item: /llm. Price, Sizes # Unit: /M tokens Prices are from OpenRouter.ai & SiliconFlow.cn. Model Context Input Output Model Size qwen-2.5-coder-32b-instruct 33K $0.07/¥ 1.26 $0.15 32B qwen/qwen-2.5-72b-instruct 33K $0.12 $0.39 72B Qwen3 235B A22B 41K $0.2 $0.6 235B A22B DeepSeek V3 0324 64K $0.27/¥ 8 $1.1 685B DeepSeek V3 164K $0.38 $0.89 671B DeepSeek R1 164K $0.5/¥ 16 $2.18 671B Gemini 2.5 Pro Prev. 1M $1.25 $10 ? ...