Blog
Latest updates and announcements from the MCPMark team.
Published on
NEWS
MCP benchmark Leaderboard 2025-09-10 Update
Explore the latest MCPMark leaderboard update featuring top MCP benchmark models like Qwen-3-Max, Grok-Code-Fast-1, and Kimi-K2-0905. Discover their tool-use capabilities, success rates, and cost efficiency for real-world MCP applications.
Published on
NEWS
Introducing MCPMark: a comprehensive and challenging MCP Benchmark
Introducing MCPMark, a comprehensive MCP benchmark to stress-test AI models on MCP tasks. It features 127 expert-crafted samples and diverse environments such as Notion, GitHub, and Postgres. Explore detailed leaderboards, cost analysis, and rigorous task design for real-world AI evaluation.
Published on
NEWS