MCPMark: Stress-Testing Comprehensive MCP Benchmark
MCP servers are shaping the future of software. MCPMark is a comprehensive, stress-testing MCP benchmark and a collection of diverse, verifiable tasks designed to evaluate model and agent capabilities in real-world MCP use. MCPMark will be continuously updated with emerging MCP servers to stay in step with the vibrant ecosystem!
28 Models Ranking
Average MCP benchmark task resolution success rate for top and select models on MCPMark's dataset of 127 tasks.