Introducing EVAL SYS
Evaluation Systems Organization
EVAL SYS is a living, open-source community that tracks and advances the agentic capabilities of models. We’ll be releasing benchmarks, datasets, toolchains, and models to push the field forward. EVAL SYS was initiated by LobeHub, and we would love to collaborate with research labs, MCP server developers, independent contributors, and more.
Join us, contribute, or reach out!