Agent benchmark for tool-augmented LLMs
Task Count: 200
Scoring Method: Task completion rate
Released: 2024-10-01
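The listing does not say how the task completion rate is computed. As a rough sketch, assuming it is simply the fraction of tasks the agent finishes successfully, it could be calculated as below. The function name `completion_rate` and the per-task boolean outcomes are hypothetical and not taken from the benchmark itself.

```python
def completion_rate(results: list[bool]) -> float:
    """Return the fraction of tasks marked as completed.

    Assumption: each task has a single pass/fail outcome and all
    tasks are weighted equally. An empty list yields 0.0.
    """
    if not results:
        return 0.0
    return sum(results) / len(results)


# Hypothetical run over 200 tasks, 150 of them completed.
outcomes = [True] * 150 + [False] * 50
rate = completion_rate(outcomes)
print(f"{rate:.1%}")  # prints 75.0%
```

If the benchmark awards partial credit or weights tasks differently, the per-task values would need to be scores rather than booleans.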