This agent evaluates your agent against a number of criteria, including expected output format, toxicity, and whether it actually did what the prompt instructed.
The agent provides a clear Pass/Fail verdict with reasoning, as well as confidence scores for some of the criteria it evaluates. Based on its findings, it suggests ways to improve the prompt, or additional information you could inject so that your agent produces better results. It is effectively a must-use tool whenever you're building an agent.
To use it, clone your existing agent (if it's public), go to actions, then advanced, select "Invoke Agent", and choose "Agent Inspector". Provide it with the prompt you're using and the response your agent returned.
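As a rough illustration of the inputs and outputs described above, the Python sketch below models a prompt/response pair together with a Pass/Fail verdict, per-criterion reasoning, optional confidence scores, and improvement suggestions. All class names, field names, and example values are hypothetical and are not the Agent Inspector's actual output format; the rule that the overall verdict passes only when every criterion passes is also an assumption.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class CriterionResult:
    """One evaluated criterion (hypothetical shape)."""
    name: str
    passed: bool
    reasoning: str
    confidence: Optional[float] = None  # only some criteria carry a score


@dataclass
class InspectionReport:
    """What you supply (prompt, response) plus what comes back (hypothetical)."""
    prompt: str
    response: str
    criteria: List[CriterionResult] = field(default_factory=list)
    suggestions: List[str] = field(default_factory=list)

    @property
    def passed(self) -> bool:
        # Assumption: the overall verdict is Pass only if every criterion passes.
        return bool(self.criteria) and all(c.passed for c in self.criteria)

    def summary(self) -> str:
        verdict = "Pass" if self.passed else "Fail"
        lines = [f"Overall: {verdict}"]
        for c in self.criteria:
            status = "Pass" if c.passed else "Fail"
            score = "" if c.confidence is None else f" (confidence {c.confidence:.2f})"
            lines.append(f"- {c.name}: {status}{score}. {c.reasoning}")
        for s in self.suggestions:
            lines.append(f"Suggestion: {s}")
        return "\n".join(lines)


# Made-up example data, for illustration only.
report = InspectionReport(
    prompt="Summarize the ticket as three bullet points.",
    response="The customer reports a login failure after the last update.",
    criteria=[
        CriterionResult("output format", False,
                        "Response is a sentence, not three bullets.", 0.9),
        CriterionResult("toxicity", True, "No harmful language found."),
    ],
    suggestions=["State the required bullet format explicitly in the prompt."],
)
print(report.summary())
```

Keeping the confidence score optional reflects the description above, where only some criteria come with a score.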