agentic-eval | Skill Performance & Reviews | TopRankSkills

TopRank Skills

Home / Skills / testing security / agentic-eval

agentic-eval

maintained by github

star 23.7k account_tree 2.7k verified_user MIT License
bolt View GitHub

Patterns and techniques for evaluating and improving AI agent outputs. Use this skill when: - Implementing self-critique and reflection loops - Building evaluator-optimizer pipelines for quality-critical generation - Creating test-driven code refinement workflows - Designing rubric-based or LLM-as-judge evaluation systems - Adding iterative improvement to agent outputs (code, reports, analysis) - Measuring and improving agent response quality

Key Features

  • Comprehensive skill evaluation and performance tracking
  • Community-driven ratings and reviews
  • Easy integration with Claude Code
  • Regular updates and maintenance

Quick Start

TopRank Skills install github/agentic-eval

chat Comments (0)

chat_bubble_outline

No comments yet. Be the first to share your thoughts!

Skill Details

GitHub Stars 23.7k
GitHub Forks 2.7k
Created Jan 2026
Last Updated 2 months ago
testing security testing security llm ai

Related Skills

ai-sdk

ai-sdk

vercel
star 22.3k
chevron_right
context-engineering-collection
chevron_right
humanizer
chevron_right
notebooklm
chevron_right
feature-dev
chevron_right

Build your own?

Join 12,000+ developers contributing to the Claude ecosystem.