Home / Skills / testing security / agentic-eval

agentic-eval

maintained by github

star 23.7k account_tree 2.7k verified_user MIT License

Overview Implementation Examples History

Patterns and techniques for evaluating and improving AI agent outputs. Use this skill when: - Implementing self-critique and reflection loops - Building evaluator-optimizer pipelines for quality-critical generation - Creating test-driven code refinement workflows - Designing rubric-based or LLM-as-judge evaluation systems - Adding iterative improvement to agent outputs (code, reports, analysis) - Measuring and improving agent response quality

Key Features

Comprehensive skill evaluation and performance tracking
Community-driven ratings and reviews
Easy integration with Claude Code
Regular updates and maintenance

Quick Start

TopRank Skills install github/agentic-eval

chat Comments (0)

chat_bubble_outline

No comments yet. Be the first to share your thoughts!

Skill Details

GitHub Stars 23.7k

GitHub Forks 2.7k

Created Jan 2026

Last Updated 4 months ago

testing security testing security llm ai

Related Skills

ai-sdk

vercel

star 22.3k

chevron_right

context-engineering-collection

muratcankoylan

star 8.3k

chevron_right

humanizer

blader

star 4.2k

chevron_right

notebooklm

PleasePrompto

star 3.9k

chevron_right

feature-dev

cexll

star 2.4k

chevron_right

Build your own?

Join 12,000+ developers contributing to the Claude ecosystem.

Sign in to Comment

agentic-eval

Key Features

Quick Start

chat Comments (0)

Skill Details

Related Skills

ai-sdk

context-engineering-collection

humanizer

notebooklm

feature-dev

Build your own?