
ct-grade

126 stars, 18 forks, MIT License


CLEO session grading and A/B behavioral analysis with token tracking. ct-grade evaluates agent session quality against a five-dimension rubric: S1 session discipline, S2 discovery efficiency, S3 task hygiene, S4 error protocol, and S5 progressive disclosure.

It supports three modes:
(1) scenario: run playbook scenarios S1-S5 against MCP or CLI.
(2) ab: blind A/B comparison of the CLEO MCP gateway and the CLI for the same domain operations, with token cost measurement.
(3) blind: spawn two agents with different configurations; a blind comparator picks the winner and an analyzer produces a recommendation.

Use it when grading agent sessions, running grade playbook scenarios, comparing MCP and CLI behavioral differences, measuring token usage across interface types, or performing multi-run blind A/B evaluation with statistical analysis and a comparative report.

Triggers on: grade session, evaluate agent behavior, A/B test CLEO interfaces, run grade scenario, token usage analysis, behavioral rubric, protocol compliance scoring, MCP vs CLI comparison.
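To make the rubric and the ab mode concrete, here is a minimal Python sketch of how per-dimension scores and token counts for two runs might be recorded and compared. It is not ct-grade's implementation: only the dimension labels S1-S5 come from the description above. The 0-20 scale per dimension, the `RunResult` class, and the `compare` function are all hypothetical names and values chosen for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass

# Dimension labels come from the skill description.
DIMENSIONS = {
    "S1": "session discipline",
    "S2": "discovery efficiency",
    "S3": "task hygiene",
    "S4": "error protocol",
    "S5": "progressive disclosure",
}

# Assumption for this sketch only: each dimension is scored 0-20.
MAX_PER_DIMENSION = 20


@dataclass
class RunResult:
    """Hypothetical record of one graded session."""

    interface: str          # e.g. "mcp" or "cli"
    scores: dict[str, int]  # one entry per rubric dimension
    tokens: int             # total tokens consumed by the session

    def total(self) -> int:
        """Sum the five dimension scores after validating them."""
        if set(self.scores) != set(DIMENSIONS):
            raise ValueError("scores must cover exactly S1-S5")
        for dim, value in self.scores.items():
            if not 0 <= value <= MAX_PER_DIMENSION:
                raise ValueError(f"{dim} score out of range: {value}")
        return sum(self.scores.values())


def compare(a: RunResult, b: RunResult) -> dict[str, int]:
    """Return score and token differences (a minus b) for two runs."""
    return {
        "score_delta": a.total() - b.total(),
        "token_delta": a.tokens - b.tokens,
    }


if __name__ == "__main__":
    mcp = RunResult("mcp", {"S1": 18, "S2": 15, "S3": 20, "S4": 12, "S5": 16}, 4200)
    cli = RunResult("cli", {"S1": 17, "S2": 14, "S3": 19, "S4": 12, "S5": 10}, 3100)
    print(compare(mcp, cli))  # {'score_delta': 9, 'token_delta': 1100}
```

The example numbers are invented. Keeping score and token deltas separate leaves the trade-off between quality and cost visible instead of folding both into a single figure.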

Key Features

Comprehensive skill evaluation and performance tracking
Community-driven ratings and reviews
Easy integration with Claude Code
Regular updates and maintenance

Quick Start

TopRank Skills install kryptobaseddev/ct-grade-v2-1


Skill Details

GitHub Stars 126

GitHub Forks 18

Created Mar 2026

Last Updated 3 months ago

Tags: tools, debugging

Related Skills

fabric (danielmiessler), 9.7k stars

typescript-expert (vudovn), 4.2k stars

break-loop (mindfold-ai), 3.3k stars

burp-suite (trailofbits), 2.4k stars

page-behavior-audit (openclaw), 2.4k stars

