desktop-computer-automation | Skill Performance & Reviews | TopRankSkills

TopRank Skills

Home / Skills / tools / desktop-computer-automation

desktop-computer-automation

maintained by web-infra-dev

star 90 account_tree 5 verified_user MIT License
bolt View GitHub

Vision-driven desktop automation using Midscene. Control your desktop (macOS, Windows, Linux) with natural language commands. Operates entirely from screenshots — no DOM or accessibility labels required. Can interact with all visible elements on screen regardless of technology stack. ⚠️ WARNING: This skill takes over the user's real mouse and keyboard. The user cannot use their computer while automation is running. → For web apps, prefer the "Browser Automation" skill instead — it runs in a headless browser and does NOT interfere with the user's mouse/keyboard. → Only use this skill for desktop-native applications (Electron, Qt, native macOS/Windows/Linux apps) that cannot be tested in a browser. Triggers: open app, press key, desktop, computer, click on screen, type text, screenshot desktop, launch application, switch window, desktop automation, control computer, mouse click, keyboard shortcut, screen capture, find on screen, read screen, verify window, close app, minimize window, maximize window, test des

Key Features

  • Comprehensive skill evaluation and performance tracking
  • Community-driven ratings and reviews
  • Easy integration with Claude Code
  • Regular updates and maintenance

Quick Start

TopRank Skills install web-infra-dev/computer-automation

chat Comments (0)

chat_bubble_outline

No comments yet. Be the first to share your thoughts!

Skill Details

GitHub Stars 90
GitHub Forks 5
Created Mar 2026
Last Updated 3个月前
tools tools system admin

Related Skills

docker-expert
chevron_right
telnyx-network
chevron_right
plex

plex

openclaw
star 2.4k
chevron_right
discord-governance
chevron_right
hetzner-provisioner
chevron_right

Build your own?

Join 12,000+ developers contributing to the Claude ecosystem.