Overview
- Skill Key
- gedigi/peekaboox
- Author
- gedigi
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/gedigi/peekaboox
- Latest Commit SHA
- f730d24d8582f102d964f4e72f7a0f8c2d523256
Control and automate the Linux desktop GUI on X11. Use this skill to take screenshots, find and click UI elements, type text, send keyboard shortcuts, scroll, manage windows (focus, minimize, maximize, close, move, resize), and use your own vision to locate elements on screen. Requires X11 (not Wayland). Use for desktop automation, GUI testing, remote desktop control, and any task requiring interaction with graphical applications.
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 linux-desktop 技能。 若已安装,则直接安装 linux-desktop 技能。
# Linux Desktop GUI Automation Automate any X11 Linux desktop: capture screens, find and click elements, type, use hotkeys, manage windows. Preferred screenshot interpretation path: capture with `capture.sh` and interpret the image directly in your OpenClaw chat (existing image-capable model connection). ## Prerequisites - X11 session running (XFCE, GNOME on X11, KDE on X11, i3, openbox, etc.) - `DISPLAY` environment variable set (usually `:0`) - Run `bash install.sh` once to install dependencies - No extra key needed for screenshot interpretation when using OpenClaw's image-capable chat path ## Quick Reference | Task | Command | |------|---------| | Take screenshot | `bash capture.sh` | | Screenshot of window | `bash capture.sh --window "Firefox"` | | List windows | `bash inspect.sh` | | Active window info | `bash inspect.sh --active` | | Find window by name | `bash inspect.sh --window "Firefox"` | | Click at coordinates | `bash click.sh --x 500 --y 300` | | Right-click | `bash click.sh --x 500 --y 300 --button right` | | Double-click | `bash click.sh --x 500 --y 300 --double` | | Click relative to window | `bash click.sh --window "Firefox" --x 200 --y 150` | | Type text | `bash type.sh "hello world"` | | Type into window | `bash type.sh --window "Terminal" "ls -la"` | | Send hotkey | `bash hotkey.sh "ctrl+c"` | | Send Enter | `bash hotkey.sh "Return"` | | Scroll down | `bash scroll.sh --direction down --amount 3` | | Scroll up at position | `bash scroll.sh --x 500 --y 300 --direction up --amount 3` | | Focus window | `bash window.sh --action focus --window "Firefox"` | | Minimize window | `bash window.sh --action minimize --window "Firefox"` | | Maximize window | `bash window.sh --action maximize --window "Firefox"` | | Close window | `bash window.sh --action close --window "Firefox"` | | Move window | `bash window.sh --action move --window "Firefox" --x 100 --y 50` | | Resize window | `bash window.sh --action resize --window "Firefox" --width 1280 --height...
# linux-desktop — OpenClaw Linux GUI Automation Skill Full GUI automation for Linux X11 desktops. This skill gives OpenClaw the ability to capture screenshots, inspect windows, click, type, send hotkeys, scroll, and manage windows. ## Requirements - Linux with X11 session (XFCE, GNOME on X11, KDE on X11, i3, openbox, etc.) - `DISPLAY` environment variable set (usually `:0`) ## Installation ```bash bash install.sh ``` This installs system dependencies (`xdotool`, `wmctrl`, `scrot`, `x11-utils`, `imagemagick`, Python). Supported package managers: apt (Debian/Ubuntu), dnf (Fedora/RHEL), pacman (Arch). ## Tools | Script | Purpose | |--------|---------| | `capture.sh` | Take screenshots (full screen or specific window) | | `inspect.sh` | List windows, get window details (JSON output) | | `click.sh` | Mouse click at coordinates (left/right/middle, single/double) | | `type.sh` | Type text into the focused window | | `hotkey.sh` | Send keyboard shortcuts (ctrl+c, alt+F4, etc.) | | `scroll.sh` | Scroll up/down at current or specified position | | `window.sh` | Window management (focus, minimize, maximize, close, move, resize) | ## Quick Start ```bash # Take a screenshot bash capture.sh # List all open windows bash inspect.sh # Click at coordinates bash click.sh --x 500 --y 300 # Type text bash type.sh "Hello world" # Send Ctrl+C bash hotkey.sh "ctrl+c" ``` Preferred workflow: capture screenshot, interpret it directly in your OpenClaw chat, then act with coordinates. ## Testing ```bash bash test.sh ``` ## OpenClaw Integration Because this repo is dedicated to this single skill, use the repo root as the skill folder: ```bash cp -r <this-repo> ~/.openclaw/workspace/skills/linux-desktop/ ``` Restart the OpenClaw gateway; the agent will read `SKILL.md`. ## Limitations - **X11 only** — does not work on Wayland sessions - Some applications with custom rendering may resist automation
capt-marbles
Task Router
capncoconut
Register, communicate, and earn on the x402hub AI agent marketplace. Use when an agent needs to register on x402hub, browse or claim bounties, submit deliverables, send messages to other agents via x402 Relay, check marketplace stats, or manage agent credentials. Triggers on x402hub, agent marketplace, bounty, relay messaging, agent-to-agent communication, or USDC earning.
capevace
Real-time event bus for AI agents. Publish, subscribe, and share live signals across a network of agents with Unix-style simplicity.
captchasco
OpenClaw integration guidance for CAPTCHAS Agent API, including OpenResponses tool schemas and plugin tool registration.
carol-gutianle
name: modelready description: Start using a local or Hugging Face model instantly, directly from chat. metadata: {"openclaw":{"requires":{"bins": "bash", "curl" }, "env": "URL" }}
canbirlik
Controls Wiz smart bulbs (turn on/off, RGB colors, disco mode) via local WiFi.