Overview
- Skill Key: andreybespalov89/selenium-browser
- Author: andreybespalov89
- Source Repo: openclaw/skills
- Version: -
- Source Path: skills/andreybespalov89/selenium-browser
- Latest Commit SHA: 4ba63ccaec8e2d69aa627581fd38bcb89112f69b
Start a Selenium-controlled Chrome browser, open a URL, take a screenshot, and report progress. Supports headless mode and an optional proxy.
- Stars: 0
- Installs: 0
- Status: ACTIVE
- Visibility: PUBLIC
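Copy the prompt below and send it to your AI assistant to complete the installation.
First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md, then install the selenium-browser skill. If it is already installed, install the selenium-browser skill directly.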
## Usage
The skill triggers on any message that contains *Chrome*, *browser*, *Selenium*, *screenshot*, or *open*.
```bash
selenium-browser <URL> [--headless] [--proxy=<url>]
```
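The trigger rule at the top of this section can be approximated with a small keyword check. This is a minimal sketch, not the skill's actual matcher: the function name `should_trigger`, the case-insensitive comparison, and the whole-word matching are all assumptions, since the source only says the message "contains" one of the keywords.

```python
import re

# Keywords listed in the Usage section.
TRIGGER_WORDS = ("chrome", "browser", "selenium", "screenshot", "open")

# Assumption: match whole words only, ignoring case, so that "reopened"
# does not count as "open". The real skill may use plain substring matching.
_TRIGGER_RE = re.compile(r"\b(" + "|".join(TRIGGER_WORDS) + r")\b", re.IGNORECASE)


def should_trigger(message: str) -> bool:
    """Return True if the message mentions any trigger keyword (hypothetical helper)."""
    return _TRIGGER_RE.search(message) is not None
```

With whole-word matching, a message such as "Please take a Screenshot of example.com" triggers the skill, while "the ticket was reopened" does not. Because *open* is a very common word, a stricter rule may be worth considering in practice.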
### Command flow
1. **Launch** Chrome (or Chromium) under Selenium.
2. **Navigate** to `<URL>`.
3. **Take a screenshot** of the loaded page.
4. **Save** the image in `/home/main/clawd/diffusion_pdfs/` and **report** the path back to the chat.
5. If anything fails, send an **error message**.
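The five steps above can be sketched end to end as follows. This is an illustrative outline under stated assumptions, not the skill's own script: the helper names `screenshot_path`, `capture`, and `main`, the file-naming scheme, and the 15-second timeout are hypothetical. It waits on `document.readyState` through `WebDriverWait` rather than sleeping for a fixed time, and it relies on Selenium 4 locating a driver on its own.

```python
import re
import sys
from datetime import datetime
from pathlib import Path
from typing import Optional

# Output directory named in step 4 of the command flow.
OUTPUT_DIR = Path("/home/main/clawd/diffusion_pdfs")


def screenshot_path(url: str, out_dir: Path = OUTPUT_DIR,
                    now: Optional[datetime] = None) -> Path:
    """Build a timestamped PNG path for `url` (hypothetical naming scheme)."""
    now = now or datetime.now()
    # Drop the scheme, then replace characters that are unsafe in file names.
    slug = re.sub(r"^[a-z]+://", "", url, flags=re.IGNORECASE)
    slug = re.sub(r"[^A-Za-z0-9._-]+", "_", slug).strip("_")[:80] or "page"
    return out_dir / f"{slug}_{now:%Y%m%d_%H%M%S}.png"


def capture(url: str, headless: bool = True, proxy: Optional[str] = None,
            timeout: float = 15) -> Path:
    """Steps 1-4: launch Chrome, navigate, take a screenshot, return its path."""
    # Imported lazily so the path helper works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.support.ui import WebDriverWait

    options = Options()
    if headless:
        options.add_argument("--headless")
    if proxy:
        options.add_argument(f"--proxy-server={proxy}")

    driver = webdriver.Chrome(options=options)  # step 1: launch
    try:
        driver.get(url)  # step 2: navigate
        # Wait until the browser reports the document as fully loaded.
        WebDriverWait(driver, timeout).until(
            lambda d: d.execute_script("return document.readyState") == "complete"
        )
        target = screenshot_path(url)
        target.parent.mkdir(parents=True, exist_ok=True)
        if not driver.save_screenshot(str(target)):  # steps 3-4: capture and save
            raise RuntimeError(f"could not write {target}")
        return target
    finally:
        driver.quit()  # always release the browser, even on failure


def main(argv: list) -> int:
    """Report the saved path, or an error message (step 5) on failure."""
    if len(argv) != 2:
        print("usage: flow_sketch.py <URL>", file=sys.stderr)
        return 2
    try:
        print(f"Screenshot saved: {capture(argv[1])}")
        return 0
    except Exception as exc:
        print(f"Error: {exc}", file=sys.stderr)
        return 1
```

To run the sketch as a script, call `sys.exit(main(sys.argv))` at the bottom of the file. Keeping the path construction in its own function means the naming logic can be checked without starting a browser.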
## Scripts
### scripts/launch_browser.py
```python
#!/usr/bin/env python3
import os
import sys
import time
import base64
from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.chrome.options import Options
# CLI parsing
import argparse
parser = argparse.ArgumentParser(description="Launch Selenium Chrome and take a screenshot.")
parser.add_argument("url", help="URL to open")
parser.add_argument("--headless", action="store_true", help="Run Chrome headless")
parser.add_argument("--proxy", help="Proxy URL (e.g., http://proxy:3128)")
args = parser.parse_args()
# Prepare Chrome options
chrome_options = Options()
if args.headless:
    chrome_options.add_argument("--headless")
chrome_options.add_argument("--disable-gpu")
chrome_options.add_argument("--no-sandbox")
chrome_options.add_argument("--disable-dev-shm-usage")
if args.proxy:
    chrome_options.add_argument(f"--proxy-server={args.proxy}")
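# Locate binaries
chrome_bin = os.getenv("CHROME_BIN", "/usr/bin/google-chrome")
chromedriver_path = os.getenv("CHROMEDRIVER_PATH", "/usr/local/bin/chromedriver")
# Use the configured Chrome binary if it exists; otherwise let Selenium locate one
if os.path.exists(chrome_bin):
    chrome_options.binary_location = chrome_bin
service = Service(executable_path=chromedriver_path)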
# Start browser
try:
    driver = webdriver.Chrome(service=service, options=chrome_options)
except Exception as e:
    print(f"❌ Failed to start Chrome: {e}", file=sys.stderr)
    sys.exit(1)
# Navigate and wait for page load
try:
    driver.get(args.url)
    time.sleep(5)  # simple fixed wait; WebDriverWait would be more reliable
# ... (the rest of the script is truncated in the source listing)
```