Overview
- Skill Key: damirikys/scrapling-fetcher
- Author: damirikys
- Source Repo: openclaw/skills
- Version: -
- Source Path: skills/damirikys/scrapling-fetcher
- Latest Commit SHA: 8d0e2838fd2214bcad0cdc1d3d8137013de38a22
Web scraping using Scrapling — a Python framework with anti-bot bypass (Cloudflare Turnstile, fingerprint spoofing), adaptive element tracking, stealth headless browser, and full CSS/XPath extraction. Use when web_fetch fails (Cloudflare, JS-rendered pages), or when extracting structured data from websites (prices, articles, lists). Supports HTTP, stealth, and full browser modes. Source: github.com/D4Vinci/Scrapling (PyPI: scrapling). Only use on sites you have permission to scrape.
Stars: 0
Installs: 0
Status: ACTIVE
Visibility: PUBLIC
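Copy the prompt below and send it to your AI assistant to complete the installation.
First check whether the SkillHub store is installed. If it is not, install it by following https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md, then install the scrapling skill. If it is already installed, install the scrapling skill directly.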
# Scrapling Skill

**Source:** https://github.com/D4Vinci/Scrapling (open source, MIT-like license)

**PyPI:** `scrapling` — install before first use (see below)

> ⚠️ Only scrape sites you have permission to access. Respect `robots.txt` and Terms of Service. Do not use stealth modes to bypass paywalls or access restricted content without authorization.

## Installation (one-time, confirm with user before running)

```bash
pip install scrapling[all]
patchright install chromium  # required for stealth/dynamic modes
```

- `scrapling[all]` installs `patchright` (a stealth fork of Playwright, bundled as a PyPI package — not a typo), `curl_cffi`, MCP server deps, and IPython shell.
- `patchright install chromium` downloads Chromium (~100 MB) via patchright's own installer (same mechanism as `playwright install chromium`).
- Confirm with user before running — installs ~200 MB of dependencies and browser binaries.

## Script

`scripts/scrape.py` — CLI wrapper for all three fetcher modes.

```bash
# Basic fetch (text output)
python3 ~/skills/scrapling/scripts/scrape.py <url> -q

# CSS selector extraction
python3 ~/skills/scrapling/scripts/scrape.py <url> --selector ".class" -q

# Stealth mode (Cloudflare bypass) — only on sites you're authorized to access
python3 ~/skills/scrapling/scripts/scrape.py <url> --mode stealth -q

# JSON output
python3 ~/skills/scrapling/scripts/scrape.py <url> --selector "h2" --json -q
```

## Fetcher Modes

- **http** (default) — Fast HTTP with browser TLS fingerprint spoofing. Most sites.
- **stealth** — Headless Chrome with anti-detect. For Cloudflare/anti-bot.
- **dynamic** — Full Playwright browser. For heavy JS SPAs.

## When to Use Each Mode

- `web_fetch` returns 403/429/Cloudflare challenge → use `--mode stealth`
- Page content requires JS execution → use `--mode dynamic`
- Regular site, just need text/data → use `--mode http` (default)

## Python Inline Usage

For custom logic beyond the CLI, write inline Python. See `references/pattern...
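
The mode-selection rules under "When to Use Each Mode" can be sketched as a small helper that picks a mode and assembles the matching `scrape.py` command line. This is only an illustration, not part of the skill: `choose_mode`, `build_command`, and the `CHALLENGE_MARKERS` strings are hypothetical names and values, and combining `--mode` with `--selector` and `--json` in a single call is an assumption, since the README shows those flags only in separate examples.

```python
import os
import shlex

# Hypothetical substrings hinting at a Cloudflare challenge page (assumption;
# adjust to whatever the failed fetch actually returns).
CHALLENGE_MARKERS = ("just a moment", "cf-chl", "challenge-platform")

# Script location as written in the README's usage examples.
SCRIPT = "~/skills/scrapling/scripts/scrape.py"


def choose_mode(status_code, body="", needs_js=False):
    """Map the outcome of a plain fetch to one of the three documented modes."""
    text = body.lower()
    # 403/429 or a challenge page: the README recommends stealth mode.
    if status_code in (403, 429) or any(m in text for m in CHALLENGE_MARKERS):
        return "stealth"
    # Content that only appears after JS execution: dynamic mode.
    if needs_js:
        return "dynamic"
    # Everything else: the default HTTP mode.
    return "http"


def build_command(url, mode="http", selector=None, as_json=False):
    """Build the argv list for scrape.py using only flags shown in the README."""
    argv = ["python3", os.path.expanduser(SCRIPT), url]
    if mode != "http":  # http is the default, so the flag is omitted
        argv += ["--mode", mode]
    if selector:
        argv += ["--selector", selector]
    if as_json:
        argv.append("--json")
    argv.append("-q")
    return argv


if __name__ == "__main__":
    mode = choose_mode(403)
    cmd = build_command("https://example.com", mode, selector="h2", as_json=True)
    # Print a copy-pasteable shell command instead of running it.
    print(shlex.join(cmd))
```

The sketch checks for a block or challenge before the JS requirement, so a page that is both protected and JS-heavy is routed to stealth mode first; whether that ordering suits a given site is a judgment call. The command is printed rather than executed, which leaves room to confirm with the user before any scrape runs.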
openbotx
An open-source platform for orchestrating AI agents — secure, simple, and built for everyone. Multi-agent, real-time task board, web control panel, skills system, browser automation, multi-provider, scheduler, and more. One command to start. Everything from the browser. No coding required.
heyixuan2
Bambu Lab 3D printer control and automation. Activate when user mentions: printer status, 3D printing, slice, analyze model, generate 3D, AMS filament, print monitor, Bambu Lab, or any 3D printing task. Full pipeline: search → generate → analyze → colorize → preview → open BS → user slice → print → monitor. Supports all 9 Bambu Lab printers (A1 Mini, A1, P1S, P2S, X1C, X1E, H2C, H2S, H2D).
sealiu1997
OpenClaw skill for 知识星球 / ZSXQ digests with token-first auth and browser recovery.
abczsl520
🌐 OpenClaw skill for Browser-Use — AI-powered browser automation for complex multi-step workflows (login, form filling, scraping, posting)
human-pages-ai
Search and hire real humans for tasks — photography, delivery, research, and more
chizhongwang
Teach AI agents to discover hidden APIs, data functions & browser automation recipes for any website. Works with Claude Code, ClawHub, Cursor & any AgentSkills-compatible agent.