Overview
- Skill Key
- devbd1/scrapling-mcp
- Author
- devbd1
- Source Repo
- openclaw/skills
- Version
- -
- Source Path
- skills/devbd1/scrapling-mcp
- Latest Commit SHA
- 7ecc23afd22d9fffef8896f83af5b820194116a1
Advanced web scraping with Scrapling — MCP-native guidance for extraction, crawling, and anti-bot handling. Use via mcporter (MCP) to call the `scrapling` MCP server for execution; this skill provides strategy, recipes, and best practices.
Stars
0
Installs
0
Status
ACTIVE
Visibility
PUBLIC
直接复制以下提示词,发送给你的 AI 助手即可完成安装。
请先检查是否已安装 SkillHub 商店,若未安装,请根据 https://skillhub-1388575217.cos.ap-guangzhou.myqcloud.com/install/skillhub.md 安装 SkillHub 商店,然后安装 scrapling-mcp 技能。 若已安装,则直接安装 scrapling-mcp 技能。
# Scrapling MCP — Web Scraping Guidance
Source repo: https://github.com/DevBD1/openclaw-skill-scrapling-mcp
> **Guidance Layer + MCP Integration**
> Use this skill for **strategy and patterns**. For execution, call Scrapling's MCP server via `mcporter`.
## Quick Start (MCP)
### 1. Install Scrapling with MCP support
```bash
pip install scrapling[mcp]
# Or for full features:
pip install scrapling[mcp,playwright]
python -m playwright install chromium
```
### 2. Add to OpenClaw MCP config
```json
{
"mcpServers": {
"scrapling": {
"command": "python",
"args": ["-m", "scrapling.mcp"]
}
}
}
```
### 3. Call via mcporter
```
mcporter call scrapling fetch_page --url "https://example.com"
```
## Execution vs Guidance
| Task | Tool | Example |
|------|------|---------|
| Fetch a page | **mcporter** | `mcporter call scrapling fetch_page --url URL` |
| Extract with CSS | **mcporter** | `mcporter call scrapling css_select --selector ".title::text"` |
| Which fetcher to use? | **This skill** | See "Fetcher Selection Guide" below |
| Anti-bot strategy? | **This skill** | See "Anti-Bot Escalation Ladder" |
| Complex crawl patterns? | **This skill** | See "Spider Recipes" |
## Fetcher Selection Guide
```
┌─────────────────┐ ┌──────────────────┐ ┌──────────────────┐
│ Fetcher │────▶│ DynamicFetcher │────▶│ StealthyFetcher │
│ (HTTP) │ │ (Browser/JS) │ │ (Anti-bot) │
└─────────────────┘ └──────────────────┘ └──────────────────┘
Fastest JS-rendered Cloudflare,
Static pages SPAs, React/Vue Turnstile, etc.
```
### Decision Tree
1. **Static HTML?** → `Fetcher` (10-100x faster)
2. **Need JS execution?** → `DynamicFetcher`
3. **Getting blocked?** → `StealthyFetcher`
4. **Complex session?** → Use Session variants
### MCP Fetch Modes
- `fetch_page` — HTTP fetcher
- `fetch_dynamic` — Browser-based with Playwright
- `fetch_stealthy` — Anti-bot bypass mo...
openbotx
An open-source platform for orchestrating AI agents — secure, simple, and built for everyone. Multi-agent, real-time task board, web control panel, skills system, browser automation, multi-provider, scheduler, and more. One command to start. Everything from the browser. No coding required.
heyixuan2
Bambu Lab 3D printer control and automation. Activate when user mentions: printer status, 3D printing, slice, analyze model, generate 3D, AMS filament, print monitor, Bambu Lab, or any 3D printing task. Full pipeline: search → generate → analyze → colorize → preview → open BS → user slice → print → monitor. Supports all 9 Bambu Lab printers (A1 Mini, A1, P1S, P2S, X1C, X1E, H2C, H2S, H2D).
sealiu1997
OpenClaw skill for 知识星球 / ZSXQ digests with token-first auth and browser recovery.
abczsl520
🌐 OpenClaw skill for Browser-Use — AI-powered browser automation for complex multi-step workflows (login, form filling, scraping, posting)
ashemag
OpenClaw skill to crosspost X/Twitter posts to Reddit via browser automation
chizhongwang
Teach AI agents to discover hidden APIs, data functions & browser automation recipes for any website. Works with Claude Code, ClawHub, Cursor & any AgentSkills-compatible agent.