Skills tagged with #scraping

@yokingma
MCP

One Search Mcp

Web search, crawl, scrape & extract with agent-browser, SearXNG, Tavily, DuckDuckGo, Bing & more

mcp github search browser web
yokingma/one-search-mcp
19d ago
0
@scrapfly
MCP

Mcp

Scrape any website, extract structured data, and collect web content at scale with AI agents

mcp ai web
scrapfly/scrapfly-mcp
19d ago
0
@scrape-badger
MCP

ScrapeBadger

Twitter/X scraping API for AI agents. Get profiles, tweets, trends, and more.

mcp github api ai file
scrape-badger/scrapebadger-mcp
19d ago
0
@curiositech

2026-legal-research-agent

Expert legal research agent for finding and scraping expungement data state by state. Knows authoritative sources, URL patterns, Firecrawl configuration, and 2026 legal landscape. Activate on "find expungement data", "scrape state laws", "legal research", "court URLs", "statute sources", "Clean Slate laws", "automatic expungement research". NOT for interpreting laws (use national-expungement-expert), building UI, or legal advice.

curiositech/some_claude_skills+88 more
19d ago
530
@DevsHero
MCP

ShadowCrawl

Stealth scraping & search. Bypasses Cloudflare, DataDome & LinkedIn via Cyborg HITL approach.

mcp github api search
DevsHero/ShadowCrawl
19d ago
0
@SawyerHood

dev-browser

Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.

SawyerHood/dev-browser
18d ago
3.9K 0
@D4Vinci
MCP

Scrapling MCP Server

Web scraping with stealth HTTP, real browsers, and Cloudflare bypass. CSS selectors supported.

mcp github api browser web
D4Vinci/Scrapling
19d ago
0
@mcp-registry
MCP

Olostep Mcp Server

Olostep MCP server for web scraping, Google search, and website URL search.

mcp github api search web
19d ago
0
@securecoders
MCP

OpenGraph.io MCP Server

MCP server for OpenGraph.io API - fetch OG data, screenshots, scrape, and generate images

mcp github api
securecoders/opengraph-io-mcp
19d ago
0
@HomenShum
MCP

Io.Github.HomenShum/Nodebench

260 MCP tools across 49 domains. AI Flywheel, quality gates, research, web scraping.

mcp github api ai search web
HomenShum/nodebench-ai
19d ago
0
@rog0x
MCP

Io.Github.Rog0x/Web

Web scraping, search, monitoring, and HTML-to-markdown for AI agents

mcp github api ai search web
rog0x/mcp-web-tools
19d ago
0
@patchy631

brightdata-web-mcp

Search the web, scrape websites, extract structured data from URLs, and automate browsers using Bright Data's Web MCP. Use when fetching live web content, bypassing blocks/CAPTCHAs, getting product data from Amazon/eBay, social media posts, or when standard requests fail.

patchy631/ai-engineering-hub+1 more
18d ago
32.0K 0
@firecrawl
MCP

Firecrawl MCP Server

MCP server for Firecrawl web scraping, structured data extraction and web search integration.

mcp github api search web
firecrawl/firecrawl-mcp-server.git
19d ago
0
@debytesio

job-hunter

This skill should be used when the user asks to "find jobs", "search for jobs matching my expectations", "find the best job matching my expectation", "job hunt", "search job platforms", "match jobs to my profile", "find AI engineer jobs", "find ML engineer jobs", "search for senior software engineer roles", "find jobs with visa sponsorship", or mentions job hunting, job matching, career search, or job platform scraping.

debytesio/claude-plugin-jobhunter
18d ago
50
@ArchiveBox

abx-dl

Use this when you need to scrape websites, extract page content, download media, or run the ArchiveBox extractors without a full ArchiveBox install. abx-dl can save many kinds of web content including txt, md, html, json, pdf, png, jpg, mp4, mp3, srt, screenshots, favicons, headers, DOM snapshots, mirrored sites, and more using the same plugin ecosystem that powers ArchiveBox.

ArchiveBox/abx-dl
18d ago
990
@henry-ships
MCP

SparkForge — 20+ Utility APIs with x402 Micropayments

20+ pay-per-use APIs: image gen, crypto data, email verify, SSL check, web scraping, and more.

mcp api ai web
henry-ships/sparkforge
19d ago
0
@saifyxpro

cli

Use when an agent needs to operate HeadlessX through the CLI instead of calling files or APIs directly. Covers installing the published HeadlessX CLI package, logging in with an API URL and API key, and running `headlessx` commands for website scraping, map, crawl, Google AI Search, Tavily, Exa, YouTube, jobs, and operators. Trigger for requests like "use the CLI", "test the CLI", "show the command", "log in with the CLI", or "run HeadlessX from terminal".

saifyxpro/HeadlessX
18d ago
1.7K 0
@SkillBoss-AI

skillboss-cold-email

Automated cold email pipeline. Finds target companies, enriches contacts, scrapes websites, and generates personalized cold emails using AI. One API call does it all: search → enrich → scrape → write.

SkillBoss-AI/skillboss-skills+1 more
18d ago
560
@zoharbabin
MCP

Google Researcher

MCP server providing Google Search, web scraping, and multi-source research tools for AI assistants

mcp github api ai search web
zoharbabin/google-research-mcp
19d ago
0
@baixianger
MCP

Camoufox Mcp

Anti-detection browser automation with Camoufox - stealth Firefox for web scraping

mcp github api ai browser web
baixianger/camoufox-mcp
19d ago
0
@Decodo
MCP

Io.Github.Decodo/Mcp Web Scraper

Enable your AI agents to scrape and parse web content dynamically, including geo-restricted sites

mcp github ai web
Decodo/mcp-web-scraper
19d ago
0
@Agent-Engineer-Master

analyzing-dtc-stores

Use when the user provides a DTC or ecommerce store URL and asks for a teardown, breakdown, brand analysis, competitor teardown, investor memo, store audit, deep dive, or 'what's going on with [brand]'. Produces an investor-grade markdown teardown report covering brand, market, unit economics, supply chain, channel mix, marketing, reviews, agentic-commerce readiness, risks, and a falsifiable verdict. Triggers: 'dtc teardown', 'brand teardown', 'store teardown', 'competitor teardown', 'analyze this store', 'investor memo on [brand]', 'break down [store url]'. Do NOT use for SEO-only audits, design-system extraction, lead-gen scraping, or general web scraping with no brand/investor focus.

Agent-Engineer-Master/skill-engineer+7 more
9d ago
60
@cloudflare

cloudflare-browser

Control headless Chrome via Cloudflare Browser Rendering CDP WebSocket. Use for screenshots, page navigation, scraping, and video capture when browser automation is needed in a Cloudflare Workers environment. Requires CDP_SECRET env var and cdpUrl configured in browser.profiles.

cloudflare/moltworker
18d ago
9.6K 0
@bamchi
MCP

Scrapi

Web scraping for AI agents. Converts URLs to clean, LLM-ready Markdown with anti-bot bypass.

mcp github api ai web llm
bamchi/scrapi-mcp-server
19d ago
0
@arabold

docs-manage

Manage the Grounded Docs MCP Server documentation index. Covers scraping and indexing documentation from URLs or local files, refreshing existing indexes with changed content, and removing libraries from the index. Use when you need to add, update, or delete indexed documentation.

arabold/docs-mcp-server+1 more
18d ago
1.1K 0
@fredpsantos33
MCP

Io.Github.Fredpsantos33/Iteratools

40+ pay-per-use tools for AI agents: search, TTS, QR, PDF, scraping, image gen. x402.

mcp github api ai search
fredpsantos33/iteratools-mcp
19d ago
0
@ScrapeGraphAI
MCP

Io.Github.ScrapeGraphAI/Scrapegraph Mcp

AI-powered web scraping and data extraction capabilities through ScrapeGraph API

mcp github api ai web
ScrapeGraphAI/scrapegraph-mcp
19d ago
0
@HatmanStack
MCP

Io.Github.HatmanStack/Ragstack

Search, chat, upload, and scrape a serverless RAGStack knowledge base on AWS.

mcp github aws search rag
HatmanStack/RAGStack-Lambda
19d ago
0
@pinchtab

pinchtab

Use this skill when a task needs browser automation through PinchTab: open a website, inspect interactive elements, click through flows, fill out forms, scrape page text, log into sites with a persistent profile, export screenshots or PDFs, manage multiple browser instances, or fall back to the HTTP API when the CLI is unavailable. Prefer this skill for token-efficient browser work driven by stable accessibility refs such as `e5` and `e12`.

pinchtab/pinchtab+1 more
18d ago
7.5K 0
@Damien829
MCP

AiPayGen — 250 AI tools for Agents

250+ AI tools: research, write, code, translate, analyze, scrape, memory, and more.

mcp github ai search memory
Damien829/aipaygen
19d ago
0
@gosom

google-maps-scraper

Free and open-source Google Maps scraper using Docker. Use when the user wants to find businesses, extract leads, emails, reviews, or ratings from Google Maps. Triggers on requests like "find all <business type> in <city>", "scrape Google Maps for <keyword>", "get leads from Google Maps". Keywords: google maps, scrape, business, leads, restaurants, shops, places, reviews, ratings, emails, contacts.

gosom/google-maps-scraper
18d ago
3.4K 0
@lingxling

adhx

Fetch any X/Twitter post as clean LLM-friendly JSON. Converts x.com, twitter.com, or adhx.com links into structured data with full article content, author info, and engagement metrics. No scraping or browser required.

lingxling/awesome-skills-cn+42 more
9d ago
610
@szymdzum

bdg

Use bdg CLI for browser automation via Chrome DevTools Protocol. Provides direct CDP access (60+ domains, 300+ methods) for DOM queries, navigation, screenshots, network control, and JavaScript execution. Use this skill when you need to automate browsers, scrape dynamic content, or interact with web pages programmatically.

szymdzum/browser-debugger-cli
18d ago
1100
@brain-bootstrap

Browser Automation with Playwright MCP

**Use when:** you need to interact with a web page — test a UI, scrape docs, verify a login flow, research a live API.

brain-bootstrap/claude-code-brain-bootstrap+13 more
14d ago
90