scraping

Expert legal research agent for finding and scraping expungement data state by state. Knows authoritative sources, URL patterns, Firecrawl configuration, and 2026 legal landscape. Activate on "find expungement data", "scrape state laws", "legal research", "court URLs", "statute sources", "Clean Slate laws", "automatic expungement research". NOT for interpreting laws (use national-expungement-expert), building UI, or legal advice.

curiositech/some_claude_skills +88 more

2mo ago

530

@DevsHero

MCP

ShadowCrawl

Stealth scraping & search. Bypasses Cloudflare, DataDome & LinkedIn via a Cyborg HITL (human-in-the-loop) approach.

dev-browser

Browser automation with persistent page state. Use when users ask to navigate websites, fill forms, take screenshots, extract web data, test web apps, or automate browser workflows. Trigger phrases include "go to [url]", "click on", "fill out the form", "take a screenshot", "scrape", "automate", "test the website", "log into", or any browser interaction request.

SawyerHood/dev-browser

Scrapling MCP Server

Web scraping with stealth HTTP, real browsers, and Cloudflare bypass. CSS selectors supported.

mcp github api browser web

Olostep Mcp Server

Olostep MCP server for web scraping, Google search, and website URL search.

mcp github api search web

2mo ago

@securecoders

MCP

OpenGraph.io MCP Server

MCP server for OpenGraph.io API - fetch OG data, screenshots, scrape, and generate images

mcp github api

securecoders/opengraph-io-mcp

2mo ago

@HomenShum

MCP

Io.Github.HomenShum/Nodebench

260 MCP tools across 49 domains. AI Flywheel, quality gates, research, web scraping.

mcp github api ai search web

HomenShum/nodebench-ai

2mo ago

@rog0x

MCP

Io.Github.Rog0x/Web

Web scraping, search, monitoring, and HTML-to-markdown for AI agents

mcp github api ai search web

rog0x/mcp-web-tools

2mo ago

@patchy631

brightdata-web-mcp

Search the web, scrape websites, extract structured data from URLs, and automate browsers using Bright Data's Web MCP. Use when fetching live web content, bypassing blocks/CAPTCHAs, getting product data from Amazon/eBay, social media posts, or when standard requests fail.

patchy631/ai-engineering-hub +1 more

Firecrawl MCP Server

MCP server for Firecrawl web scraping, structured data extraction and web search integration.

mcp github api search web

firecrawl/firecrawl-mcp-server.git

2mo ago

@debytesio

job-hunter

This skill should be used when the user asks to "find jobs", "search for jobs matching my expectations", "find the best job matching my expectation", "job hunt", "search job platforms", "match jobs to my profile", "find AI engineer jobs", "find ML engineer jobs", "search for senior software engineer roles", "find jobs with visa sponsorship", or mentions job hunting, job matching, career search, or job platform scraping.

debytesio/claude-plugin-jobhunter

2mo ago

@ArchiveBox

abx-dl

Use this when you need to scrape websites, extract page content, download media, or run the ArchiveBox extractors without a full ArchiveBox install. abx-dl can save many kinds of web content including txt, md, html, json, pdf, png, jpg, mp4, mp3, srt, screenshots, favicons, headers, DOM snapshots, mirrored sites, and more using the same plugin ecosystem that powers ArchiveBox.

SparkForge — 20+ Utility APIs with x402 Micropayments

20+ pay-per-use APIs: image gen, crypto data, email verify, SSL check, web scraping, and more.

mcp api ai web

henry-ships/sparkforge

2mo ago

@saifyxpro

cli

Use when an agent needs to operate HeadlessX through the CLI instead of calling files or APIs directly. Covers installing the published HeadlessX CLI package, logging in with an API URL and API key, and running `headlessx` commands for website scraping, map, crawl, Google AI Search, Tavily, Exa, YouTube, jobs, and operators. Trigger for requests like "use the CLI", "test the CLI", "show the command", "log in with the CLI", or "run HeadlessX from terminal".

skillboss-cold-email

Automated cold email pipeline. Finds target companies, enriches contacts, scrapes websites, and generates personalized cold emails using AI. One API call does it all: search → enrich → scrape → write.

SkillBoss-AI/skillboss-skills +1 more

Google Researcher

MCP server providing Google Search, web scraping, and multi-source research tools for AI assistants

mcp github api ai search web

zoharbabin/google-research-mcp

2mo ago

@baixianger

MCP

Camoufox Mcp

Anti-detection browser automation with Camoufox - stealth Firefox for web scraping

mcp github api ai browser web

baixianger/camoufox-mcp

2mo ago

@Decodo

MCP

Io.Github.Decodo/Mcp Web Scraper

Enable your AI agents to scrape and parse web content dynamically, including geo-restricted sites

mcp github ai web

Decodo/mcp-web-scraper

2mo ago

@Agent-Engineer-Master

analyzing-dtc-stores

Use when the user provides a DTC or ecommerce store URL and asks for a teardown, breakdown, brand analysis, competitor teardown, investor memo, store audit, deep dive, or 'what's going on with [brand]'. Produces an investor-grade markdown teardown report covering brand, market, unit economics, supply chain, channel mix, marketing, reviews, agentic-commerce readiness, risks, and a falsifiable verdict. Triggers: 'dtc teardown', 'brand teardown', 'store teardown', 'competitor teardown', 'analyze this store', 'investor memo on [brand]', 'break down [store url]'. Do NOT use for SEO-only audits, design-system extraction, lead-gen scraping, or general web scraping with no brand/investor focus.

Agent-Engineer-Master/skill-engineer +7 more

2mo ago

@cloudflare

cloudflare-browser

Control headless Chrome via Cloudflare Browser Rendering CDP WebSocket. Use for screenshots, page navigation, scraping, and video capture when browser automation is needed in a Cloudflare Workers environment. Requires CDP_SECRET env var and cdpUrl configured in browser.profiles.

cloudflare/moltworker

Scrapi

Web scraping for AI agents. Converts URLs to clean, LLM-ready Markdown with anti-bot bypass.

mcp github api ai web llm

bamchi/scrapi-mcp-server

2mo ago

@arabold

docs-manage

Manage the Grounded Docs MCP Server documentation index. Covers scraping and indexing documentation from URLs or local files, refreshing existing indexes with changed content, and removing libraries from the index. Use when you need to add, update, or delete indexed documentation.

arabold/docs-mcp-server +1 more

Io.Github.Fredpsantos33/Iteratools

40+ pay-per-use tools for AI agents: search, TTS, QR, PDF, scraping, image gen. x402.

mcp github api ai search

fredpsantos33/iteratools-mcp

2mo ago

@ScrapeGraphAI

MCP

Io.Github.ScrapeGraphAI/Scrapegraph Mcp

AI-powered web scraping and data extraction capabilities through ScrapeGraph API

mcp github api ai web

ScrapeGraphAI/scrapegraph-mcp

2mo ago

@HatmanStack

MCP

Io.Github.HatmanStack/Ragstack

Search, chat, upload, and scrape a serverless RAGStack knowledge base on AWS.

mcp github aws search rag

HatmanStack/RAGStack-Lambda

2mo ago

@pinchtab

pinchtab

Use this skill when a task needs browser automation through PinchTab: open a website, inspect interactive elements, click through flows, fill out forms, scrape page text, log into sites with a persistent profile, export screenshots or PDFs, manage multiple browser instances, or fall back to the HTTP API when the CLI is unavailable. Prefer this skill for token-efficient browser work driven by stable accessibility refs such as `e5` and `e12`.

pinchtab/pinchtab +1 more

AiPayGen — 250 AI tools for Agents

250+ AI tools: research, write, code, translate, analyze, scrape, memory, and more.

mcp github ai search memory

Damien829/aipaygen

2mo ago

@gosom

google-maps-scraper

Free and open-source Google Maps scraper using Docker. Use when the user wants to find businesses, extract leads, emails, reviews, or ratings from Google Maps. Triggers on requests like "find all <business type> in <city>", "scrape Google Maps for <keyword>", "get leads from Google Maps". Keywords: google maps, scrape, business, leads, restaurants, shops, places, reviews, ratings, emails, contacts.

gosom/google-maps-scraper

2mo ago

3.4K 0

@lingxling

adhx

Fetch any X/Twitter post as clean LLM-friendly JSON. Converts x.com, twitter.com, or adhx.com links into structured data with full article content, author info, and engagement metrics. No scraping or browser required.

lingxling/awesome-skills-cn +46 more

1mo ago

610

@szymdzum

bdg

Use bdg CLI for browser automation via Chrome DevTools Protocol. Provides direct CDP access (60+ domains, 300+ methods) for DOM queries, navigation, screenshots, network control, and JavaScript execution. Use this skill when you need to automate browsers, scrape dynamic content, or interact with web pages programmatically.

szymdzum/browser-debugger-cli

2mo ago

1100

@brain-bootstrap