Web Scraping

Browse all content tagged with Web Scraping

Mcp servers

Puppeteer Vision MCP Server

The Puppeteer Vision MCP Server empowers AI assistants to scrape and convert web pages into Markdown, using advanced AI-driven interaction to bypass interactive barriers like CAPTCHAs and paywalls. It integrates seamlessly into AI workflows via the Model Context Protocol (MCP), streamlining web data extraction and ingestion.

5 min read
Mcp servers

ScrAPI MCP Server

The ScrAPI MCP Server empowers AI assistants to extract live web content—even from sites protected by captchas, bot detection, or geofencing. By acting as a bridge to the ScrAPI service, it enables automated scraping of HTML or Markdown for real-time data enrichment, research automation, and more.

4 min read
Mcp servers

Fetch MCP Server

The Fetch MCP Server for FlowHunt enables AI agents to retrieve and transform live web content in multiple formats, including HTML, JSON, plain text, and Markdown—empowering dynamic workflows, data extraction, and real-time content integration.

5 min read
Mcp servers

Firecrawl MCP Server

The Firecrawl MCP Server supercharges FlowHunt and AI assistants with advanced web scraping, deep research, and content discovery capabilities. Seamless integration enables real-time data extraction and automated research workflows directly within your development environment.

4 min read
Mcp servers

mcp-rquest MCP Server

The mcp-rquest MCP Server empowers AI assistants with advanced, browser-like HTTP request capabilities, robust anti-bot evasion, and document-to-Markdown conversion. Powered by the rquest engine, it enables secure, realistic web interactions and efficient handling of large web or document responses.

4 min read
Mcp servers

Oxylabs MCP Server

The Oxylabs MCP (Model Context Protocol) Server is a bridge between AI assistants and the real-world web, offering a unified API to extract, structure, and deliver clean data from any website. It enables AI models to access live web data, automate extraction, and enhance workflows with real-time information.

4 min read
Mcp servers

puremd MCP Server

The puremd MCP Server bridges AI assistants and agents with web content by unblocking and scraping sites, rendering dynamic pages, and converting resources into markdown via the pure.md platform. It powers advanced web search and seamless LLM context enrichment within FlowHunt and other AI development tools.

4 min read
Mcp servers

Dumpling AI MCP Server

The Dumpling AI MCP Server for FlowHunt enables AI assistants to connect with a wide range of external data sources, APIs, and developer tools. It empowers automated workflows for web scraping, document conversion, knowledge base management, and more, making it ideal for developers and researchers seeking to extend their AI's capabilities.

4 min read
Components

URL Retriever

Unlock web content in your workflows with the URL Retriever component. Effortlessly extract and process the text and metadata from any list of URLs—including web articles, documents, and more. Supports advanced options like OCR for images, selective metadata extraction, and customizable caching, making it ideal for building knowledge-rich AI flows and automations.

4 min read
Blog

AI-powered Data Extraction

Discover how AI-powered data extraction automates and streamlines data processing, reduces errors, and enhances business efficiency. Explore top models, extraction methods, and leading tools like Docsumo, Hevo Data, Airbyte, and Import.io.

11 min read
Glossary

Lead Scraper

Lead scraping automates the extraction of valuable contact data from online sources, enabling businesses to efficiently build high-quality lead databases for targeted marketing and sales while ensuring data privacy compliance.

10 min read

Other Tags

ai (896) automation (623) mcp server (390) flowhunt (240) integration (228) machine learning (211) mcp (209) ai integration (119) ai tools (105) productivity (90) components (75) developer tools (75) nlp (74) devops (60) chatbots (58) workflow (58) llm (57) deep learning (52) security (52) chatbot (50) ai agents (48) content creation (40) seo (39) analytics (38) data science (35) open source (35) database (33) mcp servers (33) no-code (33) ai automation (32) business intelligence (29) image generation (28) reasoning (28) content generation (26) neural networks (26) generative ai (25) python (25) compliance (24) openai (24) slack (24) computer vision (23) marketing (23) rag (23) blockchain (22) education (22) project management (22) summarization (21) api integration (20) apis (20) collaboration (20) finance (20) knowledge management (20) search (20) data (19) data analysis (19) development tools (19) workflow automation (19) prompt engineering (18) semantic search (18) documentation (17) api (16) classification (16) content writing (16) slackbot (16) customer service (15) ethics (15) transparency (15) web scraping (15) data integration (14) model evaluation (14) natural language processing (14) research (14) sql (14) text-to-image (14) business (13) creative writing (13) crm (13) data extraction (13) hubspot (13) text generation (13) ai chatbot (12) artificial intelligence (12) content marketing (12) creative ai (12) customer support (12) digital marketing (12) llms (12) monitoring (12) ocr (12) sales (12) ai agent (11) data management (11) email (11) integrations (11) observability (11) personalization (11) predictive analytics (11) regression (11) text analysis (11) web search (11)