MCP Server · STDIO · Official · v1.0.8

Agenson Horrowitz Web Content Extractor MCP Server

Extract and clean web content into structured formats optimized for LLM processing. For developers building AI agents, chatbots, or RAG systems that need reliable web scraping without boilerplate.

io.github.agenson-horrowitz/web-content-extractor

Hosted URL: Local install

Transport: STDIO

Auth: No auth required

What the Agenson Horrowitz Web Content Extractor MCP server does

How models use it and what it is built for.

Extract and clean web content into structured formats optimized for LLM processing. For developers building AI agents, chatbots, or RAG systems that need reliable web scraping without boilerplate.

Connect to Agenson Horrowitz Web Content Extractor

Local install — runs as a subprocess.

npx @agenson-horrowitz/web-content-extractor-mcp@1.0.8
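
Many MCP clients launch a local server from a JSON config entry rather than from a shell. The sketch below generates such an entry for the command above. The `mcpServers` layout, the `web-content-extractor` key, and the `-y` flag (which skips npx's install prompt) are assumptions based on a common client convention, not something this listing specifies, so check your client's documentation for the exact shape it expects.

```python
import json

# Package spec copied from the listing's install command.
PACKAGE = "@agenson-horrowitz/web-content-extractor-mcp@1.0.8"


def build_client_config(server_name: str = "web-content-extractor") -> dict:
    """Return a client config entry that launches the server via npx.

    The "mcpServers" layout and the server_name key are assumptions:
    they follow a convention used by several MCP clients, not this listing.
    """
    return {
        "mcpServers": {
            server_name: {
                "command": "npx",
                # "-y" skips npx's install prompt so the client is not blocked.
                "args": ["-y", PACKAGE],
            }
        }
    }


# Serialize the entry so it can be pasted into a client's config file.
config_text = json.dumps(build_client_config(), indent=2)
print(config_text)
```

Pinning the version in the package spec, as the listing does, keeps the client from silently picking up a newer release.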


Example prompts for Agenson Horrowitz Web Content Extractor

Paste any of these into Agent Studio after connecting Agenson Horrowitz Web Content Extractor.

  • Extract the main article text from this URL and remove navigation clutter
  • Fetch and structure product details from an e-commerce page into JSON
  • Pull all headings and paragraphs from a documentation site for indexing
  • What structured output formats does this server support for web content?

Agenson Horrowitz Web Content Extractor MCP server — FAQ

Common questions about connecting and running Agenson Horrowitz Web Content Extractor.

  • What web content formats does this MCP server output?

    The server extracts web content and cleans it into structured formats intended for LLM consumption. The listing does not enumerate those formats; refer to the project documentation for the specific output options (for example JSON or Markdown).

  • How do I install and run the web-content-extractor MCP server?

    Run it with `npx @agenson-horrowitz/web-content-extractor-mcp@1.0.8`, which downloads the package from npm and starts it without a separate install step. It uses the stdio transport, so configure your MCP client to spawn the process and communicate over its standard input and output.

  • Does this server handle JavaScript-rendered content or only static HTML?

    The registry metadata does not specify JavaScript rendering capabilities. Check the full documentation or GitHub repository for details on whether dynamic content is supported.

  • What authentication or API keys are required?

    The registry listing marks this server as "No auth required", and the install command takes no API key. Consult the project documentation to confirm whether any credentials are needed for the web requests it makes.

  • How is this different from using a generic HTTP client or cheerio?

    A generic HTTP client returns raw HTML, and cheerio leaves the parsing logic to you. This MCP server is purpose-built to strip boilerplate and return structured content ready for LLM consumption, so you do not have to write custom parsing code for each page.
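
The installation answer above says a client spawns the server and talks to it over standard input and output. As a rough sketch of what that involves, the code below frames a JSON-RPC `initialize` request as one line of JSON and sends it to the spawned process. The protocol revision string, the `example-client` identity, and the `-y` flag are assumptions for illustration; the server's tools are not listed here, so the sketch stops at the handshake.

```python
import json
import subprocess

# Command from the listing, with "-y" added (an assumption) to skip npx's prompt.
SERVER_COMMAND = ["npx", "-y", "@agenson-horrowitz/web-content-extractor-mcp@1.0.8"]


def initialize_request(request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 `initialize` request as defined by MCP."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "initialize",
        "params": {
            # Assumed protocol revision; the server may negotiate another.
            "protocolVersion": "2024-11-05",
            "capabilities": {},
            # Hypothetical client identity, for illustration only.
            "clientInfo": {"name": "example-client", "version": "0.0.1"},
        },
    }


def frame(message: dict) -> bytes:
    """Encode one message for the stdio transport: one JSON object per line."""
    # json.dumps escapes newlines inside strings, so each message stays on one line.
    return (json.dumps(message) + "\n").encode("utf-8")


def initialize_over_stdio(command: list[str]) -> dict:
    """Spawn the server, send `initialize`, and return its first reply."""
    proc = subprocess.Popen(command, stdin=subprocess.PIPE, stdout=subprocess.PIPE)
    try:
        proc.stdin.write(frame(initialize_request()))
        proc.stdin.flush()
        # The reply is expected as a single line of JSON on stdout.
        return json.loads(proc.stdout.readline())
    finally:
        proc.terminate()
```

Calling `initialize_over_stdio(SERVER_COMMAND)` requires Node.js and network access. A complete client would follow the reply with a `notifications/initialized` message before listing or calling tools; in practice an MCP SDK handles this exchange for you.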
