MCP Web Browser Server
An advanced web browsing server for the Model Context Protocol (MCP) powered by Playwright, enabling headless browser interactions through a flexible, secure API.
🌐 Features
- Headless Web Browsing: Navigate to any website with SSL certificate validation bypass
- Full Page Content Extraction: Retrieve complete HTML content, including dynamically loaded JavaScript
- Multi-Tab Support: Create, manage, and switch between multiple browser tabs
- Advanced Web Interaction Tools:
  - Extract text content
  - Click page elements
  - Input text into form fields
  - Capture screenshots
  - Extract page links with filtering capabilities
  - Scroll pages in any direction
  - Execute JavaScript on pages
  - Refresh pages
  - Wait for navigation to complete
- Resource Management: Automatic cleanup of unused resources after inactivity
- Enhanced Page Information: Get detailed metadata about the current page
🚀 Quick Start
Prerequisites
- Python 3.10+
- MCP SDK
- Playwright
Installation
# Install MCP and Playwright
pip install mcp playwright
# Install browser dependencies
playwright install
Configuration for Claude Desktop
Add to your claude_desktop_config.json:
{
  "mcpServers": {
    "web-browser": {
      "command": "python",
      "args": [
        "/path/to/your/server.py"
      ]
    }
  }
}
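If the config file already lists other servers, the entry has to be merged into the existing "mcpServers" object rather than pasted over it. A minimal sketch, assuming the file is plain JSON; the helper name `add_web_browser_server` is hypothetical and not part of this project:

```python
import json


def add_web_browser_server(config_text: str, server_path: str) -> str:
    """Return config JSON with a "web-browser" entry added under "mcpServers".

    Hypothetical helper for illustration; existing server entries are preserved.
    """
    config = json.loads(config_text) if config_text.strip() else {}
    servers = config.setdefault("mcpServers", {})
    servers["web-browser"] = {"command": "python", "args": [server_path]}
    return json.dumps(config, indent=2)


# Example: a config that already contains another (made-up) server entry.
existing = '{"mcpServers": {"other-server": {"command": "node", "args": ["index.js"]}}}'
merged = add_web_browser_server(existing, "/path/to/your/server.py")
print(merged)
```

Restart Claude Desktop after editing the file so the new entry is picked up.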
💡 Usage Examples
Basic Web Navigation
# Browse to a website
page_content = browse_to("https://example.com")
# Extract page text
text_content = extract_text_content()
# Extract text from a specific element
title_text = extract_text_content("h1.title")
Web Interaction
# Navigate to a page
browse_to("https://example.com/login")
# Input text into a form
input_text("#username", "your_username")
input_text("#password", "your_password")
# Click a login button
click_element("#login-button")
Screenshot Capture
# Capture full page screenshot
full_page_screenshot = get_page_screenshots(full_page=True)
# Capture specific element screenshot
element_screenshot = get_page_screenshots(selector="#main-content")
Link Extraction
# Get all links on the page
page_links = get_page_links()
# Get links matching a pattern
filtered_links = get_page_links(filter_pattern="contact")
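How `filter_pattern` is matched is not specified here. The sketch below assumes a case-insensitive substring match against each link's URL and visible text; `filter_links` and the `href`/`text` keys are illustrative and may not match the server's actual return shape:

```python
from typing import Optional


def filter_links(links: list[dict], filter_pattern: Optional[str] = None) -> list[dict]:
    """Keep links whose URL or visible text contains the pattern (assumed semantics)."""
    if not filter_pattern:
        return list(links)
    needle = filter_pattern.lower()
    return [
        link for link in links
        if needle in link.get("href", "").lower() or needle in link.get("text", "").lower()
    ]


sample_links = [
    {"href": "https://example.com/contact", "text": "Get in touch"},
    {"href": "https://example.com/about", "text": "About us"},
    {"href": "https://example.com/help", "text": "Contact support"},
]
print(filter_links(sample_links, "contact"))
```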
Multi-Tab Browsing
# Create a new tab
tab_id = create_new_tab("https://example.com")
# Create another tab
another_tab_id = create_new_tab("https://example.org")
# List all open tabs
tabs = list_tabs()
# Switch between tabs
switch_tab(tab_id)
# Close a tab
close_tab(another_tab_id)
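The tab calls above imply some bookkeeping: a mapping from tab ID to tab, plus a notion of the current tab. A minimal sketch of that bookkeeping, with no browser attached; the `TabRegistry` class is hypothetical, and the choice to make a new tab current is an assumption:

```python
import uuid
from typing import Optional


class TabRegistry:
    """Tracks open tabs by ID and remembers which one is current (illustrative only)."""

    def __init__(self) -> None:
        self.tabs: dict[str, Optional[str]] = {}  # tab_id -> URL loaded in that tab
        self.current: Optional[str] = None

    def create(self, url: Optional[str] = None) -> str:
        tab_id = uuid.uuid4().hex
        self.tabs[tab_id] = url
        self.current = tab_id  # assumption: a new tab becomes the current tab
        return tab_id

    def switch(self, tab_id: str) -> None:
        if tab_id not in self.tabs:
            raise KeyError(f"unknown tab: {tab_id}")
        self.current = tab_id

    def close(self, tab_id: Optional[str] = None) -> None:
        # Mirrors close_tab(): no ID means "close the current tab".
        target = tab_id or self.current
        if target is None or target not in self.tabs:
            raise KeyError(f"unknown tab: {target}")
        del self.tabs[target]
        if self.current == target:
            # Fall back to any remaining tab, or to no tab at all.
            self.current = next(iter(self.tabs), None)


registry = TabRegistry()
first = registry.create("https://example.com")
second = registry.create("https://example.org")
registry.switch(first)
registry.close(second)
print(registry.current == first, list(registry.tabs.values()))
```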
Advanced Interactions
# Scroll the page
scroll_page(direction="down", amount="page")
# Execute JavaScript on the page
result = execute_javascript("return document.title")
# Get detailed page information
page_info = get_page_info()
# Refresh the current page
refresh_page()
# Wait for navigation to complete
wait_for_navigation(timeout_ms=5000)
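The `direction` and `amount` arguments to `scroll_page` can be read as a pixel offset. A sketch under stated assumptions: 'page' means one viewport, 'half' means half a viewport, and a number means pixels. The `scroll_delta` helper and the default viewport size are hypothetical, and the server may compute this differently:

```python
def scroll_delta(direction: str = "down", amount: str = "page",
                 viewport_width: int = 1280, viewport_height: int = 720) -> tuple[int, int]:
    """Translate direction/amount into a (dx, dy) pixel offset (assumed semantics)."""
    if direction not in ("up", "down", "left", "right"):
        raise ValueError(f"unsupported direction: {direction}")
    vertical = direction in ("up", "down")
    span = viewport_height if vertical else viewport_width
    if amount == "page":
        distance = span
    elif amount == "half":
        distance = span // 2
    else:
        distance = int(amount)  # raises ValueError for non-numeric input
    sign = -1 if direction in ("up", "left") else 1
    return (0, sign * distance) if vertical else (sign * distance, 0)


print(scroll_delta("down", "page"))
print(scroll_delta("left", "half"))
print(scroll_delta("up", "300"))
```

An offset like this could then be applied in the page with the standard `window.scrollBy(dx, dy)` call.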
🛡️ Security Features
- SSL certificate validation bypass
- Secure browser context management
- Custom user-agent configuration
- Error handling and comprehensive logging
- Configurable timeout settings
- CSP bypass control
- Protection against cookie stealing
🔧 Troubleshooting
Common Issues
- SSL Certificate Errors: Automatically bypassed
- Slow Page Load: Adjust timeout in browse_to() method
- Element Not Found: Verify selectors carefully
- Browser Resource Usage: Auto-cleanup after inactivity period
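Cleanup after inactivity usually comes down to remembering when the last tool call happened and comparing against a threshold. A minimal sketch of that idea; the `IdleTracker` class and the 300-second threshold are assumptions for illustration, not values taken from this server:

```python
import time
from typing import Callable


class IdleTracker:
    """Reports when no activity has been recorded for `idle_seconds` (illustrative)."""

    def __init__(self, idle_seconds: float, clock: Callable[[], float] = time.monotonic) -> None:
        self.idle_seconds = idle_seconds
        self.clock = clock
        self.last_activity = clock()

    def touch(self) -> None:
        """Call on every tool invocation to reset the idle timer."""
        self.last_activity = self.clock()

    def is_idle(self) -> bool:
        return self.clock() - self.last_activity >= self.idle_seconds


# Demonstrate with a fake clock so the example runs instantly.
now = [0.0]
tracker = IdleTracker(idle_seconds=300, clock=lambda: now[0])
now[0] = 120.0
print(tracker.is_idle())  # False: only 120 seconds have passed
tracker.touch()
now[0] = 420.0
print(tracker.is_idle())  # True: 300 seconds since the last touch
```

A background task could poll `is_idle()` and close the browser when it returns True.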
Logging
All significant events are logged with detailed information for easy debugging.
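As an illustration of what per-call logging can look like, here is a sketch built on Python's standard `logging` module. The `logged` decorator, the logger name, and the `fake_click` stand-in are all hypothetical; the server's own logging code may be organized differently:

```python
import functools
import logging

logging.basicConfig(level=logging.INFO,
                    format="%(asctime)s %(levelname)s %(name)s: %(message)s")
logger = logging.getLogger("web-browser-demo")  # hypothetical logger name


def logged(func):
    """Log each call's arguments and outcome (a sketch, not the server's own code)."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        logger.info("calling %s args=%r kwargs=%r", func.__name__, args, kwargs)
        try:
            result = func(*args, **kwargs)
        except Exception:
            # logger.exception records the traceback alongside the message.
            logger.exception("%s failed", func.__name__)
            raise
        logger.info("%s succeeded", func.__name__)
        return result
    return wrapper


@logged
def fake_click(selector: str) -> str:
    # Stand-in for a real tool; it only echoes the selector.
    return f"clicked {selector}"


print(fake_click("#login-button"))
```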
📋 Tool Parameters
browse_to(url: str, context: Optional[Any] = None)
- url: Website to navigate to
- context: Optional context object (currently unused)
extract_text_content(selector: Optional[str] = None, context: Optional[Any] = None)
- selector: Optional CSS selector to extract specific content
- context: Optional context object (currently unused)
click_element(selector: str, context: Optional[Any] = None)
- selector: CSS selector of the element to click
- context: Optional context object (currently unused)
get_page_screenshots(full_page: bool = False, selector: Optional[str] = None, context: Optional[Any] = None)
- full_page: Capture entire page screenshot
- selector: Optional element to screenshot
- context: Optional context object (currently unused)
get_page_links(filter_pattern: Optional[str] = None, context: Optional[Any] = None)
- filter_pattern: Optional text pattern to filter links
- context: Optional context object (currently unused)
input_text(selector: str, text: str, context: Optional[Any] = None)
- selector: CSS selector of input element
- text: Text to input
- context: Optional context object (currently unused)
create_new_tab(url: Optional[str] = None, context: Optional[Any] = None)
- url: Optional URL to navigate to in the new tab
- context: Optional context object (currently unused)
switch_tab(tab_id: str, context: Optional[Any] = None)
- tab_id: ID of the tab to switch to
- context: Optional context object (currently unused)
list_tabs(context: Optional[Any] = None)
- context: Optional context object (currently unused)
close_tab(tab_id: Optional[str] = None, context: Optional[Any] = None)
- tab_id: Optional ID of the tab to close (defaults to current tab)
- context: Optional context object (currently unused)
refresh_page(context: Optional[Any] = None)
- context: Optional context object (currently unused)
get_page_info(context: Optional[Any] = None)
- context: Optional context object (currently unused)
scroll_page(direction: str = "down", amount: str = "page", context: Optional[Any] = None)
- direction: Direction to scroll ('up', 'down', 'left', 'right')
- amount: Amount to scroll ('page', 'half', or a number)
- context: Optional context object (currently unused)
wait_for_navigation(timeout_ms: int = 10000, context: Optional[Any] = None)
- timeout_ms: Maximum time to wait in milliseconds
- context: Optional context object (currently unused)
execute_javascript(script: str, context: Optional[Any] = None)
- script: JavaScript code to execute
- context: Optional context object (currently unused)
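MCP clients invoke tools with a JSON-RPC `tools/call` request rather than a direct Python call. A small sketch of building such a request for the tools listed above; the `build_tool_call` helper is hypothetical, and the `context` parameter is left out on the assumption that callers never need to supply it:

```python
import json
from typing import Any


def build_tool_call(request_id: int, name: str, **arguments: Any) -> str:
    """Serialize an MCP `tools/call` JSON-RPC request for the named tool."""
    request = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }
    return json.dumps(request)


print(build_tool_call(1, "browse_to", url="https://example.com"))
print(build_tool_call(2, "scroll_page", direction="down", amount="half"))
```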
🤝 Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Development Setup
# Clone the repository
git clone https://github.com/random-robbie/mcp-web-browser.git
# Create virtual environment
python -m venv venv
source venv/bin/activate # On Windows use `venv\Scripts\activate`
# Install dependencies
pip install -e ".[dev]" # Quotes keep shells such as zsh from expanding the brackets
📄 License
MIT License
💬 Support
For issues and questions, please open an issue on GitHub.