AI Browser Assistant
A Chrome extension that automates browser tasks using AI. Just describe what you want to do in natural language — the AI controls the browser and gets it done.
Features
- Natural language control — Tell the AI what to do, it handles the rest
- Works on any website — No site-specific setup required
- Real-time step display — Watch exactly what the AI is doing
- Smart form filling — Fills multiple fields at once
- Visual element overlay — All interactive elements are numbered on-screen
- Auto-stop protection — Detects login walls, infinite loops, and stuck pages
- Bring your own API key — Works with DeepSeek, OpenAI, or any OpenAI-compatible API
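As a rough illustration of the auto-stop idea, the sketch below stops a run when it reaches a step budget or when the last few steps are identical. It is not the extension's actual logic: the `Step` shape, the `shouldStop` name, and the `repeatLimit` threshold are assumptions made for this example. Only the default of 60 steps comes from the Configuration table.

```typescript
// Hypothetical shape of one automation step; the extension's real data model may differ.
interface Step {
  action: string; // e.g. "click" or "type"
  target: string; // e.g. the on-screen number of the element
  url: string;    // page the step ran on
}

type StopReason = "max-steps" | "loop" | null;

// Returns a reason to stop, or null to keep going.
// repeatLimit is an assumed threshold, not a documented setting.
function shouldStop(history: Step[], maxSteps = 60, repeatLimit = 3): StopReason {
  if (history.length >= maxSteps) return "max-steps";
  if (history.length < repeatLimit) return null;

  // Treat the run as looping when the last `repeatLimit` steps are identical.
  const recent = history.slice(-repeatLimit);
  const key = (s: Step) => `${s.action}|${s.target}|${s.url}`;
  const first = key(recent[0]);
  return recent.every((s) => key(s) === first) ? "loop" : null;
}
```

A real detector would probably also compare page state, since repeating the same click can be legitimate (for example, paging through results).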
Installation
Manual Install (Developer Mode)
- Download the code: click the green Code button on this page → Download ZIP, then unzip it. Or, if you have Git, run `git clone <repo-url>`
- Open Chrome and go to `chrome://extensions`
- Enable Developer mode using the toggle in the top-right corner
- Click Load unpacked
- Select the `src` folder inside the unzipped directory (not the root folder)
- The extension icon will appear in your Chrome toolbar. Click it to open the side panel
Setup
- Click the extension icon → open the side panel
- Click ⚙️ Settings
- Enter your API Key (DeepSeek / OpenAI / any OpenAI-compatible key)
- Optionally set a custom API Base URL (default: `https://api.deepseek.com/v1`)
- Click Save
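To show how the API Key and API Base URL fit together, here is a minimal sketch of a request to an OpenAI-compatible chat completions endpoint. The `buildChatRequest` helper and the `deepseek-chat` default model are assumptions for illustration. The extension's real prompts, model choice, and request code are not documented here.

```typescript
interface ChatRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Builds (but does not send) a chat completions request.
// The model name is an assumed default, not a documented setting.
function buildChatRequest(
  apiKey: string,
  task: string,
  baseUrl = "https://api.deepseek.com/v1",
  model = "deepseek-chat",
): ChatRequest {
  // Drop trailing slashes so the path is not doubled.
  const url = `${baseUrl.replace(/\/+$/, "")}/chat/completions`;
  return {
    url,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model, messages: [{ role: "user", content: task }] }),
    },
  };
}
```

The result can be passed straight to `fetch(req.url, req.init)`. Keeping construction separate from sending makes it easy to check that the key only ever goes to the configured endpoint.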
Usage
Open any webpage, then type your task in the chat box:
1. Open GitHub user sazima's profile page and star one of their repos
2. Check the weather in Singapore
3. Go to YouTube, open a GitHub tutorial, and like it. Then check the weather in Singapore. After that, open the test repository of GitHub user sazima and create an issue asking for an update. In the issue: (a) attach the tutorial link from earlier and the local weather in Singapore; (b) state at the beginning that the issue was created by an AI plugin whose repository is https://github.com/sazima/ai-browser-extension
The AI will show each step it's taking in real time. Click ■ to stop at any time.
Configuration
| Setting | Description | Default |
|---|---|---|
| API Key | Your OpenAI-compatible API key | — |
| API Base URL | API endpoint | https://api.deepseek.com/v1 |
| Max Steps | Maximum steps before auto-stop | 60 |
| Language | UI and AI response language | Browser language |
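The defaults in the table can be read as a fallback layer over whatever the user has saved. The sketch below shows one way to express that. The `Settings` field names and the `resolveSettings` helper are hypothetical, and the extension's actual storage keys may differ.

```typescript
// Hypothetical settings shape; field names are assumptions for this example.
interface Settings {
  apiKey: string;
  apiBaseUrl: string;
  maxSteps: number;
  language: string;
}

// Fills in the documented defaults for anything missing or invalid.
// browserLanguage would typically come from navigator.language.
function resolveSettings(stored: Partial<Settings>, browserLanguage: string): Settings {
  const steps = stored.maxSteps;
  const validSteps = typeof steps === "number" && Number.isInteger(steps) && steps > 0;
  return {
    apiKey: stored.apiKey ?? "",
    apiBaseUrl: stored.apiBaseUrl?.trim() || "https://api.deepseek.com/v1",
    maxSteps: validSteps ? (steps as number) : 60,
    language: stored.language || browserLanguage,
  };
}
```

For example, `resolveSettings({ apiKey: "sk-test" }, "en-US")` keeps the saved key and falls back to the default base URL, 60 steps, and the browser language.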
Permissions
| Permission | Reason |
|---|---|
| `<all_urls>` | Read and interact with any website the user wants to automate |
| `activeTab` | Access the currently active tab |
| `scripting` | Inject content script to interact with page elements |
| `sidePanel` | Display the chat interface in Chrome's side panel |
| `storage` | Save API key and settings locally on your device |
| `tabs` | Query the active tab's ID and URL |
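For orientation, this is roughly how the permissions above would be declared under Manifest V3, which the side panel API requires. It is written as a TypeScript object so it can be checked. The fragment is an assumption based only on the table, and the extension's actual `manifest.json` may be organized differently. In Manifest V3, match patterns such as `<all_urls>` go under `host_permissions` rather than `permissions`.

```typescript
// Hypothetical fragment reconstructed from the permissions table,
// not copied from the extension's manifest.json.
interface ManifestFragment {
  manifest_version: 3;
  permissions: string[];
  host_permissions: string[];
}

const manifestFragment: ManifestFragment = {
  manifest_version: 3,
  permissions: ["activeTab", "scripting", "sidePanel", "storage", "tabs"],
  host_permissions: ["<all_urls>"],
};

// True if the name appears as either an API permission or a host pattern.
function isDeclared(m: ManifestFragment, name: string): boolean {
  return m.permissions.indexOf(name) !== -1 || m.host_permissions.indexOf(name) !== -1;
}
```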
Your API key is stored locally and never sent anywhere except the AI API endpoint you configure.
License
MIT