AI Browser Assistant

A Chrome extension that automates browser tasks using AI. Just describe what you want to do in natural language — the AI controls the browser and gets it done.

English | 简体中文 | 日本語 | 한국어

Features

  • Natural language control — Tell the AI what to do, it handles the rest
  • Works on any website — No site-specific setup required
  • Real-time step display — Watch exactly what the AI is doing
  • Smart form filling — Fills multiple fields at once
  • Visual element overlay — All interactive elements are numbered on-screen
  • Auto-stop protection — Detects login walls, infinite loops, and stuck pages
  • Bring your own API key — Works with DeepSeek, OpenAI, or any OpenAI-compatible API

Installation

Manual Install (Developer Mode)

  1. Download the code — Click the green Code button on this page → Download ZIP, then unzip it. Or if you have Git: git clone <repo-url>
  2. Open Chrome and go to chrome://extensions
  3. Enable Developer mode using the toggle in the top-right corner
  4. Click Load unpacked
  5. Select the src folder inside the unzipped directory (not the root folder)
  6. The extension icon will appear in your Chrome toolbar — click it to open the side panel

Setup

  1. Click the extension icon → open the side panel
  2. Click ⚙️ Settings
  3. Enter your API Key (DeepSeek / OpenAI / any OpenAI-compatible key)
  4. Optionally set a custom API Base URL (default: https://api.deepseek.com/v1)
  5. Click Save

Usage

Open any webpage, then type your task in the chat box:

1. Open GitHub user sazima's profile page and star one of their repos
2. Check the weather in Singapore
3. Go to YouTube, open a GitHub tutorial, and like it. Then check the weather in Singapore. After that, open the test repository of GitHub user sazima, and create an issue urging for an update. Note: 1. In the issue, attach the tutorial link from earlier and the local weather in Singapore. 2. At the beginning of the issue, mention that this issue was created by an AI plugin, and the plugin's repository is https://github.com/sazima/ai-browser-extension

The AI will show each step it's taking in real time. Click to stop at any time.

Configuration

Setting Description Default
API Key Your OpenAI-compatible API key
API Base URL API endpoint https://api.deepseek.com/v1
Max Steps Maximum steps before auto-stop 60
Language UI and AI response language Browser language

Permissions

Permission Reason
<all_urls> Read and interact with any website the user wants to automate
activeTab Access the currently active tab
scripting Inject content script to interact with page elements
sidePanel Display the chat interface in Chrome's side panel
storage Save API key and settings locally on your device
tabs Query the active tab's ID and URL

Your API key is stored locally and never sent anywhere except the AI API endpoint you configure.

License

MIT