Getting Started
This guide walks you through installing BrowseGenius locally, loading it into Chrome, and performing the first AI-generated run.
Prerequisites
- Node.js ≥ 18, plus either npm or Yarn 1.x
- A Chrome or Chromium-based browser with Developer Mode enabled
- An OpenAI API key that has access to GPT-4.1, GPT-4o, or the o1 family
- Optional: A BrowseGenius account for cloud sync and cross-device access
- Optional: An OpenPipe API key if you want to log prompts/responses
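The Node.js requirement above can be checked from a terminal before installing anything. The following is a minimal sketch, not part of BrowseGenius itself: the helper name `node_major_at_least` is hypothetical, and it only compares the major version number.

```shell
# Hypothetical helper (not shipped with BrowseGenius): succeeds when a Node.js
# version string such as "v18.17.0" has a major version >= the given minimum.
node_major_at_least() {
  version="${1#v}"          # strip the leading "v" printed by `node --version`
  minimum="$2"
  major="${version%%.*}"    # keep everything before the first dot
  # Reject empty or non-numeric input rather than guessing.
  case "$major" in
    ''|*[!0-9]*) return 1 ;;
  esac
  [ "$major" -ge "$minimum" ]
}

# Only query the local toolchain when node is actually on PATH.
if command -v node >/dev/null 2>&1; then
  if node_major_at_least "$(node --version)" 18; then
    echo "Node.js $(node --version) satisfies the >= 18 requirement"
  else
    echo "Node.js $(node --version) is older than 18" >&2
  fi
fi
```

The function takes the version as an argument instead of calling `node` itself, so it can be reused in CI scripts where the version string comes from elsewhere.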
Install dependencies
    yarn install

This installs both the extension dependencies and the VitePress documentation stack.
Build & load the extension
- Generate the unpacked build:

      yarn start

  The dev server continuously writes bundles into build/ and watches for file changes.
- Open chrome://extensions, enable Developer mode, and click Load unpacked.
- Select the build/ directory created by the dev server.
- Pin the extension to the toolbar for quick access (optional).
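Chrome only accepts an unpacked directory that has a manifest.json at its root, so it can help to confirm the dev server has finished its first write before clicking Load unpacked. A small sketch, assuming the build/ directory named above; the function name `build_is_loadable` is hypothetical:

```shell
# Hypothetical helper: succeeds when the given directory exists and has a
# manifest.json at its root, which Chrome requires for "Load unpacked".
build_is_loadable() {
  dir="$1"
  [ -d "$dir" ] && [ -f "$dir/manifest.json" ]
}

# Check the build/ directory that the dev server writes to.
if build_is_loadable build; then
  echo "build/ is ready for Load unpacked"
else
  echo "build/manifest.json not found; is 'yarn start' still compiling?" >&2
fi
```

This checks only for the presence of the manifest, not that the bundles are complete or valid.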
Configure AI providers
- Open the BrowseGenius popup (click the extension icon or use Cmd+Shift+Y / Ctrl+Shift+Y)
- Click the Settings icon
- Enter your OpenAI API Key - stored securely in browser storage only
- Select your preferred model (GPT-4.1, GPT-4o, GPT-4o mini, o1, or o1-mini)
- Optional: Enter your Extension API Key (starts with bgx_) for cloud sync
- Click Save Keys
Cloud Sync Setup (Optional)
To enable cloud sync and cross-device access:
- Create a BrowseGenius account:

      curl -X POST https://api.browsegenius.com/api/v1/auth/signup \
        -H "Content-Type: application/json" \
        -d '{"email":"your@email.com","password":"your_password","name":"Your Name"}'

- Log in and generate an Extension API Key:

      # Get JWT token
      curl -X POST https://api.browsegenius.com/api/v1/auth/login \
        -H "Content-Type: application/json" \
        -d '{"email":"your@email.com","password":"your_password"}'

      # Create extension key (JSON body, so send the same Content-Type header as above)
      curl -X POST https://api.browsegenius.com/api/v1/extension-keys \
        -H "Authorization: Bearer YOUR_JWT_TOKEN" \
        -H "Content-Type: application/json" \
        -d '{"name":"My Browser"}'

- Copy the API key (format: bgx_...) and paste it into the extension Settings.
See Backend Integration for full details.
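Before pasting a key into Settings, a quick prefix check can catch a copied JWT or OpenAI key that ended up in the wrong field. This is a sketch under one assumption only, namely the bgx_ prefix stated above; the function name `looks_like_extension_key` is hypothetical, and the real key format may have constraints (length, character set) that this guide does not specify.

```shell
# Hypothetical check: succeeds when the value starts with "bgx_" and has at
# least one more character after the prefix. Nothing else is validated.
looks_like_extension_key() {
  case "$1" in
    bgx_?*) return 0 ;;
    *) return 1 ;;
  esac
}

key="bgx_example"   # placeholder value, not a real key
if looks_like_extension_key "$key"; then
  echo "Value has the expected bgx_ prefix"
else
  echo "Value does not look like an Extension API Key" >&2
fi
```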
First capture session
- Navigate to a representative page in your target application.
- Click Capture screen in the popup. Repeat until you have up to five pivotal screens (auth, dashboards, transactions, etc.).
- Annotate each capture with notes the LLM should consider (environment, data setup, edge cases).
Generate and run a suite
- Press Generate plan. BrowseGenius templatizes the DOM, composes a prompt for your selected model, and returns a prioritized list of test cases.
- Review the proposed flows, adjust priorities if needed, and click Run suite.
- Open the DevTools panel → BrowseGenius to watch step-by-step progress, actions, and validations.
Export evidence
After a run completes, download the HTML + JSON report bundle from the popup. The bundle contains logs, metadata, screenshots/DOM captures, and test summaries for easy sharing with your team.
Exports rely on the chrome.downloads API. Ensure the browser grants download permission to the extension.
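The chrome.downloads API is only available to extensions whose manifest declares the "downloads" permission. If exports fail, one rough way to check the local build is a text search of the manifest. The sketch below assumes the manifest lives at build/manifest.json and uses a hypothetical function name; it is a plain grep and not a JSON parser, so treat a match as a hint only.

```shell
# Hypothetical helper: succeeds when the manifest file exists and contains the
# quoted string "downloads". Entries such as "downloads.open" do not match,
# because the closing quote must follow the word directly.
manifest_declares_downloads() {
  manifest="$1"
  [ -f "$manifest" ] && grep -q '"downloads"' "$manifest"
}

# Assumed location: the manifest at the root of the dev-server output.
if manifest_declares_downloads build/manifest.json; then
  echo "manifest.json mentions the downloads permission"
else
  echo "no \"downloads\" entry found in build/manifest.json" >&2
fi
```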