Automated Browser-Based Crawling at Scale

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

Browser-Based Crawling

Define the scope of your crawl workflow with an extensive array of options. Watch crawls as they run in real time to diagnose issues and ensure you are capturing exactly the pages and content you want.

Signed, Sealed, Authenticated

Crawl outputs are digitally signed to ensure a provable chain of custody.

Always On Schedule

Schedule workflows to run crawls on a recurring basis and automatically collect snapshots of a website.

Browser Profiles

Sign into websites and archive them exactly as they appear when logged in.

Live Exclusion Editing

Stop runaway crawls from getting bogged down in crawler traps without restarting the entire crawl.

ArchiveWeb.page Integration

Send archived items directly to Browsertrix from the ArchiveWeb.page browser extension.

Create Complete Collections

Four different archived items in a list, three of them are checked and added to a collection, one item (an incomplete crawl that was stopped by the user) has been omitted.

Combine archived items created through automated crawling, ArchiveWeb.page, and other tools, for viewing and export.

Single Collaborative Workspace

Work together with colleagues to create, manage, and organize crawls.

Upload Existing Archives

Bring your existing WACZ files along!

In-App Browser-Based Replay

A screenshot of an archived item being viewed from within Browsertrix.

View archived webpages directly in the browser, exactly as they appeared when crawled.

Export to Standard Files

Export your collections to a single packaged WACZ file.

Embed In Your Own Content

Embed archives into your own content using ReplayWeb.page.

Start Web Archiving Today!

Starter

$30 per month

Get Started
  • 180 minutes of crawling time (3 hours)
  • 100GB of disk space
  • Up to 2,000 pages per crawl
  • 1 concurrent crawl
  • Community forum support

Standard

$60 per month

Get Started
  • 360 minutes of crawling time (6 hours)
  • 220GB of disk space
  • Up to 5,000 pages per crawl
  • 2 concurrent crawls
  • Community forum support

Plus

$120 per month

Get Started
  • 720 minutes of crawling time (12 hours)
  • 500GB of disk space
  • Up to 10,000 pages per crawl
  • 3 concurrent crawls
  • Community forum support

Custom

Based on requirements

Schedule a Demo
  • Increased crawling time limits
  • 1TB+ base disk space
  • Increased crawl page limits
  • 4+ concurrent crawls
  • Dedicated support available

Self Hosted

Get Started

Browse our deployment documentation to get started with your own instances of Browsertrix.

Browsertrix is open source software! Browse our source code, make your own updates, and submit changes on GitHub.


Self-Hosted Support

Support contracts for self-hosted instances are available on a case-by-case basis.