Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
Define the scope of your crawl workflow with an extensive array of options. Watch crawls as they run in real time to diagnose issues and ensure you are capturing exactly the pages and content you want.
Crawl outputs are digitally signed to ensure a provable chain of custody.
Schedule workflows to run crawls on a recurring basis and automatically collect snapshots of a website.
Sign into websites and archive them exactly as they appear when logged in.
Stop runaway crawls from getting bogged down in crawler traps without restarting the entire crawl.
Send archived items directly to Browsertrix from the ArchiveWeb.page browser extension.
Combine archived items created through automated crawling, ArchiveWeb.page, and other tools, for viewing and export.
Work together with colleagues to create, manage, and organize crawls.
Bring your existing WACZ files along!
View archived webpages directly in the browser, exactly as they appeared when crawled.
Export your collections to a single packaged WACZ file.
Embed archives into your own content using ReplayWeb.page.
$30 per month
$60 per month
$120 per month
Based on requirements
Browse our deployment documentation to get started with your own instances of Browsertrix.
Browsertrix is open source software! Browse our source code, make your own updates, and submit changes on GitHub.
Support contracts for self-hosted instances are available on a case-by-case basis.