Smippo Documentation

S.M.I.P.P.O. (Structured Mirroring of Internet Pages and Public Objects) is a modern website copier that captures websites exactly as they appear in your browser. Unlike traditional crawlers that nibble at raw HTML, Smippo vacuums up everything—rendering pages in a real browser, intercepting all network requests, and creating structured offline mirrors with complete fidelity.

Why Smippo?

Traditional website copiers were built for a simpler web of static HTML files. Today's websites are JavaScript-powered applications that fetch data dynamically, render content client-side, and use modern CSS techniques. Tools that only download raw HTML miss most of what these sites actually display.

Smippo launches a real browser, navigates to pages, waits for JavaScript to execute, captures all network requests, and saves the fully-rendered result. You get the page exactly as you see it in your browser.

Key Features

Vacuum Architecture

Parallel workers consume sites rapidly: up to 16 simultaneous browser instances vacuum up pages, resources, and network requests at the same time.
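To make the idea of a bounded pool of parallel workers concrete, here is a minimal sketch in TypeScript. It is a generic concurrency-limited pool, not Smippo's actual implementation: the names `runPool` and `MAX_WORKERS` are hypothetical, and the task passed in stands in for "capture one page in a browser".

```typescript
// Hypothetical sketch of a concurrency-limited worker pool.
// None of these names come from Smippo itself.
const MAX_WORKERS = 16; // upper bound mentioned in the feature description

async function runPool<T, R>(
  items: T[],
  limit: number,
  task: (item: T) => Promise<R>,
): Promise<R[]> {
  const results: R[] = new Array(items.length);
  let next = 0; // index of the next item to be claimed by a worker

  // Each worker repeatedly claims the next unprocessed item until none remain.
  async function worker(): Promise<void> {
    while (next < items.length) {
      const index = next++;
      results[index] = await task(items[index]);
    }
  }

  // Never start more workers than the limit, the cap, or the number of items.
  const workerCount = Math.max(1, Math.min(limit, MAX_WORKERS, items.length));
  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return results; // results keep the same order as the input items
}
```

Because JavaScript runs these workers on a single thread, claiming an index with `next++` needs no locking. The parallelism comes from the asynchronous work each task awaits.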

Complete Fidelity

Captures pages exactly as rendered, including CSS-in-JS, dynamic content, lazy-loaded images, and web fonts.

Smart Filtering

Filter by URL patterns, MIME types, and file sizes. Respect robots.txt. Control scope from subdomain to entire domain.
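The three filter dimensions named above (URL patterns, MIME types, file sizes) can be combined into a single predicate. The sketch below is illustrative only: `FilterRules`, `CapturedResource`, and `shouldCapture` are hypothetical names, not Smippo's real options or API, and the semantics (exclude wins over include, empty lists allow everything) are assumptions.

```typescript
// Hypothetical filter rules; field names are illustrative, not Smippo's flags.
interface FilterRules {
  include: RegExp[];   // URL must match at least one pattern (empty = allow all)
  exclude: RegExp[];   // URL must match none of these patterns
  mimeTypes: string[]; // allowed MIME types (empty = allow all)
  maxBytes: number;    // largest resource size that is still captured
}

interface CapturedResource {
  url: string;
  mimeType: string;
  sizeBytes: number;
}

function shouldCapture(resource: CapturedResource, rules: FilterRules): boolean {
  const { url, mimeType, sizeBytes } = resource;
  // Assumed precedence: an exclude match always rejects the resource.
  if (rules.exclude.some((pattern) => pattern.test(url))) return false;
  if (rules.include.length > 0 && !rules.include.some((pattern) => pattern.test(url))) {
    return false;
  }
  if (rules.mimeTypes.length > 0 && !rules.mimeTypes.includes(mimeType)) return false;
  return sizeBytes <= rules.maxBytes;
}

// Example rules: stay on example.com, skip /private/, keep HTML and PNG up to 1 MB.
// Patterns avoid the `g` flag because RegExp.test is stateful with it.
const exampleRules: FilterRules = {
  include: [/^https:\/\/example\.com\//],
  exclude: [/\/private\//],
  mimeTypes: ["text/html", "image/png"],
  maxBytes: 1_000_000,
};
```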

Built-in Server

Serve captured sites locally with the built-in web server. Browse offline mirrors with proper MIME types and directory listings.

Beautiful CLI

Interactive guided mode for beginners, progress bars, and elegant terminal output. Just run smippo to start.

HAR Files

Generates HTTP Archive files for every capture—perfect for debugging, analysis, and replay.
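As a rough illustration of the kind of analysis a HAR file allows, the sketch below summarizes a HAR log in TypeScript. It relies only on fields defined by the HAR 1.2 format (`log.entries`, `request.url`, `response.status`, `response.content.size`). The types are trimmed to what the example needs, `summarizeHar` is a hypothetical helper, and the sample data is made up. Where Smippo writes its HAR files is not covered here.

```typescript
// Minimal subset of the HAR 1.2 structure, trimmed to the fields used below.
interface HarEntry {
  request: { method: string; url: string };
  response: { status: number; content: { size: number; mimeType: string } };
}

interface Har {
  log: { version: string; entries: HarEntry[] };
}

interface HarSummary {
  requests: number; // total number of recorded requests
  failed: number;   // responses with an HTTP status of 400 or above
  bytes: number;    // sum of response content sizes
}

// Hypothetical helper: count requests, failures, and transferred content bytes.
function summarizeHar(har: Har): HarSummary {
  let failed = 0;
  let bytes = 0;
  for (const entry of har.log.entries) {
    if (entry.response.status >= 400) failed++;
    // Defensive: treat any negative size as zero rather than subtracting it.
    bytes += Math.max(0, entry.response.content.size);
  }
  return { requests: har.log.entries.length, failed, bytes };
}

// Made-up sample data; a real file would be loaded with JSON.parse.
const sampleHar: Har = {
  log: {
    version: "1.2",
    entries: [
      { request: { method: "GET", url: "https://example.com/" },
        response: { status: 200, content: { size: 1200, mimeType: "text/html" } } },
      { request: { method: "GET", url: "https://example.com/missing.png" },
        response: { status: 404, content: { size: 0, mimeType: "text/html" } } },
      { request: { method: "GET", url: "https://example.com/app.css" },
        response: { status: 200, content: { size: 300, mimeType: "text/css" } } },
    ],
  },
};

const summary = summarizeHar(sampleHar);
```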

Guides

Quickstart

Get up and running with Smippo in under 5 minutes.

Capturing Websites

Learn all the options for capturing websites with full fidelity.

Filtering

Control exactly what gets captured with URL patterns, MIME types, and size limits.

Output Structure

Understand how Smippo organizes captured files and manifests.

Core Features

Capture Command

The main command for mirroring websites. Learn all options from depth control to filtering.

Screenshot Command

Take quick screenshots of any URL with device emulation, full-page capture, and custom selectors.

Serve Command

Serve captured sites locally with the built-in web server featuring directory browsing and auto port detection.

Programmatic API

Use Smippo in your Node.js applications with the programmatic API and event system.
