Smippo Documentation
S.M.I.P.P.O. (Structured Mirroring of Internet Pages and Public Objects) is a modern website copier that captures websites exactly as they appear in your browser. Unlike traditional crawlers that nibble at raw HTML, Smippo vacuums up everything—rendering pages in a real browser, intercepting all network requests, and creating structured offline mirrors with complete fidelity.
Why Smippo?
Traditional website copiers were built for a simpler web of static HTML delivered straight from the server. Today's websites are JavaScript-powered applications that fetch data dynamically, render content client-side, and use modern CSS techniques. A tool that only downloads the raw HTML misses everything the page builds after it loads.
Smippo launches a real browser, navigates to pages, waits for JavaScript to execute, captures all network requests, and saves the fully-rendered result. You get the page exactly as you see it in your browser.
Key Features
Vacuum Architecture
Parallel workers consume sites rapidly, with up to 16 simultaneous browser instances vacuuming pages, resources, and network requests.
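The worker cap described above can be pictured as a bounded pool. The following is a minimal Python sketch of that general pattern, not Smippo's actual implementation: `capture_page` and `capture_all` are hypothetical names, and the capture itself is simulated with a short sleep instead of a real browser.

```python
import asyncio

MAX_WORKERS = 16  # the cap quoted above; everything else here is illustrative


async def capture_page(url: str) -> str:
    """Hypothetical stand-in for one browser capture."""
    await asyncio.sleep(0.01)  # simulate page load and rendering
    return f"captured {url}"


async def capture_all(urls, max_workers: int = MAX_WORKERS):
    """Capture every URL, running at most max_workers captures at once."""
    limiter = asyncio.Semaphore(max_workers)

    async def bounded(url: str) -> str:
        # Each task waits here until one of the max_workers slots is free.
        async with limiter:
            return await capture_page(url)

    # gather returns results in the same order as the input URLs.
    return await asyncio.gather(*(bounded(u) for u in urls))


if __name__ == "__main__":
    pages = [f"https://example.com/page/{i}" for i in range(40)]
    results = asyncio.run(capture_all(pages))
    print(len(results), "pages captured")
```

A semaphore keeps the sketch short; a queue with a fixed set of worker tasks would serve equally well when newly discovered links need to be fed back into the crawl.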
Complete Fidelity
Captures pages exactly as rendered, including CSS-in-JS, dynamic content, lazy-loaded images, and web fonts.
Smart Filtering
Filter by URL patterns, MIME types, and file sizes. Respect robots.txt. Control scope from subdomain to entire domain.
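To make the three filter dimensions concrete, here is a minimal Python sketch of how URL patterns, MIME types, and a size limit could combine into one decision. It illustrates the idea only: `CaptureFilter` and its field names are hypothetical and are not Smippo options, and glob-style matching is an assumption about pattern syntax.

```python
from dataclasses import dataclass, field
from fnmatch import fnmatchcase
from typing import List, Optional


@dataclass
class CaptureFilter:
    """Hypothetical filter; field names are illustrative, not Smippo options."""
    include: List[str] = field(default_factory=lambda: ["*"])
    exclude: List[str] = field(default_factory=list)
    mime_types: List[str] = field(default_factory=lambda: ["*/*"])
    max_bytes: Optional[int] = None  # None means no size limit

    def allows(self, url: str, mime: str, size: int) -> bool:
        """Return True only if the resource passes every check."""
        if not any(fnmatchcase(url, p) for p in self.include):
            return False  # outside the included scope
        if any(fnmatchcase(url, p) for p in self.exclude):
            return False  # explicitly excluded
        if not any(fnmatchcase(mime, p) for p in self.mime_types):
            return False  # unwanted content type
        if self.max_bytes is not None and size > self.max_bytes:
            return False  # too large
        return True


if __name__ == "__main__":
    rules = CaptureFilter(
        include=["https://example.com/*"],
        exclude=["*/admin/*"],
        mime_types=["text/*", "image/*"],
        max_bytes=1_000_000,
    )
    print(rules.allows("https://example.com/blog/post", "text/html", 2048))
    print(rules.allows("https://example.com/big.png", "image/png", 5_000_000))
```

Exclusions are checked after inclusions so that a narrow exclude pattern can carve a hole out of a broad include pattern.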
Built-in Server
Serve captured sites locally with the built-in web server. Browse offline mirrors with proper MIME types and directory listings.
Beautiful CLI
Interactive guided mode for beginners, progress bars, and elegant terminal output. Just run smippo to start.
HAR Files
Generates HTTP Archive files for every capture—perfect for debugging, analysis, and replay.
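As an example of the kind of analysis a HAR file allows, the Python sketch below totals response body sizes per MIME type. It relies only on the standard HAR 1.2 layout (`log.entries[].response.content`); the sample data is made up and trimmed to the fields the function reads, and `bytes_by_mime` is a hypothetical helper, not part of Smippo.

```python
from collections import Counter

# Made-up sample, trimmed to the fields used below. A real HAR entry also
# carries request details, timings, headers, and more.
SAMPLE_HAR = {
    "log": {
        "version": "1.2",
        "entries": [
            {"response": {"content": {"size": 5120, "mimeType": "text/html; charset=utf-8"}}},
            {"response": {"content": {"size": 20480, "mimeType": "image/png"}}},
            {"response": {"content": {"size": 1024, "mimeType": "image/png"}}},
        ],
    }
}


def bytes_by_mime(har: dict) -> Counter:
    """Sum response body sizes per MIME type, ignoring parameters like charset."""
    totals = Counter()
    for entry in har["log"]["entries"]:
        content = entry["response"]["content"]
        mime = content.get("mimeType", "unknown").split(";")[0].strip()
        # Guard against negative or missing sizes so they cannot skew totals.
        totals[mime] += max(content.get("size", 0), 0)
    return totals


if __name__ == "__main__":
    for mime, total in bytes_by_mime(SAMPLE_HAR).most_common():
        print(f"{mime}: {total} bytes")
```

To run this against a real capture, load the file with `json.load` and pass the resulting dictionary to `bytes_by_mime`; where Smippo writes its HAR files is not covered here.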
Guides
Filtering
Control exactly what gets captured with URL patterns, MIME types, and size limits.
Core Features
Capture Command
The main command for mirroring websites. Learn all options from depth control to filtering.
Screenshot Command
Take quick screenshots of any URL with device emulation, full-page capture, and custom selectors.
Serve Command
Serve captured sites locally with the built-in web server featuring directory browsing and auto port detection.