Wayback Machine Guide – View Old Websites (2026)

The web forgets. Sites disappear, pages change, content gets deleted. But the Internet Archive's Wayback Machine remembers. Since 1996, it has been archiving the web—saving snapshots of pages so you can see what any website looked like at various points in history. Whether you're researching how sites evolved, recovering lost content, or verifying what a page actually said at a specific time, the Wayback Machine is your time machine for the internet.

TL;DR

The Wayback Machine archives web pages since 1996
Visit web.archive.org and enter any URL to see historical versions
Browse snapshots by date using the calendar interface
Use TinyUtils Wayback Fixer to batch-find archived versions of broken links
Save pages yourself with "Save Page Now" for future reference

What is the Wayback Machine?

The Wayback Machine is a digital archive of the World Wide Web, maintained by the Internet Archive, a nonprofit organization based in San Francisco. It contains over 800 billion web pages captured since 1996—one of the largest archives of human knowledge ever created.

The system works by crawling the web with automated bots, similar to how search engines index content, but instead of discarding old versions, the Wayback Machine keeps them all. Each capture is a snapshot frozen in time, showing exactly what a page contained at that moment.

How the Wayback Machine Works

Crawling and Capture

Internet Archive crawlers continuously visit websites, downloading pages and their assets (images, CSS, JavaScript). The frequency depends on various factors—popular sites might be captured multiple times per day, while obscure pages might have captures months apart.

Storage and Access

Captured pages are stored with timestamps, creating a historical record. When you request an archived page, you're served the exact HTML, CSS, and assets from that capture. Links within archived pages point to other archived content when available.

Archive URLs

Archived pages have URLs in this format:

https://web.archive.org/web/[TIMESTAMP]/[ORIGINAL_URL]

For example:

https://web.archive.org/web/20100315/https://example.com

The timestamp format is YYYYMMDDHHmmss (year, month, day, hour, minute, second). This URL structure lets you link directly to specific snapshots.

What You Can Do With It

Research Website History

See how any website evolved over time. Watch company messaging change, track feature additions, observe design trends. This is invaluable for UX research, competitive analysis, academic research on web history, or pure nostalgia (remember what Amazon looked like in 1999?).

Recover Lost Content

Accidentally deleted a page? Previous version of your site destroyed? If the Wayback Machine captured it, you can recover the content. This has saved countless websites, articles, and valuable information from permanent loss.

Fix Broken Links

When cited sources go offline, replace broken links with archived versions. Your references remain functional, and readers can still access the content you cited. This preserves the integrity of research, journalism, and any content that relies on external sources.

Verify Historical Claims

Check what a website actually said at a specific time. Useful for fact-checking ("Did they really say that on their website?"), legal matters (contractual terms, terms of service at a specific date), and journalistic investigation.

Monitor Changes

Compare snapshots to see what changed between captures. Track policy updates, terms of service modifications, pricing changes, or content revisions.

Preserve Important Pages

Use "Save Page Now" to archive pages you think might disappear or that you want to reference in the future. Create your own permanent citations.

How to Use the Wayback Machine

Basic Lookup

Go to web.archive.org
Enter the URL you want to see in the search box
Press Enter or click "Browse History"
You'll see a calendar showing all available snapshots
Click any date with captures (indicated by dots)
Select a specific time from that day
Navigate the archived site normally

Understanding the Calendar

Blue circles: Snapshots exist for that date
Circle size: Larger circles mean more captures that day
Color intensity: Indicates capture density
Click a date: Shows all captures from that day with timestamps

Direct URL Access

Use the wildcard format to see all captures:

https://web.archive.org/web/*/example.com

Or construct a direct URL to a specific date:

https://web.archive.org/web/20200115/https://example.com

Saving Pages Yourself

Use "Save Page Now" at the bottom of the Wayback Machine homepage:

Enter the URL you want to archive
Click "Save Page"
The system captures a fresh snapshot
You get a permanent archived URL to cite

Pro Tips for Effective Use

Try Different URLs

If example.com/page doesn't have captures, try:

example.com (the homepage)
example.com/page/ (with trailing slash)
www.example.com/page (with or without www)

Check Neighboring Dates

If the specific date you want isn't captured, check dates before and after. Content often didn't change between captures.

Use the CDX API

For bulk lookups, developers can query the CDX Server API:

https://web.archive.org/cdx/search/cdx?url=example.com&output=json

This returns all captures in a structured format for programmatic processing.

Export and Save

For important archived pages, save a local copy. The Wayback Machine is helpful, but it’s not a hard drive you control — captures disappear, assets go missing, and sometimes a snapshot is half a page. If it matters, keep your own backup.

What It Can't Do

Login-Required Pages

Private content behind authentication isn't captured. The crawlers can't log in, so dashboards, private profiles, and gated content aren't archived.

Everything

Not every page is archived. Sites can be too obscure to be crawled, robots.txt can block archiving, or timing can miss ephemeral content.

Dynamic Content

JavaScript-heavy sites may not render correctly. Single-page applications, interactive features, and dynamically-loaded content often don't capture properly. You might see loading spinners or broken functionality.

Real-Time

There's a delay between when content is published and when it's captured. Breaking news or rapidly-changing content may not be captured in real-time.

Removed Content

Site owners can request removal of their content from the archive. Some captures may have been deleted at the owner's request.

Using TinyUtils for Batch Archive Lookups

Have multiple broken links that need archived versions? Manual lookup is tedious. TinyUtils Wayback Fixer automates the process:

Enter your broken URLs (paste multiple at once)
The tool queries the Wayback Machine for each URL
Get archive.org URLs for all available snapshots
Export results as CSV for easy processing

This saves significant time when fixing broken links across a website after sources disappear.

Frequently Asked Questions

How often are pages archived?

It varies widely. Popular sites like major news outlets might be captured multiple times per day. Lesser-known sites might have captures months or years apart. Some sites have thousands of captures; others have none.

Can I remove my site from the archive?

Site owners can request removal. The Internet Archive respects robots.txt going forward, and will remove past captures upon valid request. This isn't automatic—you need to contact them.

Is using the archive legal?

For research, journalism, personal use, and most legitimate purposes, yes. Republishing archived content at scale may have copyright implications. The archive itself is protected as a library under US law.

Why are some images/styles broken?

The archive captures what it can at the time. Resources hosted on different domains may not have been captured, third-party services may have blocked crawling, or some resources may have failed to load during capture.

Can I trust that archived content is authentic?

The Wayback Machine captures pages as they were served at the time. However, it captures what it receives—if a site served different content to different users, the archive shows what the crawler received. For legal purposes, additional verification may be needed.

How long will the archive exist?

The Internet Archive is committed to permanent preservation. It's a nonprofit with multiple funding sources and mirror locations. However, like any organization, its future isn't guaranteed. For truly critical references, maintain your own backups.

Contributing to Web Preservation

Save Pages Proactively

When you find valuable content, use "Save Page Now" to ensure it's archived. You're contributing to the historical record.

Support the Internet Archive

The Internet Archive is a nonprofit. They accept donations, which help preserve more of the web. Consider supporting them if you find the Wayback Machine valuable.

Use Archive Links

When citing sources that might disappear, include archive links alongside original URLs. This creates redundancy and ensures your references remain accessible.

Why Use Online Archive Tools?

Batch processing: Check multiple URLs at once with TinyUtils Wayback Fixer
CSV export: Get results in a format for spreadsheet processing
No API learning: Query the archive without learning its API
Workflow integration: Quickly fix broken links across your site

Start Exploring Web History

The Wayback Machine is one of humanity's most valuable digital preservation projects. Whether you're recovering lost content, researching web history, or fixing broken links, it's an essential tool.

Need to batch-process broken links? Use TinyUtils Wayback Fixer to find archived versions efficiently. For finding broken links in the first place, start with TinyUtils Dead Link Finder.

For more on maintaining link health, see our guide to fixing broken links with Archive.org and agency link checking workflow.

Find Archived URLs

Batch process URLs with Wayback Fixer.

Open Wayback Fixer →