Category Archives: Wayback Machine – Web Archive

If You See Something, Save Something – 6 Ways to Save Pages In the Wayback Machine

In recent days many people have shown interest in making sure the Wayback Machine has copies of the web pages they care about most. These saved pages can be cited, shared, linked to – and they will continue to exist even after the original page … Continue reading

Posted in Announcements, News, Wayback Machine - Web Archive | 13 Comments

Robots.txt Files and Archiving .gov and .mil Websites

The Internet Archive is collecting webpages from over 6,000 government domains, over 200,000 hosts, and feeds from around 10,000 official federal social media accounts. Some have asked if we ignore URL exclusions expressed in robots.txt files. The answer is a … Continue reading

Posted in News, Wayback Machine - Web Archive | 3 Comments

Please: Help Build the 2016 U.S. Presidential Election Web Archive

Help us build a web archive documenting reactions to the 2016 Presidential Election. You can submit websites and other online materials, and provide relevant descriptive information, via this simple submission form. We will archive and provide ongoing access to these … Continue reading

Posted in Announcements, Archive-It, News, Wayback Machine - Web Archive | 8 Comments

I CAN HAZ MEME HISTORY??

Jason Scott presents Internet Memes of the last 20 Years at the Internet Archive’s 20th anniversary celebration. ——– It’s always going to be an open question as to what parts of culture will survive beyond each generation, but there’s very … Continue reading

Posted in News, Wayback Machine - Web Archive | 2 Comments

Defining Web pages, Web sites and Web captures

The Internet Archive has been archiving the web for 20 years and has preserved billions of webpages from millions of websites. These webpages are often made up of, and link to, many images, videos, style sheets, scripts and other web objects. … Continue reading

Posted in Announcements, News, Wayback Machine - Web Archive | 4 Comments

The Hidden Shifting Lens of Browsers

Some time ago, I wrote about the interesting situation we had with emulation and Version 51 of the Chrome browser – that is, our emulations stopped working in a very strange way and many people came to the Archive’s inboxes … Continue reading

Posted in Emulation, Technical, Wayback Machine - Web Archive | 2 Comments

No More 404s! Resurrect dead web pages with our new Firefox add-on.

Have you ever clicked on a web link only to get the dreaded “404 Document not found” (dead page) message? Have you wanted to see what that page looked like when it was alive? Well, now you’re in luck. Recently the Internet Archive … Continue reading

Posted in Announcements, Wayback Machine - Web Archive | 10 Comments

Fixing Broken Links on the Internet

Today the Internet Archive announces a new initiative to fix broken links across the Internet.  We have 360 billion archived URLs, and now we want you to help us bring those pages back out onto the web to heal broken … Continue reading

Posted in Announcements, News, Wayback Machine - Web Archive | 13 Comments

Blacked Out Government Websites Available Through Wayback Machine

  (from the Internet Archive’s Archive-it group: Announcing the first ever Archive-It US Government Shutdown Notice Awards!  ) Congress has caused the U.S. federal government to shut down and important websites have gone dark.  Fortunately, we have the Wayback Machine to … Continue reading

Posted in Announcements, News, Wayback Machine - Web Archive | Tagged , | 32 Comments

80 terabytes of archived web crawl data available for research

Internet Archive crawls and saves web pages and makes them available for viewing through the Wayback Machine because we believe in the importance of archiving digital artifacts for future generations to learn from.  In the process, of course, we accumulate … Continue reading

Posted in News, Wayback Machine - Web Archive | 43 Comments

HTTP Archive joins with Internet Archive

It was announced today that HTTP Archive has become part of Internet Archive. The Internet Archive provides an archive of web site content through the Wayback Machine, but we do not capture data about the performance of web sites.  Steve … Continue reading

Posted in News, Wayback Machine - Web Archive | 6 Comments

Want to discuss the Wayback Machine or Internet Archive’s web archive?

Over on the web group’s blog is a post inviting input on the Wayback Machine and Internet Archive’s web archive. You can post comments and suggestions. There’s also some useful links. Check it out at http://iawebarchiving.wordpress.com/2010/07/06/wayback-machine-web-archiving-open-thread-july-2010/ -Jeff Kaplan

Posted in News, Wayback Machine - Web Archive | 3 Comments

New Firefox Add-on for searching the Wayback Machine

Fellow time travelers, We have a new Firefox add-on that allows you to search the Wayback Machine from your browser. You can get it at: https://addons.mozilla.org/en-US/firefox/addon/162148/. For those who have yet to travel back in time, the Internet Archive Wayback … Continue reading

Posted in News, Wayback Machine - Web Archive | 8 Comments

GeoCities, Preserved!

There’s a chance that in the 1990s, you were more familiar with neighborhoods on GeoCities than with the neighborhoods in your own town. As one of the most popular and oldest (nearly 15 years running) sites for self expression on … Continue reading

Posted in Wayback Machine - Web Archive | Tagged , | 15 Comments

Wayback Machine comes to life in new home

The Wayback Machine is a 150 billion page web archive with a front end to serve it through the archive.org website. Today the new machine came to life, so if you using the service, you are using a 20′ by … Continue reading

Posted in Announcements, Wayback Machine - Web Archive | 1 Comment

Wayback Machine has 85 Billion Archived Webpages

The Internet Archive’s Wayback Machine now has 85,898,456,616 archived web objects in it, and is available, as always, to the public for free. A snapshot of the World Wide Web is taken every 2 months and donated to the Internet … Continue reading

Posted in Announcements, Wayback Machine - Web Archive | 4 Comments