Lost Landscapes of San Francisco: Fundraiser Benefitting Internet Archive Dec 18th

Rick Prelinger’s Lost Landscapes of San Francisco is a movie happening that brings old-time San Francisco footage and our community together in an interactive crowd-driven event. The screening takes place in the majestic Internet Archive building, and your ticket donation will benefit the Internet Archive, which suffered a major fire in November. Please give generously to support the rebuilding effort.

December 18, 2013
6pm Reception
7:30pm Film

300 Funston Ave.
San Francisco, CA 94118

Get tickets through Brown Paper Tickets.

Lost Landscapes returns for its 8th year, bringing together both familiar and unseen archival film clips showing San Francisco as it was and is no more. Blanketing the 20th-century city from the Bay to Ocean Beach, this screening includes newly-discovered images of Playland and Sutro Baths; the waterfront; families living and playing in their neighborhoods; detail-rich streetscapes of the late 1960s; the 1968 San Francisco State strike; Army and family life in the Presidio; buses, planes, trolleys and trains; and a selected reprise of greatest hits.

As usual, the viewers make the soundtrack — audience members are asked to identify places and events, ask questions, share their thoughts, and create an unruly interactive symphony of speculation about the city we’ve lost and the city we’d like to live in.


Posted in Announcements, Event, Movie Archive, News | 5 Comments

Bitcoin Friday Sale at the Internet Archive Store

While the rest of the world lines up early for Black Friday, we here at the Internet Archive Store offer online deals on Bitcoin Friday. From Friday, November 29 through Sunday, December 1, you can purchase our two most popular items on sale when you use Bitcoin. The Internet Archive Hat is $5.12 off and the Internet Archive Sweatshirt is $10.24 off. Find the Bitcoin Friday sale at store.archive.org.

To get the discount at The Internet Archive Store use:
Coupon code: hat512
Coupon code: sweat1024

Posted in News | 1 Comment

Start of Community Wireless Backbone in Richmond CA

With the cooperation of the City of Richmond (thank you!), a group of volunteers and Internet Archive staff are starting to set up backbone repeaters in Richmond, California to build a Community Wireless network. Here is Colyer Dupont showing a “tier 2” dish that will then be used to repeat to the neighborhood. The equipment came from the Internet Archive and was installed on Ormond’s property by Ormond, John Easterday, and Dupont.

In the next several months, as this becomes easier and the reliability is proven or improved, we hope to have our first users come online.

The Mayor and City Council of Richmond have made a first step toward seeing how they will help. To get involved in the Richmond project, we have a forum on this page: https://archive.org/communitywireless



Posted in News | 2 Comments

A Dream to Preserve TV News, on the Road to Realization… with Your Help

On The Media’s TLDR
Huffington Post
Fast Company

We are about to receive a remarkable private collection of videotaped U.S. television news that spans 35 years. We welcome contributions of TV news recorded before the year 2001 to help broaden our research library.

Marion Marguerite Stokes, a librarian, social justice advocate and TV interview program host, believed that it was vital to preserve television news.

Mrs. Stokes started recording news at home in 1977 — and never stopped. Before her death in December 2012 she recorded 140,000 video cassettes. Her family searched for a home for her unique collection and found us in June.

It is a unique collection of local news from Boston (1977-1986) and Philadelphia (1986-2012), as well as all the national news. The Boston era is particularly notable for the busing/desegregation strife that raged throughout.

Marion Stokes’ amazing commitment to preserve television news, a passion that few at the time entirely understood, shaped the daily lives of her children growing up and, later, visits of her grandchildren. Her dream of using this collection for the public good can now be fulfilled.

In just a few days, four large shipping containers on trucks will be winding their way across the country to our Richmond, California physical archive. The digitization of such a huge collection will take a number of years and funding we have yet to raise.

Join us in helping to realize Marion Stokes’ gift to the future and make it available to all, forever, for free.  Please consider making a contribution, right now!

Posted in Announcements, News | 2 Comments

NSA Comedy Tour with Will Durst! A Night of Comedy, Ethics & Tech on Dec. 11th in SF

EthicsInTech presents a fun night of Comedy, Ethics & Technology to help protect the Fourth Amendment and our constitutional rights and freedoms. The goal of the event is to entertain, educate and bring to light issues concerning ethical use of technology and how it can help or curtail individual rights and freedoms. This holiday charity event is focused on increasing public awareness through expert panel discussions on how those rights are being violated by the National Security Agency. “NSA Comedy Tour™” is focused on promoting the issues, causes and challenges that humanity faces as it adopts the ever-changing tools and technologies that have taken us by storm.


Wednesday, December 11, 2013
Reception 6:00pm
Comedy and Panel 7:00-9:00 PM
Internet Archive
300 Funston Ave, San Francisco, 94118

Please buy a ticket, have fun, and support good causes!

Panel Members:

Will Durst, America’s Funniest Political Comedian
Cindy Cohn, Legal Director of Electronic Frontier Foundation
Brewster Kahle, Digital Librarian and Founder of Internet Archive
Vahid Razavi, Founder of EthicsInTech & BizCloud®
Inder Comar, WitnessIraq.com
Janet Weil, CodePink.org

A percentage of all ticket proceeds from this event will be donated to organizations & causes below (selection process by ticket buyers):

Electronic Frontier Foundation
Internet Archive
Veterans For Peace


Posted in Announcements, News | 1 Comment

Let the Robots Read! A Victory for Fair Use

A resounding judgement in the Google Books case means that the act of digitizing books is not in and of itself infringing. In legal-speak, the judge ruled that digitizing books is “fair”. This is a big deal in that it allows machines, or robots, to read books. What someone does with the book after it is in digital form might break the law, but just getting it in digital form does not. This is helpful to the Internet Archive’s book project, digital libraries in general, and the public at large.

How did we get here? There were book scanning projects in the early 2000s, including the Million Books Project and Project Gutenberg (both of which Internet Archive was involved in), but many of these did not venture beyond out-of-copyright books. Google boldly started scanning all books, but was sued by the Authors Guild and AAP. The parties proposed a settlement that would have created a monopoly and changed copyright law, and it was therefore rejected by Judge Chin. The Internet Archive was happy with this decision because we did not want to see central control of all out-of-print or orphan works.

At this point, without a settlement, the case proceeded to determine whether Google’s digitizing of in-copyright works and showing “snippets” of pages infringes on the monopoly rights bestowed on publishers and authors by the government.

Judge Chin soundly ruled that what Google was doing was not infringing. The judgement is quite readable, and is recommended. The Authors Guild has said they will appeal.

What does this mean? It means that having machines read books is allowable under United States law. This is important because more and more research is being done with the assistance of computers. If computers could not be used to help in research by storing full works in memory, then people would be back to writing quotations on note cards or typing short sections into their computers. Clearly this does not make sense, and, thankfully, Judge Chin thought so too.

The Internet Archive has been digitizing modern books for many years for the blind and dyslexic, but also to aid in lending books to the public. This decision will not directly affect what the Internet Archive is doing, but puts some possible legal issues on more solid ground.

Let the robots read!   A clear victory for fair use.




Posted in Announcements, News | 1 Comment

Birthday of the Defensive Patent License: Friday, Nov 15, 4:30-8:00 in SF


(Video of event)


Please join us to celebrate the birthday of the
Defensive Patent License (“DPL”)!

Short Program and Birthday Party
Friday, November 15, 2013

Panel Discussion 4:30-6:00 PM
Reception 6:00-8:00 PM

Internet Archive
300 Funston Ave, San Francisco, 94118

Click here for more information

Click here to RSVP


DPL Launch Conference
Friday, February 28, 2014

Brower Center
2150 Allston Way, Berkeley, 94704

Posted in Announcements | 1 Comment

Fire Update: Lost Many Cameras, 20 Boxes. No One Hurt.

Scanning Center Fire

Scanning Center with Fire Damage to Left of Main Building

As fires go, we were lucky.   We are still assessing what happened but this is where we stand:

* No one was hurt.

* Lost a 130 sq meter side-building (1300 sq feet) that held scanning equipment.  We operate 30 scanning centers, and this was one of them.    Our offices were not affected.

* We lost maybe 20 boxes of books and film, some irreplaceable, most already digitized, and some replaceable. From our point of view this is the worst part. We lost an array of cameras, lights, and scanning equipment worth hundreds of thousands of dollars. Insurance will cover some but not all of this.

* We do not know the cause, but there is no evidence of foul play.

* An outpouring of support has led to over 1500 donations totaling over $60,000 in the first 2 days. We also have received new offers to digitize more materials that will help keep our staff working. This is so helpful. Thank you for your confidence and support in our mission.

* No servers were affected.   If some had been damaged, we have backups in different locations.   An electrical conduit was damaged, but all digital services were functional within 6 hours, fully operational in 10 hours.

* All employees of the scanning center were back scanning again, using repurposed equipment, within 48 hours.   Our administrative and computer operations staff have worked hard to get life back to some sort of normal for everyone.   We are rattled, but back being productive.   The side of our neighbor’s building was damaged so the tenants will be disrupted until that is repaired.

* Despite the fire, we were able  to hold a pre-planned event celebrating the birthday of Aaron Swartz 3 days after the fire.

All in all we were lucky, and we are very thankful for the support from everyone. While having a fire is rattling, and expensive, we have had little significant operational disruption. We are looking for lessons to learn and will apply them.

Let’s keep making copies: a key to preservation and access.

Thank you, all.

Scanning Center, beside the Main Archive Building, was Damaged
Original Post:

This morning at about 3:30 a.m. a fire started at the Internet Archive’s San Francisco scanning center.  The good news is that no one was hurt and no data was lost.  Our main building was not affected except for damage to one electrical run.  This power issue caused us to lose power to some servers for a while.

Some physical materials were in the scanning center because they were being digitized, but most were in a separate locked room or in our physical archive and were not lost.   Of those materials we did unfortunately lose, about half had already been digitized.   We are working with our library partners now to assess.

The San Francisco Fire Department was fast and great.   Our city supervisor and a representative of the mayor’s office have come by to check up on us.    There has been a pulling together on the Internet as news has spread.

This episode has reminded us that digitizing and making copies are good strategies for both access and preservation.  We have copies of the data in the Internet Archive in multiple locations, so even if our main building had been involved in the fire we still would not have lost the amazing content we have all worked so hard to collect.

Fire in the Scanning Center

An early estimate shows we may have lost about $600,000 worth of high end digitization equipment, and we will need to repair or rebuild the scanning building.   It is in difficult times like these that we turn to our community.

What help could we use?

  • Funding.   Your donations will help us rebuild the scanning capabilities in books, microfilm, and movies.
  • Scanning.  The employees affected by the fire will need continued digitization work at our alternate location while we recover.

Posted in Announcements | 124 Comments

Please Come: Aaron Swartz Reception at the Internet Archive Fri Nov 8th in SF

(CNET Article)

On November 8th from 6:30 pm to 9:00 pm, you are invited to a reception, talks, and hackathon at the Internet Archive, 300 Funston Ave, San Francisco.

Suggested donation of $5, Bitcoin accepted.

5PM Hackathon
6:30PM Reception
7:30PM Brief Talks

In memory of our dear friend and collaborator, Aaron Swartz, whose social, technical, and political insights still touch us daily, Noisebridge and the Internet Archive will be hosting a reception on what would have been Aaron’s 27th birthday, Friday, November 8, 2013.

(Please RSVP)

Posted in Announcements, News | 2 Comments

Reader Privacy at the Internet Archive

(NYTimes article, Video of Announcement with Daniel Ellsberg)

The Internet Archive has extended our reader privacy protections by making the site encrypted by default. Visitors to archive.org and openlibrary.org will use https unless they explicitly ask for http.

For several years, the Internet Archive has tried to avoid keeping Internet Protocol (IP) addresses of our readers. Web servers and other software that interact with web users record IP addresses in their logs by default, which leaves a record that makes it possible to reconstruct who looked at what. The web servers on Archive.org and OpenLibrary.org were modified to take the IP addresses and encrypt them with a key that changes each day, making it very difficult to reconstruct any user’s behavior. This approach still allows us to know how many people have used our services (now over 3 million a day!) but not to know who is who or where readers are coming from. For those who are uploading or writing to our services, we do keep some IP addresses for a short period to help us battle spam. For books that are checked out from our Open Library service, we record which patron has checked out the book but not the IP address of their computer.
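
A minimal sketch of the daily-key idea described above, in Python. The post does not say which cipher or key handling the Archive uses, so this stand-in swaps in a keyed hash (HMAC-SHA256) rather than reversible encryption; the class and method names are hypothetical.

```python
import hashlib
import hmac
import secrets
from datetime import date


class DailyIPAnonymizer:
    """Hypothetical helper: turns IP addresses into tokens using a daily key."""

    def __init__(self):
        self._key_day = None
        self._key = b""

    def _key_for(self, day):
        # Generate a fresh random key when the day changes and forget the
        # previous one, so earlier tokens cannot be recomputed or linked.
        if day != self._key_day:
            self._key = secrets.token_bytes(32)
            self._key_day = day
        return self._key

    def token(self, ip, day=None):
        # Assumption: a keyed hash stands in for the "encryption" in the post.
        day = day or date.today()
        digest = hmac.new(self._key_for(day), ip.encode("ascii"), hashlib.sha256)
        return digest.hexdigest()[:16]


anonymizer = DailyIPAnonymizer()
today = date(2013, 10, 25)
log_tokens = [anonymizer.token(ip, today) for ip in ("192.0.2.1", "192.0.2.7", "192.0.2.1")]
unique_visitors = len(set(log_tokens))  # distinct readers seen that day
```

Because the same address maps to the same token within a day, counting distinct tokens still gives a daily visitor count, while discarding the previous day’s key keeps tokens from being linked across days.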

Today we are going further than this.    Based on the revelations of bulk interception of web traffic as it goes over the Internet,  we are now protecting the reading behavior as it transits over the Internet by encrypting the reader’s choices of webpages all the way from their browser to our website.   We have done this by implementing the encrypted web protocol standard, https, and making it the default.  It is still possible to retrieve files via http to help with backward compatibility, but most users will soon be using the secure protocol.

Similarly, users of the Wayback Machine will use the secure version by default, but can use the http version, which will help play back some complicated webpages.

This is in line with the principles from the ALA and a campaign by the EFF.



Posted in Announcements, News | 9 Comments

Microcomputer Software Lives Again, This Time in Your Browser

The miracle is now so commonplace that it’s invisible: we have the ability to watch video, listen to music, and read documents right in our browsers. If you get a hankering to hear some old-time radio, watch classic television programs, or read some classic children’s books, you’re just a couple of clicks away from having them right there in front of you. Not so with classic software. To learn and experience older programs, you have to track down the hardware and media to run them, or download and install an emulator and acquire cartridge or floppy images to load as you boot up the separate emulator program, outside of the browser. Unlike film, video, or audio, software has been a slower, more involved thing to experience.

Until now.


JSMESS is a JavaScript port of the MESS emulator, a mature and breathtakingly flexible computer and console emulator that has been in development for over a decade and a half by hundreds of volunteers. The MESS emulator runs on a large variety of platforms, and is now able to run embedded in most modern browsers, including Firefox, Chrome, Safari and Internet Explorer.

Today, the Internet Archive announces the Historical Software Archive, a collection of prominent and historically notable pieces of software, able to be run immediately in your browser.  They range from pioneering applications to obscure forgotten utilities, and from peak-of-perfection designs to industry-crashing classics.


Turning computer history into a one-click experience bridges the gap between understanding these older programs and making them available in a universal fashion. Acquisition, for a library, is not enough – accessibility is where knowledge and lives change for the better. The JSMESS interface lets users get to the software in the quickest way possible.

We asked a number of people to look at the Historical Software section, and here were their comments:

“Bringing microcomputer software back from floppy drives and cassette tapes is an important task not just for nostalgia but so we can learn from the good work of tens of thousands of people in our not-so-distant past.   The Internet Archive’s first steps towards bringing it up in a web browser is very encouraging and we at DigiBarn look forward to working with the Archive to bring the best of that era back again.”

- Dr. Bruce Damer, Curator, DigiBarn Computer Museum

“We have come a long way in digital and software preservation – far enough along that problems of discovery and access are looming on the horizon.  It’s comforting to know that the Internet Archive is developing solutions for these problems, so that people can use the software we save.”

- Henry Lowood, Curator for History of Science & Technology Collections, Stanford University Libraries

“The Internet Archive has given us a remarkable opportunity to make the past present once again through its in-browser emulation. Now enthusiasts, students, scholars, historians from all corners of the globe can quickly and easily access software that would normally require fairly sophisticated technological expertise. I expect we will soon recognize this as a crucial development in digital preservation and access.”

- Lori Emerson, Media Archaeology Lab at the University of Colorado

“Emulation in a browser means embedding digital history in the everyday experience of surfing the Web. Not as screenshots or scans, but as living history, dynamic and interactive, inviting and even seductive. I look forward to weird wormholes and portals into our past appearing everywhere.”

- Matt Kirschenbaum, Associate Director, Maryland Institute for Technology in the Humanities (MITH)

“The team at the Internet Archive have managed not just to preserve some of the most memorable bits and bytes of the last 3 decades of personal computing, they have given us all a way to execute them in a browser.  The past is now  playable at a stable URL.”

- Doug Reside, Digital Curator for the Performing Arts, NYPL

“The Internet Archive is one of the most interesting and important new repositories for historians, curators and anyone interested in the preservation of recent culture.  The emulator is an exceptional new tool that will make possible all kinds of investigations that heretofore were limited to specialists.  It is a wonderful achievement.”

- Deborah Douglas, Director of Collections, MIT Museum

Many, many individuals have contributed to the JSMESS project. The project makes extensive use of the Emscripten compiler project, headed by Alon Zakai at Mozilla.org. MESS is the result of years of effort by hundreds of contributors, a number of them anonymous, who have continued to work daily to provide the most accurate emulation of historical machinery. JSMESS and MESS are not affiliated projects. The JSMESS team includes Justin de Vesine, John Vilk, Andre D, Justin Kerk, Vitorio Miliano, and Jason Scott; countless others have contributed documentation, testing and feedback about the functioning of the project. Integration with the Internet Archive’s internals is the result of efforts by Alex Buie, Hank Bromley, Samuel Stoller and Tracey Jaquith.

Update: The introduction of the Historical Software Collection and JSMESS has been covered in The Register, Engadget, PC World, Slashgear, and The Verge (twice!)

Posted in Announcements, News, Software Archive | 94 Comments

Fixing Broken Links on the Internet

No More 404s

Today the Internet Archive announces a new initiative to fix broken links across the Internet.  We have 360 billion archived URLs, and now we want you to help us bring those pages back out onto the web to heal broken links everywhere.

When I discover the perfect recipe for Nutella cookies, I want to make sure I can find those instructions again later.  But if the average lifespan of a web page is 100 days, bookmarking a page in your browser is not a great plan for saving information.  The Internet echoes with the empty spaces where data used to be.  Geocities – gone.  Friendster – gone.  Posterous – gone.  MobileMe – gone.

Imagine how critical this problem is for those who want to cite web pages in dissertations, legal opinions, or scientific research.  A recent Harvard study found that 49% of the URLs referenced in U.S. Supreme Court decisions are dead now.  Those decisions affect everyone in the U.S., but the evidence the opinions are based on is disappearing.

In 1996 the Internet Archive started saving web pages with the help of Alexa Internet.  We wanted to preserve cultural artifacts created on the web and make sure they would remain available for the researchers, historians, and scholars of the future.  We launched the Wayback Machine in 2001 with 10 billion pages.  For many years we relied on donations of web content from others to build the archive.  In 2004 we started crawling the web on behalf of a few, big partner organizations and of course that content also went into the Wayback Machine.  In 2006 we launched Archive-It, a web archiving service that allows librarians and others interested in saving web pages to create curated collections of valuable web content.  In 2010 we started archiving wide portions of the Internet on our own behalf.  Today, between our donating partners, thousands of librarians and archivists, and our own wide crawling efforts, we archive around one billion pages every week.  The Wayback Machine now contains more than 360 billion URL captures.


FTC.gov directed people to the Wayback Machine during the recent shut down of the U.S. federal government.

We have been serving archived web pages to the public via the Wayback Machine for twelve years now, and it is gratifying to see how this service has become a medium of record for so many.  Wayback pages are cited in papers, referenced in news articles and submitted as evidence in trials.  Now even the U.S. government relies on this web archive.

We’ve also had some problems to overcome.  This time last year the contents of the Wayback Machine were at least a year out of date.  There was no way for individuals to ask us to archive a particular page, so you could only cite an archived page if we already had the content.  And you had to know about the Wayback Machine and come to our site to find anything.  We have set out to fix those problems, and hopefully we can fix broken links all over the Internet as a result.

Up to date.  Newly crawled content appears in the Wayback Machine about an hour or so after we get it.  We are constantly crawling the Internet and adding new pages, and many popular sites get crawled every day.

Save a page. We have added the ability to archive a page instantly and get back a permanent URL for that page in the Wayback Machine. This service allows anyone (Wikipedia editors, scholars, legal professionals, students, or home cooks like me) to create a stable URL to cite, share or bookmark any information they want to still have access to in the future. Check out the new front page of the Wayback Machine and you’ll see the “Save Page” feature in the lower right corner.

Do we have it?  We have developed an Availability API that will let developers everywhere build tools to make the web more reliable.  We have built a few tools of our own as a proof of concept, but what we really want is to allow people to take the Wayback Machine out onto the web.
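
A minimal sketch of how a tool might ask the Availability API whether a page is archived. The endpoint `https://archive.org/wayback/available` and the `archived_snapshots.closest` response fields are assumptions based on the API’s public documentation and should be checked against it; the function names are hypothetical. The network call is left to the caller, so these helpers only build the query and read the JSON reply.

```python
import json
from urllib.parse import urlencode

# Assumed endpoint; verify against the Availability API documentation.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(url, timestamp=None):
    """Build the query URL asking whether `url` has an archived copy."""
    params = {"url": url}
    if timestamp:
        # Optional YYYYMMDD-style hint for the capture closest to a date.
        params["timestamp"] = timestamp
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return the archived URL from a JSON reply, or None if nothing is available."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


# Hand-written sample reply in the assumed response shape (not real data).
sample_reply = json.dumps({
    "archived_snapshots": {
        "closest": {
            "available": True,
            "url": "http://web.archive.org/web/20130919044612/http://example.com/",
            "timestamp": "20130919044612",
            "status": "200",
        }
    }
})
archived_url = closest_snapshot(sample_reply)
```

A link checker could fetch `availability_query(dead_link)` with any HTTP client and offer the returned URL as a replacement whenever `closest_snapshot` is not `None`.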

Fixing broken links.  We started archiving the web before Google, before YouTube, before Wikipedia, before people started to treat the Internet as the world’s encyclopedia. With all of the recent improvements to the Wayback Machine, we now have the ability to start healing the gaping holes left by dead pages on the Internet.  We have started by working with a couple of large sites, and we hope to expand from there.

WordPress.com is one of the top 20 sites in the world, with hundreds of millions of users each month.  We worked with Automattic to get a feed of new posts made to WordPress.com blogs and self-hosted WordPress sites.  We crawl the posts themselves, as well as all of their outlinks and embedded content – about 3,000,000 URLs per day.  This is great for archival purposes, but we also want to use the archive to make sure WordPress blogs are reliable sources of information.  To start with, we worked with Janis Elsts, a developer from Latvia who focuses on WordPress plugin development, to put suggestions from the Wayback into his Broken Link Checker plugin.  This plugin has been downloaded 2 million times, and now when his users find a broken link on their blog they can instantly replace it with an archived version.  We continue to work with Automattic to find more ways to fix or prevent dead links on WordPress blogs.

Wikipedia.org is one of the most popular information resources in the world with almost 500 million users each month.  Among their millions of amazing articles that all of us rely on, there are 125,000 of them right now with dead links.  We have started crawling the outlinks for every new article and update as they are made – about 5 million new URLs are archived every day.  Now we have to figure out how to get archived pages back into Wikipedia to fix some of those dead links.  Kunal Mehta, a Wikipedian from San Jose, recently wrote a prototype bot that can add archived versions to any link in Wikipedia so that when those links are determined to be dead the links can be switched over automatically and continue to work.  It will take a while to work this through the process the Wikipedia community of editors uses to approve bots, but that conversation is under way.

Every webmaster.  Webmasters can add a short snippet of code to their 404 page that will let users know if the Wayback Machine has a copy of the page in our archive – your web pages don’t have to die!

We started with a big goal: to archive the Internet and preserve it for history.  This year we started looking at the smaller goals: archiving a single page on request, making pages available more quickly, and letting you get information back out of the Wayback in an automated way.  We have spent 17 years building this amazing collection; let’s use it to make the web a better place.

Thank you so much to everyone who has helped to build such an outstanding resource, in particular:

Adam Miller
Alex Buie
Alexis Rossi
Brad Tofel
Brewster Kahle
Ilya Kreymer
Jackie Dana
Janis Elsts
Jeff Kaplan
John Lekashman
Kenji Nagahashi
Kris Carpenter
Kristine Hanna
Kunal Mehta
Martin Remy
Raj Kumar
Ronna Tanenbaum
Sam Stoller
SJ Klein
Vinay Goel

Posted in Announcements, News, Wayback Machine | 13 Comments

Free “404: File Not Found” Handler for Webmasters to Improve User Experience

The Internet Archive today is launching a free service to help webmasters improve their user experience by augmenting their website’s 404 Page Not Found page to link to the Wayback Machine whenever it has an archived copy. Users trying to get to any pages that might have been on a previous version of your website will now be given the option to go to the Wayback Machine.

To embed a link to the Wayback Machine on your site’s 404 pages, just include these two lines in your error page:

<div id="wb404"/>
<script src="https://archive.org/web/wb404.js"> </script>

If an archived page is not found, nothing will appear; if it is found, your user will see a link to the archived copy in the Wayback Machine.

For instance, the Internet Archive has installed this on its 404 error handling page.    We had a page, before 2004, that is still referenced on the web.   Now, instead of people getting a 404: File Not Found error, they get a page that includes a link to the page in the Wayback Machine.



Posted in Announcements, News | 11 Comments

NSA TV Clip Library

“When the American people find out how their government has secretly interpreted the Patriot Act, they are going to be stunned and they are going to be angry.”

- Senator Ron Wyden, May 26, 2011

Recent revelations of the extent of National Security Agency surveillance and weakening of our digital infrastructure give substance to the warnings of Senator Wyden and others. To assist journalists and other concerned citizens in reflecting on these issues, the Internet Archive has created a curated library of short television news clips presenting key statements and other representations.

NSA-issues TV News Quote Library

The experimental, Chrome and Safari only, library launches today with more than 700 chronologically ordered television citations drawn from the Archive’s television news research service. The TV quotes can be browsed by rolling over clip thumbnails, queried via transcripts and sorted for specific speakers. Citation links, context, links to source broadcasters and options to borrow can be explored by following the More/Borrow links on each thumbnail.

Posted in News, Television Archive, Video Archive | 2 Comments

NEW! The Internet Archive Store is open

The Internet Archive Hat

In the navigation bar on our blog is a new link, Internet Archive Store. We often receive requests for Internet Archive gear such as shirts, hats, etc. So, we decided to make it available to all the folks that want it. At store.archive.org you’ll find hats, sweatshirts, mugs, buttons and other items. We expect to add some new items in the coming weeks. Your purchases will, of course, be the envy of all your archive-want-to-be friends but you will also be helping the Internet Archive. Shipping is free on all items. And, you can even use bitcoins!

Internet Archive Coffee Mug

Please visit store.archive.org and maybe pick up a few things for the archivist in you.

Thank you.

The Internet Archive team

Posted in News | 2 Comments

Borrow Top Children’s Books

Recently the New York Public Library published a list of their choices for the top 100 children’s books from the last 100 years. We took a look at the books we have available via Open Library, and it turns out we have scanned a lot of them! There is no cost to borrow books through Open Library; you just need a free account. Check out the FAQ if you need help.

If a book on this list isn’t linked, that means we don’t have a digital copy yet.  If you would like to donate the physical book to us, we can add it to the queue of books to be scanned and it will become available in the future.

You may also be interested in the freely downloadable books from the Children’s Library collection on archive.org.

Books with the lock icon (shown in this list as "lockicon") are only available to the print disabled.

A
Alexander and the Terrible, Horrible, No Good, Very Bad Day. Judith Viorst. Illus. by Ray Cruz. (1972)
All-of-a-Kind Family. Sydney Taylor. Illus. by Helen John. (1951)
Amelia Bedelia. Peggy Parish. Illus. by Fritz Siebel. (1963)
The Arrival. Shaun Tan. (2007)

lockiconBark, George. Jules Feiffer. (1999)
lockiconBecause of Winn-Dixie. Kate DiCamillo. (2000)
Ben’s Trumpet. Rachel Isadora. (1979)
Big Red Lollipop. Rukhsana Khan. Illus. by Sophie Blackall. (2010)
The Birchbark House. Louise Erdrich. (1999)
The Book of Three. Lloyd Alexander. (1964)
The Borrowers. Mary Norton. Illus. by Beth Krush and Joe Krush. (1953)
El Gallo De Bodas: A Traditional Cuban Folktale. Lucía M. González. Illus. by Lulu Delacre. (1994)
Bread and Jam for Frances. Russell Hoban. Illus. by Lillian Hoban. (1964)
Bridge to Terabithia. Katherine Paterson. (1977)
Brown Bear, Brown Bear, What Do You See? Bill Martin, Jr. Illus. by Eric Carle. (1967)

C
lockiconCaps for Sale. Esphyr Slobodkina. (1938)
lockiconThe Cat in the Hat. Dr. Seuss. (1957)
lockiconChains. Laurie Halse Anderson. (2008)
A Chair For My Mother. Vera B. Williams. (1982)
Charlotte’s Web. E.B. White. Illus. by Garth Williams. (1952)
Chato’s Kitchen. Gary Soto. Illus. by Susan Guevara. (1995)
lockiconChicka Chicka Boom Boom. Bill Martin, Jr. and John Archambault. Illus. by Lois Ehlert. (1989)
Corduroy. Don Freeman. (1976)
Curious George. H.A. Rey. (1941)

D
D’Aulaires’ Book of Greek Myths. Ingri D’Aulaire and Edgar Parin D’Aulaire. (1962)
lockiconDon’t Let the Pigeon Drive the Bus! Mo Willems. (2003)

lockiconEsperanza Rising. Pam Muñoz Ryan. (2000)

Freight Train. Donald Crews. (1978)
Frog and Toad Are Friends. Arnold Lobel. (1970)
From the Mixed-Up Files of Mrs. Basil E. Frankweiler. E.L. Konigsburg. (1967)

G
George and Martha. James Marshall. (1972)
The Giver. Lois Lowry. (1993)
Go, Dog. Go! P.D. Eastman. (1961)
lockiconGoodnight Moon. Margaret Wise Brown. Illus. by Clement Hurd. (1947)
Grandfather’s Journey. Allen Say. (1993)
lockiconThe Graveyard Book. Neil Gaiman. Illus. by Dave McKean. (2008)
Green Eggs and Ham. Dr. Seuss. (1960)

lockiconHarold and the Purple Crayon. Crockett Johnson. (1955)
Harriet the Spy. Louise Fitzhugh. (1964)
Harry Potter and the Sorcerer’s Stone. J.K. Rowling. (1998)
Hatchet. Gary Paulsen. (1989)
The Hobbit. J.R.R. Tolkien. (1937)
Holes. Louis Sachar. (1998)

lockiconThe Invention of Hugo Cabret. Brian Selznick. (2007)

Joseph Had a Little Overcoat. Simms Taback. (1999)
Jumanji. Chris Van Allsburg. (1981)
Just a Minute: A Trickster Tale and Counting Book. Yuyi Morales. (2003)

L
Lilly’s Purple Plastic Purse. Kevin Henkes. (1996)
lockiconThe Lion and the Mouse. Jerry Pinkney. (2009)
The Lion, the Witch and the Wardrobe. C.S. Lewis. (1950)
The Little House. Virginia Lee Burton. (1942)
The Little Prince. Antoine de Saint-Exupéry. (1943)
lockiconLocomotion. Jacqueline Woodson. (2003)
Lon Po Po: A Red-Riding Hood Story From China. Ed Young. (1989)

Madeline. Ludwig Bemelmans. (1939)
Make Way for Ducklings. Robert McCloskey. (1941)
Matilda. Roald Dahl. Illus. by Quentin Blake. (1988)
Meet Danitra Brown. Nikki Grimes. Illus. by Floyd Cooper. (1994)
Millions of Cats. Wanda Gág. (1928)
Miss Nelson is Missing! Harry Allard. Illus. by James Marshall. (1977)
Mr. Popper’s Penguins. Richard and Florence Atwater. Illus. by Robert Lawson. (1938)
Mrs. Frisby and the Rats of NIMH. Robert C. O’Brien. (1971)
Mufaro’s Beautiful Daughters: An African Tale. John Steptoe. (1987)
My Father’s Dragon. Ruth Stiles Gannett. Illus. by Ruth Chrisman Gannett. (1948)
lockiconMy Name is Yoon. Helen Recorvits. Illus. by Gabi Swiatkowska. (2003)

lockiconOlivia. Ian Falconer. (2000)
One Crazy Summer. Rita Williams-Garcia. (2010)

lockiconThe People Could Fly: American Black Folktales. Virginia Hamilton. Illus. by Leo/Diane Dillon. (1985)
The Phantom Tollbooth. Norton Juster. Illus. by Jules Feiffer. (1961)
Pierre: A Cautionary Tale in Five Chapters and a Prologue. Maurice Sendak. (1962)
Pink and Say. Patricia Polacco. (1994)
Pippi Longstocking. Astrid Lindgren. (1950)

Ramona the Pest. Beverly Cleary. (1968)
lockiconRickshaw Girl. Mitali Perkins. Illus. by Jamie Hogan. (2007)
Roll of Thunder, Hear My Cry. Mildred D. Taylor. (1976)
Rumpelstiltskin. Paul O. Zelinsky. (1986)

A Sick Day for Amos McGee. Philip Stead. Illus. by Erin E. Stead. (2010)
The Snowy Day. Ezra Jack Keats. (1962)
Starry River of the Sky. Grace Lin. (2012)
The Stories Julian Tells. Ann Cameron. Illus. by Ann Strugnell. (1981)
The Story of Ferdinand. Munro Leaf. Illus. by Robert Lawson. (1936)
Strega Nona. Tomie dePaola. (1975)
Swimmy. Leo Lionni. (1963)
Sylvester and the Magic Pebble. William Steig. (1969)

T
Tales of a Fourth Grade Nothing. Judy Blume. (1972)
The Tales of Uncle Remus: The Adventures of Brer Rabbit. Julius Lester. Illus. by Jerry Pinkney. (1987)
Tar Beach. Faith Ringgold. (1991)
Ten, Nine, Eight. Molly Bang. (1983)
Tomie dePaola’s Mother Goose. Tomie dePaola. (1985)
The True Story of the Three Little Pigs. Jon Scieszka. Illus. by Lane Smith. (1989)
Tuesday. David Wiesner. (1991)

The Very Hungry Caterpillar. Eric Carle. (1969)

W
The Watsons Go to Birmingham—1963. Christopher Paul Curtis. (1995)
The Westing Game. Ellen Raskin. (1978)
lockiconWhen You Reach Me. Rebecca Stead. (2009)
lockiconWhere Is the Green Sheep? Mem Fox. Illus. by Judy Horacek. (2004)
lockiconWhere the Wild Things Are. Maurice Sendak. (1963) – or borrow in Spanish
Why Mosquitoes Buzz in People’s Ears. Verna Aardema. Illus. by Leo/Diane Dillon. (1975)
Winnie-the-Pooh. A.A. Milne. Illus. by Ernest H. Shepard. (1926)
A Wrinkle in Time. Madeleine L’Engle. (1962)


Posted in Books Archive | Comments Off

Celebrate at the Internet Archive — 1024 — Thursday Oct. 24th

(Full Video of event.   Announcement as posts: 1, 2, and 3)

Internet Archive invites you to a fun evening in San Francisco on October 24th for our once-a-year celebration and announcements of new services. (And it just so happens to fall on 1024, which our fellow geeks will recognize as 2^10.)  We will drink and be merry with our friends, then gather together to tell you about the new steps we’re taking to guarantee permanent, free access to the world’s knowledge.  

No More 404s

October 24, 2013
Free Admission, Donations Welcome
6pm – 7pm : Cocktails and Reception
7pm – 8pm : Announcements
300 Funston Ave., San Francisco CA 94118


Please RSVP - we don’t want to run out of wine!

Some of the things we’ll share include:

  • No more broken links. Help wipe out dead links on the Internet with new tools and APIs to replace dead links with archived versions.  Down with 404s!
  • Quotable Television News.  A new interface for the TV News Research Service will help journalists, bloggers and your news-addicted relatives search, quote short clips and borrow from a massive, searchable archive of U.S. television news programs.
  • Reader Privacy for All.  We are helping to protect the reading habits of our users from prying eyes by increasing encryption and keeping less user data.
  • Bringing Old Software Back to Life.  First steps to bring software for the Apple II, Commodore 64, etc. back from cassette and onto the web.
  • Petabytes, Gigabits, and More.  Come see for yourself!
Posted in Announcements, News | 27 Comments

Blacked Out Government Websites Available Through Wayback Machine


(from the Internet Archive’s Archive-It group: Announcing the first ever Archive-It US Government Shutdown Notice Awards!)

Congress has caused the U.S. federal government to shut down and important websites have gone dark.  Fortunately, we have the Wayback Machine to help.

Many government sites are displaying messages saying that they are not being updated or maintained during the government shutdown, but the following are some that have completely shut their doors today. Clicking the logos will take you to a Wayback Machine archived capture of the site. Please donate to help us keep the government websites available. You can also suggest pages for us to archive so that we can document the shutdown.

National Oceanic and Atmospheric Administration
National Park Service
Library of Congress
National Science Foundation
Federal Communications Commission
Bureau of the Census
U.S. Department of Agriculture
United States Geological Survey
U.S. International Trade Commission
Federal Trade Commission
National Aeronautics and Space Administration
International Trade Administration
Corporation for National and Community Service


Posted in Announcements, News, Wayback Machine | 32 Comments

Over 7,000 Free Audio Books: Librivox and its New Look!

In 2005, Hugh McGuire asked:
“Can the net harness a bunch of volunteers to help bring books in the public domain to life through podcasting?”

The answer is yes. Thanks to the help of many, LibriVox, the nonprofit organization he leads, has made tremendous progress in producing and distributing free audiobooks of public domain work.

The LibriVox site has recently undergone a major facelift, making it far easier to browse and find great public domain audiobooks. In addition, the underlying software that helps thousands of volunteers contribute to LibriVox has been completely rebuilt. This rebuild project was funded by the Andrew W. Mellon Foundation and donations from the public. LibriVox continues to use the Internet Archive to host all its audio and web infrastructure.

Thanks to:

The thousands of volunteer readers who bring over 100 new books a month, drawn from Project Gutenberg and other public domain sources (including, of course, the Internet Archive), to the listening public.

With over 7,000 audio books, LibriVox is one of the largest publishers of audiobooks in the world, and certainly the largest publisher of free public domain audiobooks.

The millions of listeners who download over three million LibriVox audiobooks every month.

The Andrew W. Mellon Foundation, and Don Waters at their Scholarly Communications and Information Technology programme, for providing funding for the revamp of the LibriVox website, and underlying technology that runs the project.

Free Hosting by the Internet Archive.

Pro bono legal services from Diana Szego of Orrick, Herrington & Sutcliffe.

And the relentless good cheer of Hugh McGuire (@hughmcguire), who over the last eight years has created this fabulous service and continued to make contributions to open (e)book publishing with PressBooks.com.

Please donate!
This project needs ongoing support for servers and software upgrades.

Posted in Announcements, Audio Archive, News | 12 Comments