Tag Archives: internet archive

Celebrating 100 million tasks (uploading and modifying archive.org content)

Just over 8-1/2 years ago, I wrote a multi-process daemon in PHP that we refer to as “catalogd”.  It runs 24 hours a day, 7 days a week, no rest!

It is in charge of uploading all content to our archive.org servers, and all changes to uploaded files.

We recently passed the 100 millionth “task” (upload or edit to an archive “item”).

After starting with a modest 100 or so tasks/day, we currently run nearly 100,000 tasks/day.  We’ve done some minor scaling, but of the most part, the little daemon has become our little daemon that could!

Here’s to the next 100 million tasks at archive.org!


We’ve dropped the www.! Our preferred/canonical url is now http://archive.org

Dear Patrons,

Last Thursday we pushed out changes to drop the “www.” prefix from our urls
so that we have the newer/shorter style urls start like:


We intend to keep this change permanently.

We know there will be a few minor breaks here and there especially from some third-party applications that might not handle “301 Moved Temporarily” redirects (if you have something flash-based that needs http://www.archive.org/crossdomain.xml we caught that breakage and that url still works now (that is, it can be either requested either with or without the lead “www.” as an exception now). We’re happy to work with anyone having issues — feel free to reply to this post and let us know.

Best wishes, and now go spend those four characters saved on something fun 😉

Open Library Buying e-Books from Publishers

The Internet Archive is on campaign to buy e-Books from publishers and authors; making more digital books available to readers who prefer using laptops, reading devices or library computers.  Publishers such as Smashwords, Cursor and A Book Apart have already contributed e-Books to OpenLibrary.org – offering niche titles and the works of best-selling “indy” authors including Amanda Hocking and J.A. Konrath.

“Libraries are our allies in creating the best range of discovery mechanisms for writers and readers—enabling open and browser-based lending through the OpenLibrary.org means more books for more readers, and we’re thrilled to do our part in achieving that.” – Richard Nash, founder of Cursor.

American libraries spend $3-4 billion a year on publisher’s materials.  OpenLibrary.org and its more than 150 partnering libraries around the US and the world are  leading the charge to increase their combined digital book catalog of 80,000+ (mostly 20th century) and 2 million+ older titles.

“As demand for e-Books increases, libraries are looking to purchase more titles to provide better access for their readers.” – Digital Librarian Brewster Kahle, Founder of the Internet Archive.

This new twist on the traditional lending model promises to increase e-book use and revenue for publishers. OpenLibrary.org offers an e-Book lending library and digitized copies of classics and older books as well as books in audio and DAISY formats for those qualified readers.

Brewster Kahle receives the Zoia Horn Intellectual Freedom Award

Brewster Kahle and Zoia Horn

On December 17, 2010 Brewster Kahle received the Zoia Horn Intellectual Freedom Award for successfully challenging a National Security Letter (NSL) issued by the FBI that demanded personal information about a user of  Internet Archive’s site, archive.org.

You can see the award presentation and hear Brewster recount the entire ordeal.

A number of articles were written about it at the time including:

FBI Backs Off From Secret Order for Data After Lawsuit

Brewster Kahle offers a cookbook for fighting security letters

From the articles:
“What we wanted to do out of this was to leave a very public cookbook for how to push back. That was our goal in our negotiations with the FBI. We would not have settled without being able to talk about what the letters look like, how to push back and who to call.” -Brewster Kahle

Zoia Horn presented the award and spoke of her own ordeal as the first librarian to be jailed for refusing to divulge information that violated her belief in intellectual freedom during the 1972 conspiracy trial of the “Harrisburg Seven” anti-war activists.

You can also see photos of the lunch event at Internet Archive prior to the presentation in the great hall.

-Jeff Kaplan

A Virtual Dogear: Using Bookmarks on the Archive

With so much information packed into the Internet Archive, it’s often hard to remember what your favorite items are or to go back to an item that you haven’t finished reading/watching/listening to. A simple way to keep track of your favorites is to use the bookmark feature, which is found on the left hand side of each item under “Resources.”

Clicking here will flag the item and allow you to keep everything you love on the Archive in one spot. You can create your own collection to refer back to or share with friends who may be new to the Archive or looking for some fresh material. You can also use the Bookmark Explorer to see some of the most recent and popular bookmarked items on the Archive.

My bookmarked items can be found at this page. Feel free to share your own bookmarks here!

–Cara Binder

Bookmark and Share