<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Internet Archive Blogs</title>
	<atom:link href="http://blog.archive.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.archive.org</link>
	<description>A blog from the Collections Team at archive.org</description>
	<lastBuildDate>Sat, 12 May 2012 13:40:24 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>&#8220;Many Libraries: As the world&#8217;s books go online, we must resist centralization&#8221;  Technology Review, published by MIT</title>
		<link>http://blog.archive.org/2012/05/11/many-libraries-as-the-worlds-books-go-online-we-must-resist-centralization-technology-review-published-by-mit/</link>
		<comments>http://blog.archive.org/2012/05/11/many-libraries-as-the-worlds-books-go-online-we-must-resist-centralization-technology-review-published-by-mit/#comments</comments>
		<pubDate>Fri, 11 May 2012 17:45:25 +0000</pubDate>
		<dc:creator>junegoldsmith</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4709</guid>
		<description><![CDATA[The Internet has put universal access to knowledge within our grasp. Now we need to put all of the world&#8217;s literature online. This is easier to do than it might seem, if we resist the impulse to centralize and build &#8230; <a href="http://blog.archive.org/2012/05/11/many-libraries-as-the-worlds-books-go-online-we-must-resist-centralization-technology-review-published-by-mit/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>The Internet has put universal access to knowledge within our grasp. Now we need to put all of the world&#8217;s literature online. This is easier to do than it might seem, if we resist the impulse to centralize and build only a few monolithic libraries.</p>
<p>Centralization can lead to price controls, censorship without due process, lack of reader privacy, and resistance to innovators. We need lots of publishers, booksellers, authors, and readers—and lots of libraries. If many actors work together, we can have a robust, distributed publishing and library system, possibly resembling the World Wide Web.</p>
<p>The courts struck down as monopolistic an attempt by top libraries and Google to build a massive e-book collection. They proposed a collective licensing system, the Book Rights Registry, that would have the right to license exclusively to Google any book not claimed by an author or publisher. It would have limited options for readers. Now some proponents of the nonprofit Digital Public Library of America (DPLA) are encouraging legislative action that we fear might lead to a similar collective licensing approach (see &#8220;The Library of Utopia.&#8221;)</p>
<p>We could be helping nurture the seeds of a distributed library and publishing system—seeds that have already been planted.</p>
<p>All libraries could lend e-books, just as they lend physical books, avoiding a dependency on centralized databases. Libraries are already buying as many e-books as they can, and even small libraries can offer large collections: a single hard drive can hold over 150,000 books (as searchable color PDFs) and their catalogue data. It is not difficult to lend books digitally, as technologies used by Netflix and Amazon show.</p>
<p>Patrons of thousands of libraries can already borrow over 200,000 purchased and scanned e-books free from the Internet Archive. Most large publishers have recently banned e-book sales to libraries, but we hope this restriction is temporary. Even as we acquire current e-books, we need to scan existing ones, but again, this work is already under way. We scan 1,000 books a day at 31 libraries in seven countries with funding from libraries and foundations.</p>
<p>Scanning centers such as those at the Boston Public Library and the Library of Congress digitize hundreds of books a day. Libraries working with the Internet Archive have already put over two million public-domain books online for free downloading and lending, and for use by people unable to read printed books.</p>
<p>Now is our chance to build an online library accessible to all. To equal the Boston Public Library or university libraries like those at Yale or Princeton, we need 10 million books. These could be acquired in four years for approximately $160 million. The DPLA, with its broad support, can help build this library system, or it could end up building an overly centralized library by using collective licensing systems like the Book Rights Registry. If we work together, we can achieve universal access to knowledge by building on the positive lessons of the Internet and World Wide Web.</p>
<p>Brewster Kahle is the founder of the Internet Archive. Rick Prelinger is an archivist, writer, and filmmaker.</p>
<p><a href="http://www.technologyreview.com/article/40261/">http://www.technologyreview.com/article/40261/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/05/11/many-libraries-as-the-worlds-books-go-online-we-must-resist-centralization-technology-review-published-by-mit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>We want to buy your books!  Internet Archive Letter to Publishers</title>
		<link>http://blog.archive.org/2012/05/02/we-want-buy-your-books-internet-archive-letter-to-publishers/</link>
		<comments>http://blog.archive.org/2012/05/02/we-want-buy-your-books-internet-archive-letter-to-publishers/#comments</comments>
		<pubDate>Wed, 02 May 2012 23:09:39 +0000</pubDate>
		<dc:creator>peter</dc:creator>
				<category><![CDATA[Books Archive]]></category>
		<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4629</guid>
		<description><![CDATA[Thank you for your willingness to invest in the future of publishing and readership. Libraries and publishers have a lot in common: we connect writers with readers which promotes literacy, scholarship, and citizenship. We want to buy more digital books &#8230; <a href="http://blog.archive.org/2012/05/02/we-want-buy-your-books-internet-archive-letter-to-publishers/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><strong></strong>Thank you for your willingness to invest in the future of publishing and readership. Libraries and publishers have a lot in common: we connect writers with readers which promotes literacy, scholarship, and citizenship.</p>
<p>We want to buy more digital books from you.</p>
<p>We currently buy, lend, and preserve eBooks from publishers and booksellers, but we have not found many eBooks for sale at any price.  The Internet Archive is running standard protection systems to lend eBooks from our servers through our websites, <a href="http://openlibrary.org">openlibrary.org</a> and <a href="http://archive.org">archive.org</a>.   In this way, we strive to provide a seamless experience for our library patrons that replicates a traditional library check-out model, but now with eReaders and searching.</p>
<p>By buying eBooks from you, we hope to continue the productive relationship between libraries and publishers. By respecting the rights and responsibilities that have evolved in the physical era, we believe we will all know how to act: one patron at a time, restrictions on copying, re-format for enduring access, and long term preservation.</p>
<p>We understand these are early days, and prices will evolve.  What we would like to do, however, is not lose the relationship libraries have built up with publishers just because we are now buying and lending electronic books rather than physical ones.</p>
<p>Our checkbook is open.   Please sell to us.</p>
<p>Sincerely,</p>
<p><a href="mailto:brewster@archive.org">Brewster Kahle</a></p>
<p>Digital Librarian, Internet Archive</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><em>(inspired by one from Douglas County&#8217;s in Colorado)</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/05/02/we-want-buy-your-books-internet-archive-letter-to-publishers/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Improved theora/ogg video derivatives!</title>
		<link>http://blog.archive.org/2012/05/01/improved-theoraogg-video-derivatives/</link>
		<comments>http://blog.archive.org/2012/05/01/improved-theoraogg-video-derivatives/#comments</comments>
		<pubDate>Tue, 01 May 2012 21:02:43 +0000</pubDate>
		<dc:creator>traceypooh</dc:creator>
				<category><![CDATA[Technical]]></category>
		<category><![CDATA[Video Archive]]></category>
		<category><![CDATA[derivatives]]></category>
		<category><![CDATA[ffmpeg]]></category>
		<category><![CDATA[movies]]></category>
		<category><![CDATA[theora]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4685</guid>
		<description><![CDATA[We&#8217;ve made our ogg video derivatives slightly better via: minor bump up to &#8220;thusnelda&#8221; release &#8220;upgrade&#8221; from 1-pass video encoding to 2-pass video encoding direct ffmpeg creation of the video (you&#8217;ll need to re/compile ffmpeg minimally with &#8220;&#8211;enable-libtheora &#8211;enable-libvorbis&#8221; configure &#8230; <a href="http://blog.archive.org/2012/05/01/improved-theoraogg-video-derivatives/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve made our ogg video derivatives slightly better via:</p>
<ul>
<li>minor bump up to &#8220;thusnelda&#8221; release</li>
<li>&#8220;upgrade&#8221; from 1-pass video encoding to 2-pass video encoding</li>
<li>direct ffmpeg creation of the video (you&#8217;ll need to re/compile ffmpeg minimally with &#8220;&#8211;enable-libtheora &#8211;enable-libvorbis&#8221; configure flags)</li>
</ul>
<p><code></p>
<p>ffmpeg -y -i ‘camels.avi’ -q:vscale 3 -b:v 512k -vcodec libtheora -pix_fmt yuv420p -vf yadif,scale=400:300 -r 20 -threads 2 -map_metadata -1,g:0,g -pass 1 -an -f null /dev/null;</p>
<p>ffmpeg -y -i ‘camels.avi’ -q:vscale 3 -b:v 512k -vcodec libtheora -pix_fmt yuv420p -vf yadif,scale=400:300 -r 20 -threads 2 -map_metadata -1,g:0,g -pass 2 -map 0:0 -map 0:1 -acodec libvorbis -ac 2 -ab 128k -ar 44100 -metadata TITLE=’Camels at a Zoo’ -metadata LICENSE=’http://creativecommons.org/licenses/by-nc/3.0/’ -metadata DATE=’2004′ -metadata ORGANIZATION=’Dumb Bunny Productions’ -metadata LOCATION=http://archive.org/details/camels camels.ogv</p>
<p></code></p>
<p>some notes:</p>
<ul>
<li>You&#8217;d want to adjust the &#8220;scale=WIDTH:HEIGHT&#8221; accordingly, as well as the &#8220;-r FRAMES-PER-SECOND&#8221; related args, to your source video.</li>
<li>I made a small patch to allow *both* bitrate target *and* quality level for theora in ffmpeg, after comparing the other popular tool &#8220;ffmpeg2theora&#8221; code with the libtheoraenc.c inside ffmpeg.  It may not be necessary, but I believe I saw *slightly* better quality coming out of theora/thusnelda ogg video.  For what it&#8217;s worth, my minor patch is here:  <a href="http://archive.org/~tracey/downloads/patches/ffmpeg-theora.patch">http://archive.org/~tracey/downloads/patches/ffmpeg-theora.patch</a></li>
<li>The way we compile ffmpeg (ubuntu/linux) is <a href="http://archive.org/~tracey/downloads/ffmpeg-README.txt">here</a>.  (Alt MacOS version <a href="http://archive.org/~tracey/downloads/macff.sh.txt">here</a> )</li>
<li>(Edited post above after I removed this step) <strike>It&#8217;s *quite* odd, I realize to have ffmpeg transcode both the audio/video together, only to split/demux them back out temporarily.  However, for some videos, the &#8220;oggz-comment&#8221; step would wipe out the first video keyframe and cause unplayability in chrome (and the expected visual artifacts for things that could play it).   So, we split, comment the audio track, then re-stitch it back together.</strike></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/05/01/improved-theoraogg-video-derivatives/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Downloading in bulk using wget</title>
		<link>http://blog.archive.org/2012/04/26/downloading-in-bulk-using-wget/</link>
		<comments>http://blog.archive.org/2012/04/26/downloading-in-bulk-using-wget/#comments</comments>
		<pubDate>Thu, 26 Apr 2012 01:28:09 +0000</pubDate>
		<dc:creator>internetarchive</dc:creator>
				<category><![CDATA[Technical]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4643</guid>
		<description><![CDATA[If you&#8217;ve ever wanted to download files from many different archive.org items in an automated way, here is one method to do it. BEFORE YOU BEGIN You will need to have wget installed (it&#8217;s free!), and it helps to have &#8230; <a href="http://blog.archive.org/2012/04/26/downloading-in-bulk-using-wget/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>If you&#8217;ve ever wanted to download files from many different archive.org items in an automated way, here is one method to do it.</p>
<p><strong>BEFORE YOU BEGIN</strong><br />
You will need to have <a href="http://www.gnu.org/software/wget/" target="_blank">wget</a> installed (it&#8217;s free!), and it helps to have some understanding of basic <a href="http://mally.stanford.edu/~sr/computing/basic-unix.html" target="_blank">unix commands</a>.  It may also help to read <a href="http://blog.archive.org/2011/03/31/how-archive-org-items-are-structured/" target="_blank">How Archive.org Items Are Structured</a> so that you understand terminology used here.</p>
<p>The basic method for using wget to download files is:</p>
<ol>
<li>Generate a list of item identifiers from which you wish to grab files</li>
<li>Create a directory to hold the downloaded files</li>
<li>Construct your wget command to retrieve the appropriate files</li>
<li>Run the command and wait for it to finish</li>
</ol>
<p>We strongly recommend trying this process with ONE identifier first to make sure you get what you want as output before you try to download files from many items.</p>
<p><strong>Step 1: Generate list of identifiers</strong></p>
<p>The list of identifiers should be in plain text and should contain one identifier per line with no extraneous characters or spaces.  You can name it anything you&#8217;d like, but I will use the name<em> itemlist.txt</em> in all examples here.</p>
<p>How do you get a list of identifiers?  Of course you can just paste or type them into a file individually if you&#8217;d like, but if you are working with large numbers of items this will be impractical.  The easiest way to get a list is to use <a href="http://archive.org/advancedsearch.php" target="_blank">advanced search</a> to return a csv file.</p>
<p>Determine your search query using the search engine.  In this example, I am looking for items in the Prelinger collection with the subject &#8220;Health and Hygiene.&#8221;  There are currently 41 items that match <a href="http://archive.org/search.php?query=collection%3Aprelinger%20AND%20subject%3A%22Health%20and%20hygiene%22" target="_blank">this query</a>.  Once you&#8217;ve figured out your query:</p>
<ol>
<li>Go to the advanced search page and paste your query into the query box of the section titled &#8220;Advanced Search returning JSON, XML, and more.&#8221;</li>
<li>Choose &#8220;identifier&#8221; from the list of fields to return.</li>
<li>Optionally sort the results (sorting by identifier is handy)</li>
<li>Enter a number into the &#8220;Number of results&#8221; box that matches (or is higher than) the number of results your query returns</li>
<li>Choose the &#8220;CSV format&#8221; radio button.</li>
<li>Click the search button (may take a while depending on how many results you have)</li>
</ol>
<p><img class="aligncenter size-full wp-image-4646" title="advancedsearch" src="http://blog.archive.org/wp-content/uploads/2012/04/advancedsearch.png" alt="Advanced Search" width="759" height="485" /></p>
<p>An alert box will ask if you want your results &#8211; click &#8220;OK&#8221; to proceed.  A file called search.csv will be downloaded to your default download location (often your Desktop or your Downloads folder).  The contents of the CSV will look like this:</p>
<blockquote><p>&#8220;identifier&#8221;<br />
&#8220;AboutFac1941&#8243;<br />
&#8220;Attitude1949&#8243;<br />
&#8220;BodyCare1948&#8243;<br />
&#8220;Cancer_2&#8243;<br />
&#8220;Careofth1949&#8243;<br />
&#8220;Careofth1951&#8243;<br />
&#8220;CityWate1941&#8243;</p></blockquote>
<p>You&#8217;ll need to remove the first heading line &#8220;identifier&#8221; and remove the double quotes from each line.  The easiest way to do this is to open the file in Excel or a similar spreadsheet program, copy the column of identifiers (minus the heading), and paste it into a text program like TextWrangler or Notepad.  Save your new text file as itemlist.txt (or whatever name you prefer).  The contents of the file should now look like this:</p>
<blockquote><p>AboutFac1941<br />
Attitude1949<br />
BodyCare1948<br />
Cancer_2<br />
Careofth1949<br />
Careofth1951<br />
CityWate1941</p></blockquote>
<p>You can use this advanced search method to create lists of thousands of identifiers, although we don&#8217;t recommend using it to retrieve more than 10,000 or so items at once (it will time out at a certain point).</p>
<p><strong>Step 2: Create a directory for downloaded files</strong></p>
<p>Open a terminal emulator, such as Terminal (on Mac) or Cygwin (on Windows), and navigate to the location where you&#8217;d like to store your downloaded files.  Make sure there is sufficient storage space here.  You may want to store files on an external drive, in a folder on your Desktop, etc.  This example assumes you want to create a folder on your Desktop.</p>
<ol>
<li>Change directories (cd) to your Desktop</li>
<li>Make sure itemlist.txt is on your Desktop</li>
<li>Create a new folder (directory) for your downloads called archivedownloads (mkdir archivedownloads) &#8211; this folder can be called anything you like</li>
<li>cd archivedownloads</li>
</ol>
<p><strong>Step 3: Create wget command</strong></p>
<p>Your wget command will look something like this:</p>
<blockquote><p>wget -r -H -nc -np -nH &#8211;cut-dirs=2 -e robots=off -i ../itemlist.txt -B &#8216;http://www.archive.org/download/&#8217;</p></blockquote>
<p>Explanation of each option in the wget command:</p>
<p style="padding-left: 60px;">-r<br />
recursive download; required in order to move from the item identifier down into its individual files</p>
<p style="padding-left: 60px;">-H<br />
enable spanning across hosts when doing recursive retrieving (the initial URL for the directory will be on www.archive.org, and the individual file locations will be on a specific datanode)</p>
<p style="padding-left: 60px;">-nc<br />
no clobber; if a local copy already exists of a file, don&#8217;t download it again (useful if you have to restart the wget at some point, as it avoids re-downloading all the files that were already done during the first pass)</p>
<p style="padding-left: 60px;">-np<br />
no parent; ensures that the recursion doesn&#8217;t climb back <strong>up</strong> the directory tree to other items (by, for instance, following the &#8220;../&#8221; link in the directory listing)</p>
<p style="padding-left: 60px;">-nH<br />
no host directories; when using -r, wget will create a directory tree to stick the local copies in, starting with the hostname ({datanode}.us.archive.org/), unless -nH is provided</p>
<p style="padding-left: 60px;">&#8211;cut-dirs=2<br />
completes what -nH started by skipping the hostname; when saving files on the local disk (from a URL like<a href="http://%7Bdatanode%7D.us.archive.org/%7Bdrive%7D/items/%7Bidentifier%7D/%7Bidentifier%7D.pdf" target="_top">http://{datanode}.us.archive.org/{drive}/items/{identifier}/{identifier}.pdf</a>), skip the /{drive}/items/ portion of the URL, too, so that all {identifier} directories appear together in the current directory, instead of being buried several levels down in multiple {drive}/items/ directories</p>
<p style="padding-left: 60px;">-e robots=off<br />
archive.org datanodes contain robots.txt files telling robotic crawlers not to traverse the directory structure; in order to recurse from the directory to the individual files, we need to tell wget to ignore the robots.txt directive</p>
<p style="padding-left: 60px;">-i ../itemlist.txt<br />
location of input file listing all the URLs to use; &#8220;../itemlist&#8221; means the list of items should appear one level up in the directory structure, in a file called &#8220;itemlist.txt&#8221; (you can call the file anything you want, so long as you specify its actual name after -i)</p>
<p style="padding-left: 60px;">-B &#8216;http://www.archive.org/download/&#8217;<br />
base URL; gets prepended to the text read from the -i file (this is what allows us to have just the identifiers in the itemlist file, rather than the full URL on each line)</p>
<p> Additional options that may be needed sometimes:</p>
<p style="padding-left: 60px;">-A<br />
-R<br />
accept-list and reject-list, either limiting the download to certain kinds of file, or excluding certain kinds of file; for instance, -R _orig_jp2.tar,_jpg.pdf would download all files except those whose names end with _orig_jp2.tar or _jpg.pdf, and -A &#8220;*zelazny*&#8221; -R .ps would download all files containing zelazny in their names, except those ending with .ps. See <a href="http://www.gnu.org/software/wget/manual/html_node/Types-of-Files.html" target="_top">http://www.gnu.org/software/wget/manual/html_node/Types-of-Files.html</a> for a fuller explanation.</p>
<p> <strong>Step 4: Run the command</strong></p>
<p>Run the command from within the directory or folder you created to hold the downloaded files.  You will see your progress on the screen.  If you have sorted your itemlist.txt alphabetically, you can estimate how far through the list you are based on the screen output.</p>
<p>Depending on how many files you are downloading and their size, it may take quite some time for this command to finish running.</p>
<p><strong>Tips:</strong></p>
<ul>
<li>You can terminate the command by pressing &#8220;control&#8221; and &#8220;c&#8221; on your keyboard simultaneously while in the terminal window.</li>
<li>If your command will take a while to complete, make sure your computer is set to never sleep and turn off automatic updates.</li>
<li>If you think you missed some items (e.g. due to machines being down), you can simply rerun the command after it finishes.  The &#8220;no clobber&#8221; option in the command will prevent already retrieved files from being overwritten, so only missed files will be retrieved.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/04/26/downloading-in-bulk-using-wget/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Internet Hall of Fame</title>
		<link>http://blog.archive.org/2012/04/24/internet-hall-of-fame/</link>
		<comments>http://blog.archive.org/2012/04/24/internet-hall-of-fame/#comments</comments>
		<pubDate>Tue, 24 Apr 2012 08:58:33 +0000</pubDate>
		<dc:creator>brewster</dc:creator>
				<category><![CDATA[Announcements]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4610</guid>
		<description><![CDATA[(brewster is in fashionable white) Brewster Kahle is honored to be inducted today into the newly formed Internet Hall of Fame administrated by the Internet Society.  As the Society that administers much of the Internet turns 20 years old, they have &#8230; <a href="http://blog.archive.org/2012/04/24/internet-hall-of-fame/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.archive.org/wp-content/uploads/2012/04/internet-hall-of-fame-group1.jpg"><img class="aligncenter size-large wp-image-4626" title="internet-hall-of-fame-group" src="http://blog.archive.org/wp-content/uploads/2012/04/internet-hall-of-fame-group1-1024x433.jpg" alt="" width="640" height="270" /></a>(brewster is in fashionable white)</p>
<p>Brewster Kahle is honored to be inducted today into the newly formed <a href="http://www.internetsociety.org/20th/internet-hall-fame" target="_blank">Internet Hall of Fame</a> administrated by the <a href="http://www.internetsociety.org/" target="_blank">Internet Society</a>.  As the Society that administers much of the Internet turns 20 years old, they have introduced this award at their annual meeting in Geneva.</p>
<p>&#8220;Brewster Kahle is the founder and Digital Librarian of the Internet Archive in 1996. An entrepreneur and Internet pioneer, Brewster invented the first Internet publishing system and helped put newspapers and publishers online in the 1990′s.&#8221; &#8211;<a href="http://www.internethalloffame.org/official-biography-brewster-kahle">Official Biography of the Internet Society</a></p>
<p>The Internet Archive&#8217;s work, to help build a robust library system reinforcing the publishing systems of the Internet including the World Wide Web, is likely to have been a reason for this award.   As well, this is for Brewster&#8217;s earlier work on the first Internet publishing system and search engine, Wide Area Information Servers, in 1989.</p>
<p><a href="http://blog.archive.org/wp-content/uploads/2012/04/Slide1.jpg"><img class="aligncenter size-full wp-image-4683" title="Brewster Kahle hall of fame slide" src="http://blog.archive.org/wp-content/uploads/2012/04/Slide1.jpg" alt="" width="720" height="405" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/04/24/internet-hall-of-fame/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>LibriVox Free Audiobook Project Receives Generous Mellon Support for Upgrade</title>
		<link>http://blog.archive.org/2012/04/05/librivox-free-audiobook-project-receives-generous-mellon-support-for-upgrade/</link>
		<comments>http://blog.archive.org/2012/04/05/librivox-free-audiobook-project-receives-generous-mellon-support-for-upgrade/#comments</comments>
		<pubDate>Thu, 05 Apr 2012 21:23:02 +0000</pubDate>
		<dc:creator>Jeff Kaplan</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Audio Archive]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4584</guid>
		<description><![CDATA[LibriVox.org, the world&#8217;s largest producer of free public domain audiobooks, and the Internet Archive are pleased to announce a generous grant from the Andrew W. Mellon Foundation, on the heels of a recent landmark achievement: 100 million downloads of the &#8230; <a href="http://blog.archive.org/2012/04/05/librivox-free-audiobook-project-receives-generous-mellon-support-for-upgrade/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><a href="http://archive.org/details/librivoxaudio"><img class="alignnone  wp-image-4585" src="http://blog.archive.org/wp-content/uploads/2012/04/librivoxlogo.png" alt="" width="660" height="178" /></a><a href="http://librivox.org/">LibriVox.org</a>, the world&#8217;s largest producer of free public domain audiobooks, and the Internet Archive are pleased to announce a generous grant from the Andrew W. Mellon Foundation, on the heels of a recent landmark achievement: 100 million downloads of the over 5,000 free <a href="http://archive.org/details/librivoxaudio">LibriVox audiobooks</a> from the <a href="http://archive.org/">Internet Archive</a>.</p>
<p>The Mellon grant will go towards rebuilding LibriVox&#8217;s technical infrastructure, and improving accessibility of the LibriVox website.</p>
<p>&#8220;It&#8217;s fantastic to get this support from the Mellon Foundation,&#8221; said LibriVox founder Hugh McGuire. &#8220;It will be put to good use, helping our hard-working volunteers create many more free audiobooks.&#8221;</p>
<p>LibriVox, a volunteer project of the Internet Archive, gets volunteers from around the world to make audio recordings of public domain texts, and gives those recordings away for free. All LibriVox audiobooks are hosted at the Internet Archive.</p>
<p>Founded in 2005, LirbiVox has to date produced 5,371 free audiobooks, in 31 languages. Popular audiobooks include &#8220;<a href="http://archive.org/details/adventures_holmes">The Adventures of Sherlock Holmes</a>,&#8221; by Arthur Conan Doyle, &#8220;<a href="http://archive.org/details/huck_finn_librivox">Adventures of Huckleberry Finn</a>&#8221; by Mark Twain, and &#8220;<a href="http://archive.org/details/jane_eyre_ver03_0809_librivox">Jane Eyre</a>,&#8221; by Charlotte Brontë. In addition to novels, the LibriVox collection includes numerous texts of importance from philosophers such as Kant, Descartes, and Hume, political documents such as the &#8220;<a href="http://archive.org/details/universal_declaration_librivox">Universal Declaration of Human Rights</a>,&#8221; and scientific texts including Einstein&#8217;s &#8220;<a href="http://archive.org/details/relativity_librivox">Relativity</a>,&#8221; and Darwin&#8217;s &#8220;<a href="http://archive.org/details/origin_species_librivox">Origin of the Species</a>.&#8221;</p>
<p>&#8220;The LibriVox collection is one of the most popular on the Internet Archive,&#8221; said Brewster Kahle, Founder and Director of the Internet Archive. &#8220;100 million downloads is awesome. LibriVox is an integral part of our commitment to making important texts available to the world in the best format for people, and we are thrilled at the support from the Mellon Foundation.&#8221;</p>
<p>Cori Samuel, a long-time LibriVox volunteer, who has recorded some of the project&#8217;s more popular books, was in shock at the numbers. &#8220;It&#8217;s hard to believe that what started out as a small project among some passionate people on the web has turned into something so big. It&#8217;s incredible to imagine that we could have touched the lives of 100 million listeners.&#8221;</p>
<p>For more information, please contact Hugh McGuire, LibriVox founder: <a href="mailto:hughmcguire@gmail.com">hughmcguire@gmail.com</a>.   <a href="http://librivox.org/2012/04/05/jobs-were-hiring-a-tech-project-manager-and-a-developer/">Job posting</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/04/05/librivox-free-audiobook-project-receives-generous-mellon-support-for-upgrade/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Brewster Kahle featured in People Magazine</title>
		<link>http://blog.archive.org/2012/04/04/brewster-kahle-featured-in-people-magazine/</link>
		<comments>http://blog.archive.org/2012/04/04/brewster-kahle-featured-in-people-magazine/#comments</comments>
		<pubDate>Wed, 04 Apr 2012 17:01:49 +0000</pubDate>
		<dc:creator>Jeff Kaplan</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4575</guid>
		<description><![CDATA[Brewster Kahle is featured in the April 2, 2012 edition of &#8220;People Magazine&#8221;. The Internet Archive Physical Archive is making news. Check it out at your local newstand!]]></description>
			<content:encoded><![CDATA[<p>Brewster Kahle is featured in the April 2, 2012 edition of &#8220;People Magazine&#8221;. The Internet Archive Physical Archive is making news. Check it out at your local newstand!</p>
<p><a href="http://blog.archive.org/wp-content/uploads/2012/04/BrewsterKahlePeople.jpg"><img class="alignnone size-large wp-image-4576" title="BrewsterKahlePeople" src="http://blog.archive.org/wp-content/uploads/2012/04/BrewsterKahlePeople-749x1024.jpg" alt="" width="640" height="874" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/04/04/brewster-kahle-featured-in-people-magazine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>We&#8217;ve dropped the www.!   Our preferred/canonical url is now http://archive.org</title>
		<link>http://blog.archive.org/2012/03/29/weve-dropped-the-www-our-preferredcanonical-url-is-now-httparchive-org/</link>
		<comments>http://blog.archive.org/2012/03/29/weve-dropped-the-www-our-preferredcanonical-url-is-now-httparchive-org/#comments</comments>
		<pubDate>Thu, 29 Mar 2012 19:53:33 +0000</pubDate>
		<dc:creator>traceypooh</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[audio]]></category>
		<category><![CDATA[bookmarks]]></category>
		<category><![CDATA[internet archive]]></category>
		<category><![CDATA[live music archive]]></category>
		<category><![CDATA[movies]]></category>
		<category><![CDATA[video]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4570</guid>
		<description><![CDATA[Dear Patrons, Last Thursday we pushed out changes to drop the &#8220;www.&#8221; prefix from our urls so that we have the newer/shorter style urls start like: http://archive.org We intend to keep this change permanently. We know there will be a &#8230; <a href="http://blog.archive.org/2012/03/29/weve-dropped-the-www-our-preferredcanonical-url-is-now-httparchive-org/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Dear Patrons,</p>
<p>Last Thursday we pushed out changes to drop the &#8220;www.&#8221; prefix from our urls<br />
so that we have the newer/shorter style urls start like:</p>
<p><a href="http://archive.org">http://archive.org</a></p>
<p>We intend to keep this change permanently.</p>
<p>We know there will be a few minor breaks here and there especially from some third-party applications that might not handle &#8220;301 Moved Temporarily&#8221; redirects (if you have something flash-based that needs http://www.archive.org/crossdomain.xml we caught that breakage and that url still works now (that is, it can be either requested either with or without the lead &#8220;www.&#8221; as an exception now).  We&#8217;re happy to work with anyone having issues &#8212; feel free to reply to this post and let us know.</p>
<p>Best wishes, and now go spend those four characters saved on something fun <img src='http://blog.archive.org/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/03/29/weve-dropped-the-www-our-preferredcanonical-url-is-now-httparchive-org/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Wayback Machine machines have moved</title>
		<link>http://blog.archive.org/2012/03/29/wayback-machine-machines-are-moving/</link>
		<comments>http://blog.archive.org/2012/03/29/wayback-machine-machines-are-moving/#comments</comments>
		<pubDate>Thu, 29 Mar 2012 00:48:21 +0000</pubDate>
		<dc:creator>brewster</dc:creator>
				<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4565</guid>
		<description><![CDATA[Update:  the Wayback Machine (minus about 3% of the data) is back up.   We will work on that last bit next week.   Yippie. &#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;- Sorry for the interruption, but we are moving most of the petabytes of the Wayback Machine &#8230; <a href="http://blog.archive.org/2012/03/29/wayback-machine-machines-are-moving/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Update:  the Wayback Machine (minus about 3% of the data) is back up.   We will work on that last bit next week.   Yippie.</p>
<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;-</p>
<p>Sorry for the interruption, but we are moving most of the petabytes of the Wayback Machine from their happy home in a shipping container on the Oracle/Sun campus to our new datacenter.</p>
<p>We are now 1/2 way through the move, and 1/2 of the machines are down.    We are not certain if they will be up tomorrow.    We are looking to move the rest of the machines next Tuesday, so we will have some interruptions then as well.</p>
<p>We will update this post as things move along.</p>
<p>Again, we apologize.</p>
<p>-brewster</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/03/29/wayback-machine-machines-are-moving/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Sharing Works: 100,000 concert recordings for free</title>
		<link>http://blog.archive.org/2012/03/27/sharing-works-100000-concert-recordings-for-free/</link>
		<comments>http://blog.archive.org/2012/03/27/sharing-works-100000-concert-recordings-for-free/#comments</comments>
		<pubDate>Tue, 27 Mar 2012 21:43:37 +0000</pubDate>
		<dc:creator>brewster</dc:creator>
				<category><![CDATA[Announcements]]></category>

		<guid isPermaLink="false">http://blog.archive.org/?p=4530</guid>
		<description><![CDATA[We just hit a milestone: 100,000 free concert recordings from over 5,200 bands! A fun trip through the history: The Grateful Dead started a tradition in the 60&#8242;s to allow taping of their concerts and sharing those tapes as long as &#8230; <a href="http://blog.archive.org/2012/03/27/sharing-works-100000-concert-recordings-for-free/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p><a href="http://archive.org/details/etree"><img class="alignleft size-full wp-image-4545" title="etree band logos" src="http://blog.archive.org/wp-content/uploads/2012/03/bandlogos.jpg" alt="etree band logos" width="630" height="171" /></a>We just hit a milestone: <a href="http://archive.org/details/etree">100,000 free concert recordings</a> from over <a href="http://archive.org/browse.php?collection=etree&amp;field=%2Fmetadata%2Fcreator">5,200 bands</a>!</p>
<p>A fun trip through the history:</p>
<p><a href="http://archive.org/details/GratefulDead"><img class="alignright size-medium wp-image-4537" title="The Grateful Dead" src="http://blog.archive.org/wp-content/uploads/2012/03/gdmain3-300x146.jpg" alt="The Grateful Dead" width="300" height="146" /></a>The Grateful Dead started a tradition in the 60&#8242;s to <a href="http://archive.org/details/GratefulDead">allow taping o</a><a href="http://archive.org/details/GratefulDead">f</a><a href="http://archive.org/details/GratefulDead"> their concerts</a> and sharing those tapes as long as no one made any money. Key: no one made any money.</p>
<p>A generation of deadheads became fantastic archivists. The songs were annotated, the loss in quality through copying was minimized. New tapers were brought into the fold.</p>
<p>Other bands, and their fans, started to join in: taping became one of the ways to participate in a community around a band&#8217;s music.</p>
<p>When music went digital, they wanted to put an end to generational loss through copying by passing on uncompressed files. They even developed a way to compress these files losslessly saving file size without losing quality: <a href="http://www.etree.org/shncom.html">shn</a> then <a href="https://en.wikipedia.org/wiki/FLAC">flac</a> formats, I believe, were developed by the tapers.</p>
<p><a href="http://etree.org/"><img class="alignleft size-full wp-image-4538" title="etree" src="http://blog.archive.org/wp-content/uploads/2012/03/etreelogo.jpg" alt="etree logo" width="175" height="61" /></a>The Internet made digital music trading possible, but the files were big. The ETree community started <a href="http://db.etree.org/">a database</a> of shows to help facilitate accurate transfer. John Gilmore and John Perry Barlow translated the <a href="http://web.archive.org/web/20050324025618/http://dead.net/hotline_info/NEW_DOCUMENTS/mp3.html">Dead&#8217;s taper permission</a> to the Internet generation.</p>
<p>The Internet Archive offered to the ETree community to provide &#8220;unlimited storage, unlimited bandwidth, forever, for free&#8221; in an email in 2002. The reply came back:  &#8220;We don&#8217;t believe you. But if you could, that would be our dream.&#8221; We decided to give it a try.    But posting concerts on a website was different from tape trading so we decided to ask for some sort of permission from the taper friendly bands. ETree volunteers started to ask for email confirmation and these were then put in each band&#8217;s <a href="http://archive.org/details/CowboyJunkies">collection page</a>.</p>
<p>About 1-3 bands sign up and 30 concerts are uploaded every day ever since! Volunteer administrator Tyler Huff has been keeping a <a href="http://archive.org/post/415550/100000-items-in-the-live-music-archive">milestone post</a> going.</p>
<p>These concerts have been downloaded or listened to almost 800,000,000 times. The Grateful Dead collection alone is over 65,000,000 for the over <a href="http://archive.org/details/GratefulDead">8,000 concert recordings</a> on the site.</p>
<p>The total data space for the live music archive is now 112,000,000,000,000 Bytes (112TB), and the Grateful Dead collection is 12TBytes.</p>
<p>This is working for the Internet Archive because our mission is to make the cultural works of humankind available to all. It is also showing that we can share and be happy about it.</p>
<p>Thank you to all the 5,200 bands that have participated. Thank you to the over 5,800 people that have uploaded shows. Thank you to the fabulous admins Tyler Huff, John Dailey, Dave Mallick, Diana Hamilton, Peter Hedeman, Brad LeBlanc, Marc Pujol, Tom Horton, Mark Goldey and Tom Anderson. Thank you to the Grateful Dead and those that created and supported this tradition such as John Gilmore, John Perry Barlow, Geoff Sears, and Matt Vernon. Thank you to the Archive staff such as Jon Aizen, Parker Thompson, Tracey Jaquith, Alexis Rossi, and Jeff Kaplan.</p>
<p>Support Live Music!  Please go out to see a band or join one.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.archive.org/2012/03/27/sharing-works-100000-concert-recordings-for-free/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

