Category Archives: Live Music Archive

2022 Empowering Libraries Year in Review

The Internet Archive launched the Empowering Libraries campaign in 2020 to defend equal access to library services for all. Since then, threats to libraries have only grown, so our fight continues. As 2022 draws to a close, here’s a look back through some of our library’s milestones and accomplishments over the year.

In the news

  • When the war in Ukraine started, volunteers began using the Wayback Machine and other online tools to preserve Ukrainian websites and digital collections. The effort, Saving Ukrainian Cultural Heritage Online (SUCHO), now has more than 1,500 volunteers working to preserve more than 5,000 web sites and 50TB of data. 
    • Watch a compelling story about SUCHO from CBS News featuring Quinn Dombrowski, one of the project leaders from Stanford University, and Mark Graham, director of the Wayback Machine.
    • In May, we partnered with Better World Books on a book drive supporting Ukrainian scholars. BWB customers were able to donate $1 at checkout to acquire books cited in the Ukrainian-language Wikipedia for the Internet Archive to preserve, digitize, and link to citations in Wikipedia.
  • In October, we introduced Democracy’s Library, a free, open, online compendium of government research and publications from around the world. We hosted an in-person celebration that highlighted the critical importance of free and open access to government publications, and have continued framing out what Democracy’s Library is and why it’s necessary.
  • Internet Archive Canada opened its new headquarters in Vancouver, BC, alongside the Association of Canadian Archivists 2022 Conference.
  • More than 1,000 authors have spoken out on behalf of libraries, demanding that publishers and trade associations put the digital rights of librarians, readers, and authors ahead of shareholder profits. 
  • In a tumultuous year on social media, Internet Archive has added a Mastodon server. Why? We need a game with many winners, not just a few powerful players.
  • In an OpEd for TIME, Brewster Kahle, founder and digital librarian of the Internet Archive, warned, “the instability occasioned by Twitter’s change in ownership has revealed an underlying instability in our digital information ecosystem.”

The internet reacts to the lawsuit against our library

  • On July 7, 2022, the Internet Archive filed a motion for summary judgment, asking a federal judge to rule in our favor and end a radical lawsuit, filed by four major publishing companies, that aims to criminalize library lending. Check out the Hachette v. Internet Archive page at EFF for all filings and resources.
  • We hosted a press conference on July 8 about the lawsuit featuring statements from Brewster Kahle (Internet Archive) and Corynne McSherry (EFF), plus powerful impact statements from medical school librarian Benjamin Saracco and author and editor Tom Scocca.
  • Interest in the lawsuit crossed over into mainstream channels following a viral tweet about the filing, which kicked off a lengthy online conversation about library rights, digital lending and digital ownership.
  • After a series of standard filings across the summer and early fall, on October 8, Internet Archive filed the final brief in support of our motion for summary judgment, asking the Court to dismiss the lawsuit because our lending program is a fair use.
  • What does the lawsuit mean for the future of libraries? Internet Archive’s policy counsel, Peter Routhier, considers how the publishers view libraries based on their filings.
  • Check out the Hachette v. Internet Archive page at EFF for all filings and resources.
  • One message really resonated online—people were surprised to learn that the Internet Archive has a physical archive that preserves all the physical books we’ve acquired and digitized. 

eBooks, #OwnBooks & digital ownership

  • 2022 might go down as the year that people started to really understand what it means when libraries & individuals can no longer own content, like when streaming-only content vanishes from media platforms.
  • Musician Max Collins wrote in Popula how “owning media is now an act of countercultural defiance,” walking readers through his first-hand example of how the streaming model doesn’t work for artists, only corporations.
  • Brewster Kahle published, “Digital Books wear out faster than Physical Books,” countering the notion put forward by publishers that ebooks don’t wear out. In fact, Brewster notes that ebooks require “constant maintenance—reprocessing, reformatting, re-invigorating or they will not be readable or read.”
  • Brewster’s post sparked the interest of LA Times business columnist Michael Hiltzik, who expanded on the issues around digital ownership in “Here’s why you can’t ‘own’ your ebooks.”
  • To celebrate why it’s important to own books, and to help bring visibility to issues around digital ownership, we launched the participatory #OwnBooks campaign, which invited people to share photos with the oldest book, or most treasured volume, from their personal collection, like this signed copy of The Phantom Tollbooth.
  • Author Glyn Moody published his latest book, Walled Culture, as a free ebook that you can download and own, or as a physical book that you can purchase in print.
  • More publishers joined the movement to sell—not license—ebooks to libraries, including independent publisher 11:11 Press.

The future of libraries

  • In February, we launched Library as Laboratory, a new series exploring the computational use of Internet Archive collections. The series included segments from digital humanities scholars, computational scientists, web archiving professionals and other researchers.
  • To help librarians and other information professionals better understand the decentralized web, Internet Archive partnered with the Metropolitan New York Library Council, DWeb, and Library Futures for a six-part series, Imagining a Better Online World: Exploring the Decentralized Web
  • During this year’s National Library Week, we invited readers to Meet the Librarians who work at the Internet Archive, highlighting the new roles our librarians lead in support of our mission, “Universal Access to All Knowledge.”
  • Internet Archive joined with Creative Commons, Wikimedia Foundation and others in the Movement for a Better Internet, a collaborative effort to ensure that the internet’s evolution is guided by public interest values.
  • Lila Bailey, Internet Archive’s senior policy fellow, and Michael Menna, policy fellow from Stanford University, released their report,”Securing Digital Rights for Libraries: Towards an Affirmative Policy Agenda for a Better Internet,” regarding libraries’ role in shaping the next iteration of the internet

Milestones

  • Dave Hansen, one of the authors of the white paper on controlled digital lending, was named the new executive director of Authors Alliance.
  • Carl Malamud received this year’s Internet Archive Hero Award for his lifelong mission to make public information freely available to the public.
  • We hosted the first in-person Library Leaders Forum in three years, preceded by a virtual Forum that brought together hundreds of digital library enthusiasts to explore issues related to digital ownership and the future of library collections.
  • We hosted a joint webinar with OCLC about our resource sharing pilots, including how to request articles from the Internet Archive via interlibrary loan.
  • The Music Library Association made its publications openly available at Internet Archive.
  • We began gathering content to support the newly announced Digital Library of Amateur Radio and Communications (DLARC), and then quickly surpassed 25,000 items in the collection.
  • DISCMASTER, a new software tool, allows users to search across the contents of the tens of thousands of archived CD-ROMs at the Internet Archive.
  • In August we celebrated the 20th anniversary of the Live Music Archive with a historical tour of the effort, which has resulted in hundreds of thousands of live sets available for listening at archive.org.

Donations

  • Colgate University donated more than 1.5 million microfiche cards for preservation and digitization, covering topics including Census data, documents from the Department of Education, Congressional testimony, CIA documents, and foreign news translated into English.
  • Facing an uncertain future, Hong Kong bookstore owner Albert Wan closed his pro-democracy, independent bookstore and donated the books to the Internet Archive for preservation and digitization.
  • Do you have physical collections you’d like to donate to the Internet Archive? Check out our help document.

Book talks

New additions to the Internet Archive for July 2022

Many items are added to the Internet Archive’s collections every month, by us and by our patrons. Here’s a round up of some of the new media you might want to check out. Logging in might be required to borrow certain items. 

Notable new collections from our patrons: 

Books – 78,091 New items in July

This month we’ve added books on varied subjects in more than 20 languages. Click through to explore, but here are a few interesting items to start with:

Audio Archive – 91,636 New Items in July

The audio archive contains recordings ranging from alternative news programming, to Grateful Dead concerts, to Old Time Radio shows, to book and poetry readings, to original music uploaded by our users. Explore.

LibriVox Audiobooks – 119 New Items in July

Founded in 2005, Librivox is a community of volunteers from all over the world who record audiobooks of public domain texts in many different languages. Explore.

78 RPMs and Cylinder Recordings – 8,888 New Items in July

Listen to this collection of 78rpm records, cylinder recordings, and other recordings from the early 20th century. Explore.

Live Music Archive – 965 New Items in July

The Live Music Archive is a community committed to providing the highest quality live concerts in a lossless, downloadable format, along with the convenience of on-demand streaming (all with artist permission). Explore.

Movies – 135 New Items in July

Watch feature films, classic shorts, documentaries, propaganda, movie trailers, and more! Explore.

New additions to the Internet Archive for May 2022

Many items are added to the Internet Archive’s collections every month, by us and by our patrons. Here’s a round up of some of the new media you might want to check out. Logging in might be required to borrow certain items. 

Notable new collections from our patrons: 

Books – 52,300 New items in May

This month we’ve added books on varied subjects in more than 20 languages. Click through to explore, but here are a few interesting items to start with:

Audio Archive – 89,325 New Items in May

The audio archive contains recordings ranging from alternative news programming, to Grateful Dead concerts, to Old Time Radio shows, to book and poetry readings, to original music uploaded by our users. Explore.

LibriVox Audiobooks – 92 New Items in May

Founded in 2005, Librivox is a community of volunteers from all over the world who record audiobooks of public domain texts in many different languages. Explore.

78 RPMs and Cylinder Recordings – 112 New Items in May

Listen to this collection of 78rpm records, cylinder recordings, and other recordings from the early 20th century. Explore.

Live Music Archive – 807 New Items in May

The Live Music Archive is a community committed to providing the highest quality live concerts in a lossless, downloadable format, along with the convenience of on-demand streaming (all with artist permission). Explore.

Netlabels223 New Items in May

This collection hosts complete, freely downloadable/streamable, often Creative Commons-licensed catalogs of ‘virtual record labels’. These ‘netlabels’ are non-profit, community-built entities dedicated to providing high quality, non-commercial, freely distributable MP3/OGG-format music for online download in a multitude of genres. Explore.

Movies – 110 New Items in May

Watch feature films, classic shorts, documentaries, propaganda, movie trailers, and more! Explore.

New additions to the Internet Archive for April 2022

Many items are added to the Internet Archive’s collections every month, by us and by our patrons. Here’s a round up of some of the new media you might want to check out. Logging in might be required to borrow certain items. 

Notable new collections from our patrons: 

  • Chris Cromwell Rare Reel to Reel Tapes – Rare and recovered reel-to-reel tapes from a variety of sources and preserved by Chris Cromwell. 
  • 1940s Classic TV – Television from the 1940s.
  • Game Shows Archive – A collection of game shows throughout television history, involving chance, skill and luck, usually presided over by a host and providing in-show commercials.
  • Dutch Television – Television programs and videos in the Dutch language, or from the Netherlands.

Books – 50,109 New items in April

This month we’ve added books on varied subjects in more than 20 languages. Click through to explore, but here are a few interesting items to start with:

Audio Archive – 150,224 New Items in April

The audio archive contains recordings ranging from alternative news programming, to Grateful Dead concerts, to Old Time Radio shows, to book and poetry readings, to original music uploaded by our users. Explore.

LibriVox Audiobooks – 99 New Items in April

Founded in 2005, Librivox is a community of volunteers from all over the world who record audiobooks of public domain texts in many different languages. Explore.

78 RPMs and Cylinder Recordings – 6,745 New Items in April

Listen to this collection of 78rpm records, cylinder recordings, and other recordings from the early 20th century. Explore.

Live Music Archive – 909 New Items in April

The Live Music Archive is a community committed to providing the highest quality live concerts in a lossless, downloadable format, along with the convenience of on-demand streaming (all with artist permission). Explore.

Netlabels111 New Items in April

This collection hosts complete, freely downloadable/streamable, often Creative Commons-licensed catalogs of ‘virtual record labels’. These ‘netlabels’ are non-profit, community-built entities dedicated to providing high quality, non-commercial, freely distributable MP3/OGG-format music for online download in a multitude of genres. Explore.

Movies – 55 New Items in April

Watch feature films, classic shorts, documentaries, propaganda, movie trailers, and more! Explore.

New additions to the Internet Archive for March 2022

Many items are added to the Internet Archive’s collections every month, by us and by our patrons. Here’s a round up of some of the new media you might want to check out. Logging in might be required to borrow certain items. 

Notable new collections from our patrons: 

Books – 60,379 New items in March

This month we’ve added books on varied subjects in more than 20 languages. Click through to explore, but here are a few interesting items to start with:

Audio Archive – 93,954 New Items in March

The audio archive contains recordings ranging from alternative news programming, to Grateful Dead concerts, to Old Time Radio shows, to book and poetry readings, to original music uploaded by our users. Explore.

LibriVox Audiobooks – 122 New Items in March

Founded in 2005, Librivox is a community of volunteers from all over the world who record audiobooks of public domain texts in many different languages. Explore.

78 RPMs and Cylinder Recordings – 7,423 New Items in March

Listen to this collection of 78rpm records, cylinder recordings, and other recordings from the early 20th century. Explore.

Live Music Archive – 1,098 New Items in March

The Live Music Archive is a community committed to providing the highest quality live concerts in a lossless, downloadable format, along with the convenience of on-demand streaming (all with artist permission). Explore.

Netlabels186 New Items in March

This collection hosts complete, freely downloadable/streamable, often Creative Commons-licensed catalogs of ‘virtual record labels’. These ‘netlabels’ are non-profit, community-built entities dedicated to providing high quality, non-commercial, freely distributable MP3/OGG-format music for online download in a multitude of genres. Explore.

Movies – 25 New Items in March

Watch feature films, classic shorts, documentaries, propaganda, movie trailers, and more! Explore.

What’s New in February 2022

Here are some of the notable new additions to the Internet Archive from February 2022. (Logging in might be required to borrow certain items.)

Notable new collections: 

We’ve been reorganizing some of the items uploaded by our users, and these collections of magazines struck us as particularly interesting:

Books 45,073

This month we’ve added books in more than 20 languages. Here are a few good ones to start with:

Audio Archive 73,305

The audio archive contains recordings ranging from alternative news programming, to Grateful Dead concerts, to Old Time Radio shows, to book and poetry readings, to original music uploaded by our users.

The LibriVox Free Audiobook Collection 118

Founded in 2005, Librivox is a community of volunteers from all over the world who record audio versions of public domain texts: poetry, short stories, whole books, even dramatic works, in many different languages.

78 RPMs and Cylinder Recordings 8,840

Listen to this collection of 78rpm records, cylinder recordings, and other recordings from the early 20th century.

Live Music Archive 892

The Live Music Archive is a community committed to providing the highest quality live concerts in a lossless, downloadable format, along with the convenience of on-demand streaming.

Netlabels 263

The Netlabels collection hosts complete, freely downloadable/streamable, often Creative Commons-licensed catalogs of virtual record labels.

Internet Arcade 5

The Internet Arcade is a web-based library of arcade (coin-operated) video games from the 1970s through to the 1990s, emulated in JSMAME, part of the JSMESS software package. Containing hundreds of games ranging through many different genres and styles, the Arcade provides research, comparison, and entertainment in the realm of the Video Game Arcade.

New additions to the Internet Archive for January 2022

Many items are added to the Internet Archive’s collections every month, by us and by our patrons. Here’s a round up of some of the new media you might want to check out. Logging in might be required to  borrow certain items. 

Notable new collections: 

Books 40,695

This month we’ve added books on varied subjects in more than 20 languages. Click through to explore, but here are a few interesting items to start with:

Audio Archive 79,099

The audio archive contains recordings ranging from alternative news programming, to Grateful Dead concerts, to Old Time Radio shows, to book and poetry readings, to original music uploaded by our users.

The LibriVox Free Audiobook Collection 98

Founded in 2005, Librivox is a community of volunteers from all over the world who record audiobooks of public domain texts in many different languages.

 

78 RPMs and Cylinder Recordings 6,849

The Great 78 Project! Listen to this collection of 78rpm records, cylinder recordings, and other recordings from the early 20th century.

Live Music Archive 799

The Live Music Archive is a community committed to providing the highest quality live concerts in a lossless, downloadable format, along with the convenience of on-demand streaming (all with artist permission).

Netlabels 486

This collection hosts complete, freely downloadable/streamable, often Creative Commons-licensed catalogs of ‘virtual record labels’. These ‘netlabels’ are non-profit, community-built entities dedicated to providing high quality, non-commercial, freely distributable MP3/OGG-format music for online download in a multitude of genres.

Audio / Video player updated – to jwplayer v8.2

We updated our audio/video (and TV) 3rd party JS-based player from v6.8 to v8.2 today.

This was updated with some code to have the same feature set as before, as well as new:

  • much nicer cosmetic/look updates
  • nice “rewind 10 seconds” button
  • controls are now in an updated control bar
  • (video) ‘Related Items’ now uses the same (better) recommendations from the bottom of an archive.org /details/ page
  • Airplay (Safari) and Chromecast basic casting controls in player
  • playback speed rate control now easier to use / set
  • playback keyboard control with SPACE and left , right and up, down keys
  • (video) Web VTT (captions) has much better user interface and display
  • flash is now only used to play audio/video if html5 doesnt work (flash does not do layout or controls now)

Here’s some before / after screenshots:

archive.org download counts of collections of items updates and fixes

Every month, we look over the total download counts for all public items at archive.org.  We sum item counts into their collections.  At year end 2014, we found various source reliability issues, as well as overcounting for “top collections” and many other issues.

archive.org public items tracked over time

archive.org public items tracked over time

To address the problems we did:

  • Rebuilt a new system to use our database (DB) for item download counts, instead of our less reliable (and more prone to “drift”) SOLR search engine (SE).
  • Changed monthly saved data from JSON and PHP serialized flatfiles to new DB table — much easier to use now!
  • Fixed overcounting issues for collections: texts, audio, etree, movies
  • Fixed various overcounting issues related to not unique-ing <collection> and <contributor> tags (more below)
  • Fixes to character encoding issues on <contributor> tags

Bonus points!

  • We now track *all collections*.  Previously, we only tracked items tagged:
    • <mediatype> texts
    • <mediatype> etree
    • <mediatype> audio
    • <mediatype> movies
  • For items we are tracking <contributor> tags (texts items), we now have a “Contributor page” that shows a table of historical data.
  • Graphs are now “responsive” (scale in width based on browser/mobile width)

 

The Overcount Issue for top collection/mediatypes

  • In the below graph, mediatypes and collections are shown horizontally, with a sample “collection hierarchy” today.
  • For each collection/mediatype, we show 1 example item, A B C and D, with a downloads/streams/views count next to it parenthetically.   So these are four items, spanning four collections, that happen to be in a collection hierarchy (a single item can belong to multiple collections at archive.org)
  • The Old Way had a critical flaw — it summed all sub-collection counts — when really it should have just summed all *direct child* sub-collection counts (or gone with our New Way instead)

overcount

So we now treat <mediatype> tags like <collection> tags, in terms of counting, and unique all <collection> tags to avoid items w/ minor nonideal data tags and another kind of overcounting.

 

… and one more update from Feb/1:

We graph the “difference” between absolute downloads counts for the current month minus the prior month, for each month we have data for.  This gives us graphs that show downloads/month over time.  However, values can easily go *negative* with various scenarios (which is *wickedly* confusing to our poor users!)

Here’s that situation:

A collection has a really *hot* item one month, racking up downloads in a given collection.  The next month, a DMCA takedown or otherwise removes the item from being available (and thus counted in the future).  The downloads for that collection can plummet the next month’s run when the counts are summed over public items for that collection again.  So that collection would have a negative (net) downloads count change for this next month!

Here’s our fix:

Use the current month’s collection “item membership” list for current month *and* prior month.  Sum counts for all those items for both months, and make the graphed difference be that difference.  In just about every situation that remains, graphed monthly download counts will be monotonic (nonnegative and increasing or zero).

 

 

Music Analysis Beginnings

As mentioned in our recent Building Music Libraries post, we are working with researchers at Columbia University and UPF in Barcelona to run their code on the music collection to help their research and to provide new analyses that could help with exploration and understanding.

We are doing some pilot runs to generate files which some close observers may see in the music item directories on archive.org.  Audio fingerprints from audfprint are .afpt and music attributes from Essentia are in _esslow.json.gz (download sample) and _esshigh.json.gz.

Spectrogram of a Grateful Dead track

Spectrogram of a Grateful Dead track

We are also creating image files showing the audio spectrum used.  We hope this is useful for those that want to see if files have been compressed in the past (even if they are posted as flac files now).  There is also a .png for each audio file of a basic waveform that is being used in the archive’s beta site as eye candy.

More as it happens, but we wanted you know there is some progress and you will see some new files.  If you have proposed other analyses that would benefit from being run over a large corpus, please let us know by contacting info at archive dot org.

Thank you to the researchers and the Archive programmers who are working together to make this happen.