TV News Record: State of the Union, past and future

A round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman. Additional research by Robin Chin.

When President Donald Trump takes the podium to deliver his first official State of the Union address to a joint session of Congress on Tuesday, January 30,  he’ll be following in the footsteps of the nation’s very first president, George Washington, long before there was cable TV or radio.

In Washington’s time, the speech was not yet known as the State of the Union, but the annual message, and according to Donald Ritchie, former U.S. Senate historian, seen here in a clip from C-Span, the practice was “to [physically] cut the State of the Union message up into paragraphs and create committees to address each one of the issues the president suggested.” There were no standing committees in Congress at that time. Now it’s fact-checkers who examine the speech, line by line, and since 2017, we’ve been annotating our TV news programs with fact-checks of Trump, top administration officials, and the four top congressional leaders, Democrat and Republican.

Make history by being a beta tester of FactStream, a new free app for iPhone or iPad, which will deliver live fact-checks of Trump’s State of the Union address from national fact-checking organizations. The app is a product of Duke Reporters Lab Tech & Check collective, of which the Internet Archive’s TV News Archive is a member. We’ll be adding the fact-checks to the TV News Archive, too.

At the TV News Archive, we’ve got historical footage of some past State of the Union addresses, listed below. Last year we annotated Trump’s address to Congress – not officially a State of the Union, since he was newly inaugurated – with fact-checks from our fact-checking partners,, PolitiFact, and The Washington Post’s Fact Checker. Fact-checks are noted with a red check mark on the TV News Archive filmstrip screen.

For example, the above segment of Trump’s 2017 speech, marked with a red check mark, was fact-checked by both PolitiFact and The Washington Post’s Fact Checker. Trump said, “According to data provided by the Department of Justice, the vast majority of individuals convicted of terrorism and terrorism related offenses since 9/11 came here from outside of our country.”

PolitiFact’s Miriam Valverde rated this claim as “mostly false”: “Trump’s statement contains an element of truth but ignores critical facts that would give a different impression. We rate it Mostly False.” Michelle Ye Hee Lee, writing for The Washington Post’s Fact Checker gave the claim “four Pinocchios,” stating it relied on “a grossly exaggerated misuse of federal data.”

Past State of the Union addresses

2016: Barack Obama

2015: Barack Obama

2014: Barack Obama

2013: Barack Obama

2012: Barack Obama

2011: Barack Obama

2010: Barack Obama

1995: Bill Clinton

1988: Ronald Reagan (no closed captioning)

1980: Jimmy Carter  (no closed captioning)

1975:  Gerald Ford

1969: Lyndon Johnson

1965: Lyndon Johnson

1963: John F. Kennedy (no closed captioning)

1961: John F. Kennedy (no closed captioning)

1942: Franklin D. Roosevelt (no closed captioning)

Posted in Announcements, News, Television Archive | Tagged , , , , , , , , , , | 4 Comments

Digital Books on

Many people think of the Internet Archive as just the Wayback Machine or just one collection or another, but there is much more.  For instance, books!

As a nonprofit library we buy and lend books to the public, but we do even more than that. Working with hundreds of libraries, we buy ebooks, digitize physical books, offer them to the print-disabled, and lend books to one reader at a time, all for free via and is the website that offers free public access to all sorts of materials uploaded by users, collected by the Internet Archive, and digitized by the Internet Archive. includes books, music, video, webpages, and software., a site that is maintained by the Internet Archive, is a catalog of books with the mission to offer “One webpage for every book.”  This open source catalog site, started in 2005, is editable by its users and has many code contributors. It links to various resources about that book, for instance, links to and to buy the book, to local libraries that own the book, to for print-disabled access or to borrow a digitized version of the book, and to other sites that have digital versions.

The goals of libraries are preservation and access. For physical books, we buy and receive donations of hundreds of thousands of books that we preserve for the long term in archival, non-circulating stacks. Support for this comes from libraries, used book vendors, foundations, and tens of thousands of individual donors to the Internet Archive, a public charity.

We also work with more than 500 libraries to help digitize their books, now more than 3 million of them, to preserve them digitally and offer online access. These libraries make their older books (mostly pre-1923)  available for free public downloading, and fantastically over 25 million older books are viewed every month.

Unfortunately, the books of the 20th century are largely not available either physically or digitally. These graphs show how the 20th century’s books are not available through Amazon for purchase, or from the Internet Archive. Some have reasoned this is because of copyright. 1923 is a special date in US copyright law because works published before this date are in the  Public Domain, while afterwards copyright status can be very complicated. Unfortunately, 1923 in these graphs also demarks a sharp drop in commercial availability of many books. These books are often only available through libraries.

Starting 10 years ago the Internet Archive began digitizing modern books, mostly from the 20th century,  for access by the blind and dyslexic. Those that are certified disabled by the Library of Congress get a decryption key for accessing Library of Congress scanned books. This key can also decrypt digitized books available on This combined with special formats for the blind and dyslexic of the older books has brought millions of books to people that have had difficulty in the past. We are working to make these books more available to these communities in other special formats.

Publishers have been using digital protection technologies for years for ebooks sold to retail customers, often referred to as DRM (digital rights management).  Libraries lend ebooks using the same DRM, and the Internet Archive has followed that lead, using Adobe Digital Editions.

The digital protection allows books to be lent via downloads that disappear (or become inaccessible) when the loan period ends (e.g. two weeks).  For users who prefer to read their ebooks directly in a browser, the same thing happens. The book becomes inaccessible at the end of the loan period, and the next reader in line has a chance to borrow it.

While it is technically possible to break the digital protections of these technologies, it is illegal to do so. Moreover, the typical user does not do this, allowing for a flourishing ebook marketplace for current books. The Internet Archive is able to make available for loan older books that are not available in ebook format. In every case, an authorized print copy has been acquired and made unavailable for simultaneous loan.

Many of the books in our collection are books that libraries believe to be of historical importance such that they do not want to throw them away, but are not worth keeping on their physical shelves. The digitized versions are therefore made available to a single user at a time, while the physical book no longer circulates. Since the books which are lent using the controlled digital lending technologies are limited to one reader at a time, it works best for “long tail” books, books that are not available in other ways. Fortunately, many of these books are wonderful and important and we are proud to bring them to a generation of digital learners who may not have physical access to major public libraries.

We hope many more libraries start controlled digital lending of their books as this is a way to bring public access to the purchases and collections they have built over centuries.

We have recently made available a small number of books (currently 61 books) published between 1923 and 1941 under a provision of US Copyright law that was written to permit libraries to copy and lend titles that are no longer subject to commercial exploitation, and selection is currently overseen by  lawyers expert in US copyright law.

As a completely separate service from buying ebooks and loaning to users with controlled digital lending, the Internet Archive offers free hosting for cultural works (texts, audio, moving images) that are uploaded by the general public. Millions of documents from court cases, and digitized books from other projects such as the Google book program and the Digital Library of India have been uploaded over the years.

When a rights holder wants a work that was uploaded by a user taken down, a well known “Notice and Takedown” procedure is in place. The Internet Archive takes prompt action and follows the procedure, generally resulting in the work being taken down.

Where is this all going?  We are looking for partners and ideas to help bring more books to more people in more ways. More books (and more accessible books) for the print disabled, complete collections of books from the 20th century online and available, clickable footnotes for books cited in Wikipedia to bring up the full text on the right page, and many more books in bookstores and libraries. This generation of digital learners is looking for this, is expecting this. Collectively, libraries, booksellers, publishers, and authors– old and new– share these same interests.  The good news is the technologies are now available– we all have to do our parts to do to serve digital learners everywhere.

As a library, we strive to provide “Universal Access to All Knowledge.” The digital technologies make this a feasible dream.  We are working with publishers, booksellers, authors, other libraries, and most of all digital learners to find balanced and respectful ways to try to achieve this goal. If you want to help, or have ideas on what we can do to get there, please let us know.


Posted in Announcements, News | 1 Comment

Building Digital 78rpm Record Collections Together with Minimal Duplication

78_mama-yo-quiero_joaquin-garay-al-wallace-orchestra-e-b-marks_gbia0034720aBy working together, libraries who are digitizing their collections can minimize duplication of effort in order to save time and money to preserve other things.  This month we made progress with 78rpm record collections.

The goal is to bring many collections online as cost effectively as possible. Ideally, we want to show each online collection as complete but only digitize any particular item once. Therefore one digitized item may belong virtually to several collections. We are now doing this with 78rpm records in the Great 78 Project.

It starts with great collections of 78s (18 contributors so far). For each record, we look up the record label, catalog number, and title/performer, to see if we have it already digitized. If we have it already, then we check the condition of the digitized one against the new one– if we would improve the collection, we digitize the new one. If we do not need to digitize it, we add a note to the existing item that it now also belongs to another collection, as well as note where the duplicate physical item can be found.

For instance, the KUSF collection we are digitizing has many fabulous records we have never seen before including sound effect records.  But about half are records we have digitized better copies of before, so we are not digitizing most of those. We still attribute the existing digital files to the KUSF collection so it will have a digital file in the online collection for each of their physical discs.

It takes about half the time to find a record is a duplicate than to fully digitize it, and given that we are now seeing about half of our records not needing to be digitized, we are looking for ways to speed this up.

OCLC has many techniques to help with deduplication of books and we are starting to work with them on this, but for 78s we are making progress in this way. Please enjoy the 78s.

Thank you to GeorgeBlood L.P., Jake Johnson, B. George, and others.

Posted in 78rpm, Announcements, Audio Archive, News | Comments Off on Building Digital 78rpm Record Collections Together with Minimal Duplication

The Lost Landscapes of San Francisco: A Benefit for the Internet Archive — Monday, January 29

by Rick Prelinger

Internet Archive presents the movie 12th annual Lost Landscapes of San Francisco on Monday, January 29 at 7:30 pm at our headquarters in San Francisco. The show will be preceded by a small reception at 6:30 pm, when doors will also open.

Buy Tickets Here

I’ve been collecting historical footage of San Francisco and the Bay Area in earnest since 1993, when we acquired the collection assembled by noted local historian and film preservationist Bert Gould. Since that time I’ve worked to collect film material showing the history of this dynamic and complex region. Much of it is online for free viewing, downloading and reuse as part of the Prelinger Collection. Many great things have happened at the Archive showings: people have recognized their relatives in the films, and many have seen their own streets and neighborhoods as they’ve never before seen them.

Combining favorites from past years with this year’s footage discoveries, this feature-length program shows San Francisco’s neighborhoods, infrastructures, celebrations and people from the early 20th century through the 1970s. New sequences this year include North Beach clubs and nightlife, colorful New Deal labor graphics, early BART footage, a scooters’ rights demonstration (!), unbuilt sand dunes in the Sunset, Barbra Streisand and Ryan O’Neal shooting WHAT’S UP DOC? on location in the Richmond District, more footage of the mysterious Running Man in Chinatown and on Nob Hill, Bay Area activism, birthdays and Thanksgiving in the Outer Mission in the late 1940s, Latino families dancing on Ocean Beach, and much, much more.

As always, the audience makes the soundtrack! This is an excellent venue for the show, as the shape of the Great Room makes it easy for participants to hear one another’s comments. Come prepared to identify places, people and events, to ask questions and to engage in spirited real-time repartee with fellow audience members, and look for hints of San Francisco’s future in the shape of its lost past.

Monday, January 29th
6:30 pm Reception
7:30 pm Interactive Film Program

Internet Archive
300 Funston Avenue
San Francisco, CA 94118

Buy Tickets Here!

bitcoin accepted then email

Posted in Announcements, Event | 5 Comments

Why We Do What We Do

This month we were powered by 75,000 donations big and small.  One supporter in Zimbabwe even sent us Twenty Billion dollars!  But what keeps us going is, quite simply, you.  Your words of encouragement and support remind us why we do what we do.  They make us want to do more.

We asked our supporters “Why do you donate to the Internet Archive?” Here’s a selection of recent replies:

I love the Archive and I love the Wayback Machine. What is true may not always be clear. But by looking at the past we can see what is and isn’t true. You enable that vital process to occur. Thank you! — Jack

I donated because, “we can only keep what we have by giving it away.”  — Joe

I have used your site many times. I was homeless and broke. Said if I ever had anything to give, I would be sure to do so. So here I am.  –Rebecca

Where would any of us be without the Wayback Machine? — Melissa

I’m a university professor and depend heavily on the Internet for my required readings. In this case, I am having my students read a book published in 1838. How else would I get them access to such a text?  – Madeleine

With the impending closure of Storify and the Library of Congress’s decision to stop archiving Twitter, I was reminded just how vital this service is.  — Lee

You are filling a serious need, the Internet isn’t a safe place for sensitive information and it’s good to know you have our backs.  — Tony

The most trusted name in knowledge for free. What Google wanted to be. I do hope you get the major foundational support you deserve. And support from Old Time Radio and Jazz aficionados like I am. Great literature as well. And film, cartoons. is the cultural library for “the rest of us.”  –Dennis

I’m an amateur historian and writer. I appreciate being able to find information on your site that would otherwise require trips around the country to dusty archives or libraries!  –Rita

Downloaded a few really great live shows from (especially from Songs: Ohia, Magnolia Electric Co & Cowboy Junkies). With my small donation I wanted to give a little something back to you.  –Roland

Love your old time radio & television programs. You are preserving our history while so many others are trying to change or erase it. I feel your preservation efforts are important to our culture & freedom. Those who forget (or erase) history are doomed to repeat it. Keep up the good work. Wish I could afford more. –-Randy

Why I donated? This pretty much sums it up for me:  “Fixed more than 3 million broken links in Wikipedia using the Wayback Machine.  Saved 200 terabytes of government data that might have disappeared.”  –JVK

Books from your archive have been a big help for my genealogical research. –Jim

It is a great initiative and I’ve saved 50 times the amount I donated with this site. I am more than happy to chip in where I can.  –Em

Hiding society’s collective knowledge away behind copyright paywalls is an enormous problem.  I want to thank you for making it more accessible, and support your effort.       –Jerry

For the love of this site over the years and the continual availability of great live music by incredible musicians…and their fans! — A Fan

Want to hear programs I’ve missed on KPFA Radio, especially Democracy Now!                –Jeanne

KBCA was a Los Angeles radio station in my youth.  Just being able to hear a program broadcast again is a rare, but welcome treat to me.  –Byron

Dear Internet Archive, I have been using this service for years and found all the classic scientific work from the leading scientists of the 20th century. I could not have found these books anywhere else, not even big university libraries.  Excellent efforts,  –Farooq

Your service is extraordinary, necessary and a gift.  Thank you. — Bella

A small gift can go a long way if everyone does so. –Richard

For some extremely obscure but historically important publications, I can now download, thanks to you and similar digitising archives, a digital copy straight to my computer from Melbourne, Australia. Only a decade or two ago, I would have needed to locate one of the few copies that exists, travel to the library in which the copy was held, and sit in a reading room taking notes, or taking one-off photocopies, still not keyword searchable.  Thanks so much.  –Kale

I believe knowledge needs to be preserved, and shared.  Censorship whether it is by government, corporations or individuals, through legislation, fascist threats or economic censorship (ie youtube)  is a detriment to all humankind.  — TD

I get both pleasure and (sometimes) insight from being able to read texts from the original editions, where the visual impact of the content is what the author most probably intended.  One cannot get this in any other way, apart from the very occasional good luck to be given (or to purchase) old editions. The Internet Archive is a wonderful resource.  — Jeremy

Thank you for keeping Venketaramana’s talk on Bhagavad Geetha…it changed my way of thinking. I am still growing. — Nunu

Born in the 1960s but stumbled on old radio as a kid—these shows are my Prozac.  Thank you.  –Julie

I’ve listened to Theatre Five, and MindWebs and other Old Time Radio for years now and couldn’t afford donate. I would push the donate button and turn back every time. I finally can now, and I will again.  –Derrick

I listen to Libra Vox and benefit from finding books online for free in family history work.  You are very valuable.  I appreciate donations that don’t just go to large overhead in top heavy charities.  Thank You for all you do.  –Janeal

I love learning and am so grateful to have a place to learn anything I want!  Thank you for this gift! –Judi

To the Internet Archive & Open Library Communities:  You are an amazing bunch!  Thank YOU for using knowledge in all its many forms to inspire, learn, and enjoy life more fully.  Here’s to growing,  improving, and sharing even more in 2018.


Posted in Announcements, News | 12 Comments

30 Days of Stuff

Jason Scott, free-range archivist, reporting in as 2017 draws to a close.

As part of our end-of-year fundraising drive, I thought it might be fun to tweet highlighted parts of the vast stacks of content that the Internet Archive makes available for free to millions. A lot of folks know about our Wayback Machine and its 20+ years of website history, but there’s petabytes of media and works available to see throughout the site. I called it “30 Days of Stuff”, and for the last 30 days I’ve been pointing out great items at the Archive, once a day.

You won’t have to swim upstream through my tweets; here on the last day, I’ve compiled the highlighted items in this entry. Enjoy these jewels in the Archive’s collection, a small sample of the wide range of items we provide.

Books and Texts

  • The Latch Key of my Bookhouse was one of the first books scanned by the Internet Archive in its book scanner tests, and it’s a 1921 directory of Children’s Literature that is filled with really nice illustrations that came out great.
  • As part of our ever-growing set of Defense Technical Information Center collection, we have The Role of the Citizens Band Radio Service and Travelers Information Stations In Civil Preparedness Emergencies Final Report, a 1978 overview of CB Radio and what role it might play in civil emergencies. Many thousands of taxpayer-funded educational and defense items are mirrored in this collection.
  • Also in the DTIC collection is The Battalion Commander’s Handbook 1980, which besides the crazy front page of stamps, approvals and sign-offs, is basically an manager’s handbook written from the point of view of the US Army.
  • There are hundreds of tractor manuals at the Archive. Hundreds! Of all types, languages (a lot of them Russian) and level of information. Tractors are one of those tools that can last generations and keeping the maintenance on them in the field can make a huge difference in livelihood.
  • A lovely 1904 catalog for plums called The Maynard Plum Catalogue was scanned in with one of our partner organizations and it’s a breathless and inspiring declaration of the future wonder of the plums this wizard of plum-growing, Luther Burbank, was bringing to the world.
  • Xerox Corporation released “A Metamorphosis of Creative Copying” in 1964, which seems to function as both promotion for Xerox and a weird gift to give to your kids to color in.
  • In 2014, a short zine called The Tao of Bitcoin was released, telling people the dream of $10,000 bitcoin would be real.
  • The 1888 chapbook Goody Two-Shoes has lovely illustrations, and a fine short story.
  • Working with a lovely couple who brought in a 1942 black-owned-businesses directory, I scanned the pages by hand and put them up into this item.
  • Inside that directory was an ad for a school of whistling that said it taught using the methods of Agnes Woodward, and a quick scan of the Archive’s stacks showed that we had an entire copy of her book Whistling as an Art!
  • The medical treatise Sleep and Its Derangements, from 1869, is William A. Hammond, MD’s overview of sleep, and what can go wrong. Scanned from the Francis A. Countway Library of Medicine, it’s one of many thousands of books we’ve scanned with partners.
  • Let Hartman Feather Your Nest could be described as “A furniture catalog” in the same way the Sistine Chapel could be described as “a place of worship”. The catalog is a thundering, fist-pounding declaration of the superiority of the Hartman enterprise and the quality and breadth of furniture and service that will arrive at your door and be backed up to the far reaches of time.


  • Photoplay considered itself the magazine for the motion picture industry in the first part of the 20th century, and this multi-volume compilation of photos, articles and advertisements is a truly lovely overview.
  • There’s over 140 issues of the classic Maximum RockNRoll zine, truly the king of music zines for a very long time. On its newsprint pages are howls and screeches of all manner of punk, rock and the needs of musicians.
  • A magazine created by the Walt Disney Company to trumpet various parts of Disneyland and its attractions was called Vacationland, and this Fall 1965 issue covers all sorts of stuff about the park’s first decade.


  • Rescued from a warehouse years ago, a collection of Hollywood movie “B-Roll”, unused secondary scenes often filmed by different crew, has been digitized. My personal favorite is [Western Film Scenes], which is circa 1950s footage of a Western Town, all of it utterly fake but feeling weirdly real, to be used in a western. Don’t miss everyone standing around looking right at you and looking like they agree quite energetically with you!
  • No compilation could be complete without the legendary Duck and Cover, a cartoon/PSA that explained the simple ways to avoid injury in a nuclear blast. Just lie down! It’ll be fine. Please note: This Probably Won’t Work. But the song is very catchy.
  • The very weird Electric Film Format Acid Test from 1990 has a semi-interested model holding up a color bar plate in a wide, wide variety of film and video formats. Filmed just a few blocks away from the Internet Archive’s current headquarters.
  • I snuck in a 1992 interview with the Archive’s founder, Brewster Kahle, back when he was 33 and working at WAIS, a company or two before the Archive and where he is asked about his thoughts on information and gathering of data. It’s quite interesting to hear the consistency of thought.
  • The Office of War Information worked with Disney to create “Dental Health“, a film to show to troops about proper dental care. It’s a combination of straightforward animation and industrial film-making worth enjoying.


  • We have a collection of hours of the radio show The Shadow from 1938-1939, starring  Orson Welles at 23, at the height of his performance powers, playing the dual main role.
  • For Christmas Eve, we pointed to “Christmas Chopsticks”, a 1953 78rpm record of “Twas the Night Before Christmas” performed to the tune of the classic piano piece “Chopsticks”; one of tens of thousands of 78rpm records the Archive has been adding this year.
  • On Christmas, a user of the Archive uploaded two obscure albums he’d purchased on eBay – remnants of the S. S. Kresge Company, which became K-Mart, and which were played over the PA system for shoppers. He got his hands on Albums #261 and #294.
  • Earlier in the month before the user uploaded those Christmas albums, I linked to a different holiday collection of K-Mart items, a 1974 Reel-to-Reel that started with a K-Mart jingle and went full holiday from there.
  • Before he was a (retired) talk show host, and before he was a stand-up comedian, David Letterman worked and trained in radio. Happily, we have recordings of Dave Letterman, DJ, from when he was 22, at Ball State University.
  • Ron “Boogiemonster” Gerber has been hosting his weekly pop music recycling radio show, “Crap from the Past”, for over 25 years, and he’s been uploading and cataloging his show to the Archive for well over 10 of those years, including all the way back to the beginning of his show. The full Crap From The Past archive is up and is hundreds of hours of fun.
  • The truly weird “Conquer the Video Craze” is a 1982 record album with straightforward descriptions of how to beat games like Centipede, Defender, Stargate, Dig Dug, and more. This album has been sampled from by multiple DJs to bring that extra spice to a track.
  • Over 3,000 shows at the DNA Lounge are at the archive, including “Bootie: Gamer Night“, which combines mash-up tracks and video games. Bootie has been playing at DNA Lounge for years, and puts the audio from one song with the singing from another, and… it’s quite addicting, like games. This night was for the nearby Game Developers’ Conference being held the same week.


  • In 2011, as part of a “retrocomputing” competition, we saw the release of “Paku-Paku”, a pac-clone program which ran in an obscure early PC-Compatible graphics mode that was very colorful and very small (160×100) and was built perfectly for it. You can play the game in your browser by clicking here.
  • Psion Chess is a game for the Macintosh that can play both you and itself with pretty high levels of skill and really sharp and crisp black and white graphics.  It makes a really great screensaver in self-playing mode.

People often overuse a phrase like “Barely scratched the surface”, but I assure you there are millions of amazing items in the archive, and it’s been a pleasure to bring some to light. While the 30 Days of Stuff was a fun way to stretch out a month of fundraising with stuff to see every day, we’re here 24/7 to bring you all these items, and welcome you finding jewels, gems and clunkers throughout our hard drives whenever you want.

Thanks for another year!

Posted in Announcements, Cool items, Emulation, Image Archive, Movie Archive, Music, Software Archive, Video Archive, Wayback Machine - Web Archive | 8 Comments

TV News Record: The year in TV news visualizations

Thanks for being part of our community at the TV News Archive. As 2017 draws to a close, we’ve chosen six of our favorite visualizations using TV News Archive data. We look forward to assisting many more journalists and researchers in what will likely be an even more tumultuous news year. 

The New York Times: Mueller indictments

The New York Times editorial page used our Third Eye chyron collection to produce an analysis of TV news coverage of major indictments of Trump campaign officials by special counsel Robert Mueller: “The way each network covered the story – or avoided it – is a sign of how the media landscape has become ever more politicized in the Trump era. ”

credit: Taylor Adams, Jessia Ma, and Stuart A. Thompson, The New York Times, “Trump Loves Fox & Friends,” November 1, 2017.

FiveThirtyEight: hurricane coverage

Writing for, Dhrumil Mehta demonstrated that TV news broadcasters paid less attention to Puerto Rico’s hurricane Maria than to hurricanes Harvey and Irma, which hit mainland U.S. primarily in Texas and Florida. Mehta used TV News Archive data via Television Explorer.

credit: Dhrumil Mehta, “The Media Really Has Neglected Puerto Rico,” FiveThirtyEight, September 28, 2017.

TV News Archive: face-time for lawmakers

Using our Face-o-Matic data set, we found that Sen. Majority Leader Mitch McConnell, R., Ky., gets the most face-time on cable TV news, and MSNBC features his visage more than the other networks examined. Fox News features the face of House Minority Leader Nancy Pelosi, D., Calif., more than any other cable network.

Vox:  Mueller’s credibility

Vox’s Alvin Chang used Television Explorer to explore how Fox News reports on Mueller’s credibility. This included showing how often Fox news mentioned Mueller in the context of former presidential candidate Hillary Clinton.

Alvin Chang, “A week of Fox News transcripts shows how they began questioning Mueller’s credibility,” Vox, October 31, 2017.

The Trace: coverage of shootings

Writing for The Trace, Jennifer Mascia presented findings from Television Explorer showing how coverage of shootings declines rapidly: “Two days after 26 people were massacred in a Texas church, the incident — one of the worst mass shootings in American history — had nearly vanished from the major cable news networks.”

Credit: Jennifer Mascia, “Data Shows Shrinking Cable News Cycles for This Fall’s Mass Shootings,” The Trace, December 5, 2017.

The Washington Post: What TV news networks covered in 2017

Philip Bump of The Washington Post crunched Television Explorer data to look at coverage of eleven major news stories by five national news networks. Here’s his visualization of TV news coverage of “sexual assault,” which shows how coverage increased at the end of the year as dozens of prominent men in media, politics, and entertainment were accused of sexual harassment or assault.

Philip Bump, “What national news networks were talking about during 2017, The Washington Post, December 15, 2017.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

Posted in Announcements, News, Television Archive | Tagged , , , , , , , , , , , , , , , , , | 2 Comments

Pineapple Fund Gifts $1M in Bitcoin to the Internet Archive!

Thank you Pineapple Fund

This year, Christmas came early to the Internet Archive. On Saturday, the generous philanthropist behind the Pineapple Fund gave $1 million dollars in Bitcoin to the Internet Archive. This anonymous crypto-philanthropist explains, “I saw the promise of decentralized money and decided to mine/buy/trade some magical internet tokens. …Donating most of it to charity is what I’m doing.” We so admire this donor using Bitcoin as the currency of giving this season, and are honored to be the recipients of such a gift. Whoever you are, you are doing a world of good. Thank you.

Permanent access to websites, software, books, music…that is our mission. These gifts help make it happen.

It is especially gratifying to see those who dreamed big about Bitcoin using their wealth to support innovation across the world. The Internet Archive has been actively involved in this community since 2011. Encouraged, we are dreaming big and dreaming open.

Thank you to Pineapple Fund and thank you to the thousands that have contributed already.

Please consider donating this year— it really helps. Looking forward to 2018!

More information about the Pineapple Fund:
Original Reddit Post
Reddit Post Update
Pineapple Fund Interview with Hacker Noon
Pineapple Fund Interview with Bitcoin Magazine

Posted in Announcements, News | 20 Comments

TV News Record: Face-o-Matic on Trump, McConnell, and Pelosi; PolitiFact picks “2017 Lie of the Year”

A biweekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman.

This week we take a dive into nearly six months of Face-o-Matic facial recognition data. We also display the news clips behind PolitiFact’s top picks for lies and misstatements of the year, most of them seen and heard on TV news.

Face-o-Matic reveals cable news persistent patterns

With nearly six-months of data available, we’re finding certain persistent patterns on how cable networks make editorial choices in displaying the faces of President Donald Trump and top congressional leaders on TV screens.

First, Trump trumps the congressional leaders for on-screen face time, by many degrees of magnitude.


Second, of the four congressional leaders, Sen. Majority Leader Mitch McConnell, R., Ky., gets the most face-time on TV, and MSNBC features his visage more than the other networks examined. Fox News features the face of House Minority Leader Nancy Pelosi, D., Calif. more than any other cable network.

Face-o-Matic, an experimental service, developed in collaboration with the start-up Matroid, tracks the faces of selected high level elected officials on major TV cable news channels: CNN, Fox News, MSNBC, and the BBC.  Face-o-Matic data is available to the media, researchers, and the public, updated daily here. Stay tuned: we are in the process of testing new faces to add to Face-o-Matic: living past presidents and recent major political party nominees.

PolitiFact announces “2017 Lie of the Year”

Our fact-checking partner PolitiFact has announced its annual Lie of the Year: President Donald Trump’s statement in May 2017 to NBC’s Lester Holt, “This Russia thing…is a made-up story.” Below is the interview on the Internet Archive’s TV News Archive.

Writes PolitiFact editor, Angie Drobnic Holan:

In both classified and public reports, U.S. intelligence agencies have said Russian President Vladimir Putin ordered actions to interfere with the election. Those actions included the cyber-theft of private data, the placement of propaganda against particular candidates, and an overall effort to undermine public faith in the U.S. democratic process.

Trump’s statement about Russia was both the PolitiFact’s editors pick for Lie of the Year, and the top one chosen by PolitiFact’s readers in an online poll by “an overwhelming margin.” Readers also called out other fact-checks to highlight for the year, choosing from a list of ten from the editors, or writing in their own nominations.

The top vote-getter after Trump’s statement on Russia was Idaho Republican Rep. Raul Labrador’s claim that “Nobody dies because they don’t have access to health care.” “While the exact number of deaths saved by having health insurance is uncertain, the researchers we contacted agreed that the number is higher than zero–probably quite a bit higher,” writes PolitiFact reporter Louis Jacobson, who rated the claim “Pants on Fire.”

In third place was Sean Spicer’s claim, “that was the largest audience to witness an inauguration, period,” referring to the presidential inauguration in January 2017. PolitiFact at the time rated that claim “Pants on Fire”: “Spicer suggested 720,000 attended Trump’s inauguration, while organizers said they expected 700,000 to 900,000, and Trump himself estimated 1.5 million. All of those figures are less than the 1.8 million people who attended Obama’s 2009 inaugural.”

To see more video of fact-checks chosen by PolitiFact’s readers and editors as lies or misstatements of the year, see this list, which includes links both to TV news clips of public officals making statements on the air, along with links to fact-checks of those statements.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.



Posted in Announcements, News, Television Archive | Tagged , , , , , , , , , , , , | 5 Comments

Dreaming Open, Dreaming Big

Disheartened by anti-NetNeutrality moves in the US, but inspired by the reddit poster offering to donate $87million in bitcoin (5057BTC) to good causes (including the awesome EFF)?

It got us dreaming open and dreaming big: What could we do if the open world had lots of money, and specifically what could the Internet Archive do with it?

$100m (5930 BTC)
Bring 4 million books to billions of people and improve the quality of information on the internet for everyone.


$7m (415 BTC)

Rebuild the building next to ours to house interns, fellows, and researchers to create the next generation of open.


$12m (711 BTC)

Bring a million songs from the 78rpm era to the internet by fueling the Great 78 Project.



$5m (296 BTC)
Digitize and preserve the sound archives of the Boston Public Library, 250,000 discs (LP’s, 78’s, and CD’s), to make a public library of music for all.



$52.5M (3113 BTC)
Preserve and keep available our existing 35 petabytes of data forever. Based on a study by David Rosenthal of LOCKSS project at Stanford University, the forever cost of storing a terabyte is $1500 USD. Just long term preservation of the 200TB of US government information from the end of the last administration is $300k (17 BTC).

$15m (889 BTC)
Pay all Internet Archive employees with bitcoin for 2018.



physical archive$10m (593 BTC)
Buy the next building to store the millions of books records and films being donated to the Internet Archive. We are now filling up our current two buildings.



$4m (237 BTC)

Launch the Decentralized Web as a project to build a more private, reliable, and flexible web.


In fact, we are working towards all of these projects. But they can go much faster with donations and interest on the part of you, our Internet Archive supporters. Contributions of all sizes make a huge difference. The average donation this year is about $20. Together we can build a robust open world.

Posted in Announcements, News | 2 Comments