MIT Press Classics Available Soon at Archive.org

For more than eighty years, MIT Press has been publishing acclaimed titles in science, technology, art and architecture.  Now, thanks to a new partnership between the Internet Archive and MIT Press, readers will be able to borrow these classics online for the first time. With generous support from Arcadia, this partnership represents an important advance in providing free, long-term public access to knowledge.

“These books represent some of the finest scholarship ever produced, but right now they are very hard to find,” said Brewster Kahle, founder and Digital Librarian of the Internet Archive. “Together with MIT Press, we will enable the patrons of every library that owns one of these books to borrow it online–one copy at a time.”

This joint initiative is a crucial early step in Internet Archive’s ambitious plans to digitize, preserve and provide public access to four million books, by partnering widely with university presses and other publishers, authors, and libraries.  The Internet Archive is one of eight groups named semi-finalists in 100&Change, a global competition for a single $100 million grant from the John D. and Catherine T. MacArthur Foundation. The competition seeks bold solutions to critical problems of our time.

MIT Press’ Kelly McDougall (l) and Editor, Amy Brand, holding one of the publisher’s classic books.

MIT Press Director, Amy Brand said, “One of my top ambitions for the MIT Press has been to ensure that our entire legacy of publications is digitized, accessible, searchable, discoverable now and in perpetuity. Partnering with Internet Archive to achieve this objective is a dream come true not only for me and my colleagues at the Press, but also for many of our authors whose earlier works are completely unavailable or not easily accessible.”  

“Lending online permits libraries to fulfill their mission in the digital age, allowing anyone to borrow through the ether copies of works they own,” said Professor Peter Baldwin, co-founder of Arcadia.  “The IA-MIT collaboration is a big step in the direction of realizing a universal library, accessible to anyone, anywhere.”

One of the hundreds of titles coming soon to archive.org

We will be scanning an initial group of 1,500 MIT Press titles at Internet Archive’s Boston Public Library facility, including Cyril Stanley Smith’s 1980 book, From Art to Science: Seventy-Two Objects Illustrating the Nature of Discovery, and Frederick Law Olmsted and Theodora Kimball’s Forty Years of Landscape Architecture: Central Park, which was published in 1973. The oldest title in the group is Arthur C. Hardy’s 1936 Handbook of Colorimetry.

John Palfrey, Head of School at Phillips Academy Andover and well-known public access advocate, described the partnership as “a truly ground-breaking development in open scholarship that I hope will inspire other university presses to follow suit, since so many excellent and important books are effectively out of circulation by virtue of being analog-only in a digital world.”

The Internet Archive has already begun digitizing MIT Press’ backlist, and the first titles will be available at archive.org soon. The entire MIT Press backlist should be available by the end of 2017.

Posted in News | Leave a comment

TV news fact-checked: Gianforte, Gingrich, Pelosi & more

By Nancy Watzman and Katie Dahl

In this week’s round-up from the TV News Archive,  our fact-checking partners declare that Greg Gianforte, now Montana’s U.S. House representative-elect, was the aggressor in a conflict with a reporter; Newt Gingrich spread a conspiracy theory; House Minority Leader Nancy Pelosi stretched claims about how veterans could be hurt under the House GOP health care bill; and White House budget director Mick Mulvaney double-counted money.

Claim: Guardian reporter’s aggression, not Gianforte’s, caused altercation (flip that)

On May 24 a campaign spokesperson for Greg Gianforte, who has since won the Montana U.S. House race, said, “Tonight, as Greg was giving a separate interview in a private office, The Guardian’s Ben Jacobs entered the office without permission, aggressively shoved a recorder in Greg’s face, and began asking badgering questions. Jacobs was asked to leave. After asking Jacobs to lower the recorder, Jacobs declined. Greg then attempted to grab the phone that was pushed in his face. Jacobs grabbed Greg’s wrist, and spun away from Greg, pushing them both to the ground. It’s unfortunate that this aggressive behavior from a liberal journalist created this scene at our campaign volunteer BBQ.”

As reported by John Kruzel and Smitha Rajan for PolitiFact, a Fox News reporter was in the room at the time and gave this account. “…Gianforte grabbed Jacobs by the neck with both hands and slammed him into the ground behind him. Faith, Keith and I watched in disbelief as Gianforte then began punching the reporter.” Gianforte has since apologized.

Claim: DNC staffer assassinated after giving emails to WikiLeaks (unsupported)

Newt Gingrich, a former Republican House Speaker, said in a TV interview, “we have this very strange story now of this young man who worked for the Democratic National Committee, who apparently was assassinated at 4 in the morning, having given WikiLeaks something like 23,000. I’m sorry, 53,000 emails and 17,000 attachments.”

“Gingrich Spreads Conspiracy Theory,” read a headline from FactCheck.org. Eugene Kiely reported “there’s no evidence for his claim.” PunditFact, a project of PolitiFact, gave Gingrich its worst fact-check rating, Pants on Fire.  Lauren Carroll reported, “Hours after Fox published its report, (Rod) Wheeler recanted. He told CNN that he hadn’t seen the evidence himself, and his knowledge of Rich’s alleged email contact with WikiLeaks came from the national Fox News reporter, not his own investigative work.”

(Note: Kiely also made use of the Wayback Machine in his piece, linking to a now-deleted Fox News story now saved at the Internet Archive. Washington Post reporters Kristine Phillips and Peter Holley published similar links in their story on how Fox News retracted its story on Seth Rich.)

Claim: seven million veterans will lose tax credit for their families in health care bill (three Pinocchios)

During a speech at a conference hosted by the Center for American Progress, Rep. Nancy Pelosi, D., Calif., said of the House-passed GOP health care reform bill, “Seven million veterans will lose their tax credit for their families in this bill.”

Michelle Ye Hee Lee reported for The Washington Post’s Fact Checker that “veterans ‘could’ — not ‘will,’ as Pelosi says — lose tax credits if the current protections don’t carry over under a new health law… Would it affect 7 million veterans and their families? Not necessarily.”

Claim: economic growth will pay for both eliminating the deficit and tax cuts (wait a minute)

In a press conference about President Trump’s proposed 2018 fiscal budget, White House budget director Mick Mulvaney said “we get to an actual balance on this budget within the 10-year window,” because “we will bring back 3% economic growth to this country and those numbers are assumed in this budget. By the way if we don’t the budget will never balance. You will never see a balanced budget again. We refuse to accept that the new normal in this country. Three percent was the old normal. Three percent will be the new normal again under the Trump administration and that is part and parcel with the foundation of this budget.” Treasury Secretary Steve Mnuchin also claimed economic growth would pay for the proposed revenue-neutral tax plan, “This will pay for itself with growth and with reduced — reduction of different deductions and closing loopholes.”

“Wait a minute, say tax and budget experts, that’s double-counting the same money,” reported Robert Farley of FactCheck.org. Roberton Williams of the Tax Policy Center told FactCheck.org that you can’t assume growth will both balance the budget and offset tax cuts. “Both of those are not plausible,” he said.

Claim: Manafort and others visited Moscow during the campaign (mostly false)

In a TV interview, Rep. Maxine Waters, D., Calif., said “I really do believe that much of what you saw coming out of Trump’s mouth was a play from Putin’s playbook… I think you can see visits, you know, to Moscow made during the campaign by (Paul) Manafort and others.”

“From what’s on the public record, Manafort didn’t go at all, and (Carter) Page went once… Waters made it sound like this was a regular occurrence. We rate this claim Mostly False,” Jon Greenberg reported for PolitiFact.

Claim: Wisconsin high-risk pool had 8 or 9 plans, people could go to any doctor, and premiums and copays were cheaper than Obamacare (half true)

In response to criticism from Democrats of the House-passed health care proposal, Rep. Paul Ryan, R., Wis., said “In Wisconsin, we had a really successful high-risk pool. Ten percent of the people in the individual market in Wisconsin were in the state high-risk pool. They had eight or nine plans to choose from. They could go to any doctor or any hospital they wanted. And their premiums and copays were cheaper than they are under Obamacare today.”

For PolitiFact, Tom Kertscher reported “He’s essentially on target on the first two parts, but not on the third… it can’t be flatly stated that the high-risk pool plans were cheaper than Obamacare plans for comparable coverage.”

To receive the TV News Archive’s email newsletter, subscribe here.

Posted in Television Archive | Leave a comment

Internet (Film) Archive – A Screening: Monday June 5 at 7 pm

Join us for an evening of fun, nostalgia and learning with a screening of the rarest, corniest and weirdest films from the Internet Archive’s collection of Educational Media. This curated screening of digitized and 16mm films will also include favorites as voted by IA users and staff.

RSVP at eventbrite.com

Browse the collection at archive.org/details/educationalfilms.

Nominate your favorite films at https://www.surveymonkey.com/r/WZFS2MD

Posted in Movie Archive, News | Leave a comment

Re: User account breach

The FBI helpfully told us that they found a copy of the Archive’s user database, dated prior to 2012, during one of their investigations. This database did not have much information that is not on the website, but it had lightly encrypted passwords of the users at the time. We have since upped the encryption level.

We have not noticed any uptick in compromised account activity at the Archive, so we’d bet against past malicious use. We will be emailing all Archive patrons who held accounts prior to 2012 with much of the same information you see here.

We are sorry for this inconvenience.

Posted in News | 2 Comments

TV news fact-checked: Comey, Schumer, McMaster, Mueller

It was yet another extraordinary week in U.S. politics, with a series of explosive news reports centering on President Donald Trump. The TV News Archive is saving history as it happens, as well as linking relevant fact-checks by FactCheck.org, PolitiFact, and The Washington Post’s Fact Checker to statements by public officials.

On Sunday shows, Schumer demands release of tapes–if they exist

Senate Minority Leader Charles “Chuck” Schumer, D., N.Y., made the rounds of Sunday news talk shows, appearing on “Meet the Press” and “State of the Union,” calling for a special prosecutor to investigate possible connections between the Trump campaign and Russia, among other matters. In this clip, Schumer says Trump should turn over tapes, if they exist, of the president’s conversations with now-former FBI director James Comey. Trump raised the possibility of such tapes in a tweet on May 12.

In this piece titled “Trump vs. Comey,” FactCheck.org reporters Eugene Kiely and Robert Farley trace the history of statements by the president and Comey about their discussions. They note, “White House Press Secretary Sean Spicer has repeatedly refused to answer whether Trump has such recordings. In his interview with Jeanine Pirro, Trump said, ‘Well, that I can’t talk about. I won’t talk about that.’”

McMaster reacts to report that Trump shared intelligence with Russians

After The Washington Post reported, on May 15, that Trump had revealed “highly classified information” to Russian envoys visiting the White House last week, national security adviser H.R. McMaster defended the president that day and at a press conference the following day. Among his assertions: “The story that came out tonight as reported is false.”

“The key phrase is ‘as reported,’” wrote Glenn Kessler for The Washington Post’s Fact Checker, in a piece that dissects McMaster’s statements before the press. “With this language, McMaster in theory could dispute any element, no matter how small, as false. He notably did not say the story was false.” John Kruzel, writing for PolitiFact, traced the “shifting” explanations from the White House on what happened at the meeting with the Russians, including McMaster’s statements.

Former FBI director Robert Mueller appointed special counsel

Wednesday, May 17 brought the news that the U.S. Department of Justice appointed Robert Mueller as a special counsel to investigate possible connections between Trump’s 2016 campaign and Russia. Here PolitiFact reporter Lauren Carroll gives the basics on Mueller’s background and experience.

The TV News Archive contains numerous historical clips of Mueller, who served as FBI director under  Presidents George W. Bush and Barack Obama, including this brief farewell interview he gave to ABC in 2013, where he talks about terrorism.

Mueller and Comey have an earlier association at a high-drama moment in U.S. history. In 2014, Comey told “60 Minutes” about the day that he and Mueller visited a bedridden John Ashcroft, then attorney general, to tell him they would resign rather than reauthorize a controversial domestic surveillance program under pressure from the White House. Ashcroft deferred to Comey, and, as recounted by The Los Angeles Times, “It was only when President George W. Bush agreed to listen to Comey and Mueller and restructure the program did resignation plans go away.”

To receive the TV News Archive’s email newsletter, subscribe here.

Posted in News, Television Archive | Leave a comment

“And the Webby Award for Lifetime Achievement Goes to….”

“The Internet Archive…is building a home for Universal Access to All Knowledge, open to everyone, everywhere, to use as they like. Open to all societies of the future that care to build on our triumphs and learn from our mistakes.”

– Lawrence Lessig

Last night in New York City, we put on our best duds and donned our fanciest archivist hats for a once-in-a-lifetime event. The Internet Archive was honored with a Lifetime Achievement Award at the 21st annual Webbys, hailed by the New York Times as “one of the Internet’s highest honors.” The Webby Awards lauded the Internet Archive for being “the web’s most knowledgeable historian.”

Three of our veteran staff members accepted the award: Tracey Jaquith, TV Archive Architect; Brewster Kahle, Internet Archive founder and Digital Librarian; and Alexis Rossi, Director of Media and Access. Kahle delivered the five-word acceptance speech with panache: “Universal Access to All Knowledge.”

Perhaps the greatest honor of the evening came in the form of a video narrated by Open Knowledge champion, Lawrence Lessig.  He said, “Creativity and innovation built on the past.  The Internet Archive is the foundation preserving that past, so that perhaps, one can at least hope that our children and their children can shape a future that knows our joys and learns from our many mistakes.”

The award was presented by Nancy Lublin, CEO of the Crisis Text Line and DoSomething.org, who pointed out that in this chaotic political year, the Internet Archive has saved “200 terabytes of government data that could have otherwise been lost in the transition from blue light saber to red light saber.”

The award reads:

Webby Lifetime Achievement: Archive.org for its commitment to making the world’s knowledge available online and preserving the history of the Internet itself. With a vast collection of digitized materials and tools like the Wayback Machine, Archive.org has become a vital resource not only to catalogue an ever-changing medium, but to safeguard a free and open Internet for everyone.

The complete list of Webby Award winners is available here.

Posted in Announcements, News | 13 Comments

TV news fact-checked: Comey edition

We devote this week’s edition of the TV News Archive roundup to the controversy that’s erupted surrounding President Donald Trump’s sudden announcement on Tuesday, May 9, that he was firing FBI director James Comey. The TV News Archive provides a wealth of material for exploring media coverage of this major moment in U.S. history.

Comey fame tied to Clinton and Trump

Comey may still not quite be a household name, but mention of “Comey” spiked higher than ever on TV newscasts this week after he was fired. Comey has enjoyed notoriety in the past, his biggest moments tied closely to the fates of 2016 presidential rivals Hillary Clinton and Trump.

The most recent spike before this week was on March 20, when he testified before Congress, confirming that the FBI was investigating possible ties between the Trump campaign and Russia. Another major spike occurred in November 2016, days before the election, when Comey announced the FBI was reopening an investigation into then-Democratic candidate Hillary Clinton’s use of a private email server for official business while serving as secretary of state. Comey also garnered attention in July 2016, when he announced that the FBI would not be pursuing charges against Clinton.

The visual below, showing mentions of “Comey,” was created with Television Explorer, an online tool fueled by TV News Archive data and created by Kalev Leetaru. This tool can be used to find patterns in words and phrases captured by closed captioning and contained in the TV News Archive.

Source: Television Explorer, Kalev Leetaru

Trump’s letter to Comey fact-checked

In the hours following the firing, one major point of focus for fact-checkers and other media was the portion of the letter to Comey where Trump stated, “While I greatly appreciate you informing me, on three separate occasions, that I am not under investigation, I nevertheless concur with the judgment of the Department of Justice that you are not able to effectively lead the Bureau.”

Below is a CNN broadcast, as captured in the TV News Archive, where the CNN newscaster Dave Briggs reads the letter on the air.

PolitiFact, The Washington Post‘s Fact Checker, and FactCheck.org have all weighed in on the president’s assertion, noting that too much remains unknown to confirm it. “With Comey out, it’s unclear whether the public will ever learn if the FBI was investigating Trump personally, rather than just his associates — or anything else about the investigation, for that matter,” wrote PolitiFact’s Lauren Carroll on May 11. (See fact-checks connected to televised statements by public officials here.)

Meanwhile, the story continues to unfold. On May 11, Sarah Huckabee Sanders, deputy White House press secretary, told ABC’s George Stephanopoulos that the president had confirmed this assertion with her directly. Trump himself told NBC News’ Lester Holt that the assurances came during a private dinner and twice over the phone. On Friday morning, Trump tweeted that Comey “better hope there are no ‘tapes’ of our conversations before he starts leaking to the press!”

Some Watergate history, please

Many commentators this week have noted parallels between Trump’s firing of Comey and the Saturday Night Massacre of 1973, when President Richard Nixon ordered independent special prosecutor Archibald Cox fired during the Watergate investigation; his boss, Attorney General Elliot Richardson, and Deputy Attorney General William Ruckelshaus, both of whom refused to fire Cox, resigned in protest. Acting head of the Department of Justice, Robert Bork, carried out the order to fire Cox. (Note: the Richard Nixon Library playfully, but accurately, fired off a tweet noting that Nixon had never fired an FBI director, and then later was criticized for doing so by the National Archives and Records Administration, the federal agency that administers presidential libraries.)

While the TV News Archive’s collection of 1.3 million TV news shows dates back to 2009, long after the Nixon era, some footage from that time is available from later airings. Here, for example, is footage of Cox’s press conference right before he was dismissed.

And here is a quick explainer of the Saturday Night Massacre, as broadcast by MSNBC in 2013.

Searching Trump Archive for past Trump statements about Comey

The largely hand-curated Trump Archive, a collection of Trump statements and appearances on TV news broadcasts, makes it easier to find past instances of Trump talking about Comey. The TV News Archive is working on ways to make the creation of such collections less labor-intensive, by using machine learning tools to identify instances of public officials speaking within the collection of 1.3 million TV news shows.

A search of closed captions on the terms “Trump” and “Comey” would yield both instances when Trump is speaking about Comey and newscasters who are reporting on the two men. But searching within the Trump Archive quickly yields Trump statements about Comey.

Here is some of what we found:

April 28, 2016: Trump says “I think if [Comey’s]  straight up she’s not going to be able to run.”

June 13, 2016: Trump talking about FBI investigation of Orlando nightclub shooting, “I’m a big fan of the FBI, there’s no bigger fan than me, but look they’ve seen better days. Let’s face it.”

October 13, 2016:  Trump speaking about Comey, “The great men and women who work for the FBI are embarrassed and ashamed of what he has done to one of our truly great institutions, the FBI itself.”

October 20, 2016: Trump at Al Smith Dinner, joking at an annual fundraiser for Catholic charities:  “I’d like to address an important religious matter, the issue of going to confession. Or, as Hillary calls it, the Fourth of July weekend with FBI director Comey.”

October 29, 2016: Following Director Comey’s letter to congressional leaders about newly discovered Clinton emails, Trump says, “I have to tell you, I respect the fact that Director Comey was able to come back after what he did. I respect that very much.”

November 14, 2016: Trump won’t say if he will ask Comey to resign.  “I think that I would rather not comment on that yet. I don’t– I haven’t made up my mind. I respect him a lot. I respect the FBI a lot.”

To receive the TV News Archive’s email newsletter, subscribe here.

Posted in Television Archive | 1 Comment

Listening to the 78rpm Disc Collection


By Jessica Thompson, Coast Mastering

The Great 78 Project
A few times a year, I join B. George in the Internet Archive’s warehouses to help sort and pack 78rpm discs to ship to George Blood L.P. for digitization. As a music fan and a professional mastering and restoration engineer, I get a thrill from handling the heavy, grooved discs, admiring the fonts and graphic designs on the labels, and chuckling at amusing song titles. Now digitized, these recordings offer a wealth of musicological, discographic and technical information, documenting and contextualizing music and recording history in the first half of the 20th century.

The sheer scale of this digitization project is unprecedented. At over 15,000 recordings and counting, the value strictly in terms of preservation is clear, especially given the Internet Archive’s focus on digitizing music less commonly available to researchers. Music fans can take a deep dive into early blues, Hawaiian, hillbilly, comedy and bluegrass. I even found several early Novachord synthesizer recordings from 1941.

For me, as a researcher and audio restoration engineer, the real goldmine is in the aggregation of discographic and technical metadata accompanying these recordings. Historians can search for and cross-reference recordings based on label, artist, song title, year of release, personnel, genre, and, importantly, collection. (The Internet Archive documents the provenance of the 78rpm discs so that donated collections remain digitally intact and maintain their contextual significance.) General users can submit reviews with notes to amend or add to metadata, and the content of those reviews is searchable, so metadata collection is active. No doubt it will continue to improve as dedicated and educated users fill in the blanks.

Access to the technical metadata offers a valuable teaching tool to those of us who practice audio preservation. For audio professionals new to 78s and curious about how much difference a few tenths of a millimeter of stylus can make, the Internet Archive offers 15,000+ examples of this. Play through the different styli options, and it quickly becomes apparent that particular labels, years and even discs do respond better to specific styli sizes and shapes. This is something audio preservationists are taught, but rarely are we presented with comprehensive audio examples. To be able to listen to and analyze the sonic and technical differences in these versions marries the hard science with the aesthetic.

Playback speeds were not standardized until the late 1920s or early 1930s, and most discs were originally cut at speeds ranging from 76 to 80rpm (and some well beyond). The discs in the George Blood Collection were all digitized at a playback speed of 78rpm. Preservationists and collectors debate extensively about the “correct” speed at which discs ought to be played back, and whether one ought to pitch discs individually. However, performance, recording and manufacturing practices varied so widely that even if a base speed could generally be agreed upon, there will always be exceptions. (For more on this, please check out George Blood’s forthcoming paper “Stylus Size And Speed Selection In Pre-1923 Acoustic Recordings” in Sustainable audiovisual collections through collaboration: Proceedings of the 2016 Joint Technical Symposium. Bloomington, IN: Indiana University Press.)

Every step of making a recording involves so many aesthetic decisions – choices of instrumentation, methods of sound amplification, microphone placement, the materials used in the disc itself, deliberate pitching of the instruments and slowing or speeding of the recording – that playback speed simply becomes one of many aesthetic choices in the chain. As preservationists, we are preserving the disc as an historic record, not attempting to restore or recreate a performance. (Furthermore, speed correction is possible in the digital realm, should anyone want to modify these digital files for their own personal enjoyment.)
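For readers curious about the arithmetic behind such a speed correction, here is a minimal sketch. It is not George Blood L.P.'s workflow, and it assumes the simplest possible model: pitch scales directly with rotational speed. The function names are hypothetical, and the candidate cut speeds are only the 76 to 80rpm range mentioned above.

```python
import math


def pitch_offset_cents(cut_rpm: float, playback_rpm: float = 78.0) -> float:
    """Pitch error, in cents, heard when a disc cut at cut_rpm is played at playback_rpm.

    Positive values mean the transfer sounds sharp (fast); negative means flat (slow).
    Assumes pitch is proportional to rotational speed (an idealization).
    """
    return 1200.0 * math.log2(playback_rpm / cut_rpm)


def correction_ratio(cut_rpm: float, playback_rpm: float = 78.0) -> float:
    """Speed factor to apply to the digital transfer to return to the cut speed."""
    return cut_rpm / playback_rpm


# Compare a few assumed cut speeds against the 78rpm transfer speed.
for rpm in (76.0, 78.0, 80.0):
    cents = pitch_offset_cents(rpm)
    ratio = correction_ratio(rpm)
    print(f"cut at {rpm:.0f}rpm: {cents:+.1f} cents, correction factor {ratio:.4f}")
```

Under this model, a disc cut at 76rpm and transferred at 78rpm plays a little under half a semitone sharp (a semitone is 100 cents), which is why small disagreements about the "correct" speed are audible to trained ears but rarely change the character of a performance.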

How do they sound? Each 78rpm disc has an inherent noise fingerprint based on the frequency and dynamic range the format can replicate (limited, compared to contemporary digital playback formats) and the addition of surface noise from dust, dirt and stylus wear in the grooves. As expected, the sound quality in this collection varies. Some of these discs were professionally recorded, minimally played, stored well, and play back with a tolerable, even ignorable level of surface noise relative to the musical content. Others were recorded under less professional circumstances, and/or were much loved, frequently played, stored without sleeves in basements and attics, and therefore suffer from significant surface noise that can interfere with enjoyment (and study) of the music.

Yet, a compelling recording can cut through noise. Take this 1944 recording of Josh White performing St. James Infirmary, Asch 358-2A. This side has been released commercially several times, so if you look it up on a streaming service like Spotify, you can listen to different versions sourced from the same recording (though almost certainly not from the same 78rpm disc). They play at different speeds, some barely perceptibly faster or slower but at least one nearly a half-step faster than the preservation copy digitized by George Blood L.P. They also have a range of noise reduction and remastering aesthetics, some subtle and some downright ugly and riddled with digital artifacts. The version on the Internet Archive offers a benchmark. This is what the recording sounded like on the original 78rpm disc. Listen to the bend in the opening guitar notes. That technique cuts through the surface noise and should be preserved and highlighted in any restored version (which is another way of saying that any noise reduction should absolutely not interfere with the attack and decay of those luscious guitar notes).

McGill University professor of Culture and Technology Jonathan Sterne wrote a book, The Audible Past: Cultural Origins of Sound Reproduction, that is worth reading for anyone interested in a cultural history of early recording formats, including 78s. As Sterne says, sound fidelity is “ultimately about deciding the values of competing and contending sounds.” So, in listening to digital versions of 78s on the Internet Archive, music fans, researchers, and audio professionals alike engage in a process of renegotiating concepts of acceptable thresholds of noise and what that noise communicates about the circumstances of the recording and its life on a physical disc.

Fortunately, our brains are very good at calibrating to accept different ratios of signal to noise, and, I found, the more I listened to 78rpm recordings on the Internet Archive, the less I was bothered by the inherent noise. Those of us who grew up on CDs or digitally recorded and distributed music are not used to the intrusions of surface noise. However, when listening to historic recordings, we are able to adjust our expectations and process a level of noise that would be ridiculous in contemporary music formats. (Imagine this week’s Billboard Top 100 chart topper, Bruno Mars’s “That’s What I Like,” with the high and low end rolled off, covered in a sheen of crackles and pops). The fact that these 78rpm recordings sound, to us, like they were made in the 1920s, 1930s, 1940s lets them get away with a different scale of fidelity. The very nature of their historicity gets them off the hook.

In analog form, crackles and pops can be mesmerizing, almost like the sound of a crackling fire. However, once digitized, those previously random pops become fixed in time. What may have been enjoyable in analog form becomes a permanent annoyance in digital form. The threshold of acceptable noise levels moves again.

This means that noise associated with recording carriers such as 78rpm discs is almost always preferable to noises introduced in the digital realm through the process of attempted noise reduction. Sound restorationists understand that their job is to follow a sonic Hippocratic oath: do no harm. Though noise reduction tools are widely available, they range in quality (and accordingly in cost), and are merely tools to be used with a light or heavy touch, by experienced or amateur restorationists.

The question of whether noise reduction of the Internet Archive’s 78rpm recordings could be partially automated makes my heart palpitate. Though I know from experience that, for example, auto-declickers exist that could theoretically remove a layer of noise from these recordings with minimal interference with the musical signal, I don’t believe the results would be uniformly satisfactory. It is so easy to destroy the aura of a recording with overzealous, heavy-handed, cheap, or simply unnecessary noise reduction. Even a gentle touch of an auto-declicker or de-crackler will have widely varying results on different recordings.

I tried this with a sampling of selections from the /georgeblood/ collection. I chose eleven songs from different genres and years and ran two different, high-quality auto-declickers (the iZotope RX6 Advanced multiband declicker and CEDAR Audio’s declick) on the 24-bit FLAC files. The results were uneven. Some of the objectively noisier songs, such as Blind Blake’s Tampa Bound, Paramount 12442-B, benefited from having the most egregious surface noises gently scrubbed.

Tampa Bound Flat Transfer vs Tampa Bound Declicked, Dehissed and Denoised
that’s a lot of noise!

However, a song with a strong musical presence and mild surface noise such as Trio Schmeed’s Yodel Cha Cha, ABC-Paramount 9660, actually suffered more from light auto-declicking because the content of the horns and percussive elements registered to the auto-declicker as aberrations from the meat of the signal and were dulled. A pop presents as an aberration across all frequencies. Mapped visually across frequency, time and intensity, it looks like a spike cutting through the waveform. A snare hit looks similar and is therefore likely to be misinterpreted by an auto-declicker unless the threshold at which the declicker deploys is set very carefully. This difference is why good restorationists earn their pay.

Yodel Cha Cha flat transfer and denoised. Notice the “clicks and pops” have been scrubbed,
but so has wanted high end content in the music.
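To make the threshold problem concrete, here is a toy sketch in Python. It is not how iZotope RX or CEDAR work internally (those tools are far more sophisticated and operate across frequency bands); it is a simple time-domain detector I am assuming purely for illustration. The signal, the amplitudes, and the function names are all invented: a single-sample "pop" at index 10 and a "snare-like" hit with a short decay starting at index 30.

```python
from statistics import median


def toy_signal():
    """Build a silent 40-sample signal with one pop and one snare-like hit."""
    samples = [0.0] * 40
    samples[10] = 0.9                       # surface-noise pop: one isolated spike
    samples[30:33] = [0.6, 0.3, 0.15]       # snare-like transient with a decay
    return samples


def detect_clicks(samples, threshold, window=5):
    """Return indices whose deviation from the local median exceeds threshold."""
    half = window // 2
    flagged = []
    for i, value in enumerate(samples):
        neighborhood = samples[max(0, i - half):i + half + 1]
        if abs(value - median(neighborhood)) > threshold:
            flagged.append(i)
    return flagged


def declick(samples, threshold, window=5):
    """Replace every flagged sample with the median of its neighborhood."""
    half = window // 2
    cleaned = list(samples)
    for i in detect_clicks(samples, threshold, window):
        cleaned[i] = median(samples[max(0, i - half):i + half + 1])
    return cleaned


if __name__ == "__main__":
    signal = toy_signal()
    for threshold in (0.3, 0.7):
        print(threshold, detect_clicks(signal, threshold))
```

With the lower threshold the detector flags both the pop and the snare onset, and `declick` flattens the front of the drum hit along with the noise. With the higher threshold only the pop is touched. On a real disc the two kinds of event are not so conveniently separated in level, which is exactly why the setting has to be chosen by ear, recording by recording.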

I am approaching this collection as a listener and music fan, as a researcher, and as an audio professional, three very different modes of listening and interacting with music. In all cases, the Internet Archive 78rpm collection offers massive amounts of music and data to be explored, discovered, enjoyed, studied and utilized. Whether you want to listen to early Bill Monroe tunes, crackles, pops and all, or explore hundreds of recordings of pre-war polkas, or analyze the effects of stylus size on 1930s Victor discs, the Internet Archive provides the raw materials in digital form and, not to be underestimated, preserves the original discs too.

Posted in 78rpm, Audio Archive, Music | Comments Off on Listening to the 78rpm Disc Collection

Over 200 terabytes of the government web archived!

In our December post, “Preserving U.S. Government Websites and Data as the Obama Term Ends,” we described our participation in the End of Term Web Archive project to preserve federal government websites and data at times of administration changes. We wanted to give a quick update on the project — we have archived a heck of a lot of data!

Between Fall 2016 and Spring 2017, the Internet Archive archived over 200 terabytes of government websites and data. This includes over 100TB of public websites and over 100TB of public data from federal FTP file servers totaling, together, over 350 million URLs/files. This includes over 70 million HTML pages, over 40 million PDFs and, towards the other end of the spectrum and for semantic web aficionados, 8 files of the text/turtle MIME type. Other End of Term partners have also been vigorously preserving websites and data from the .gov/.mil web domains.

Every web page we have archived is accessible through the Wayback Machine, and we are working to add the 2016 harvest to the main End of Term portal soon. While we continue to analyze this collection, we have posted some preliminary statistics using the new Wayback Machine’s summary interface for this specific collection; they can be found on the End of Term (EOT 2016) summary stats page. Those and additional stats are served via a public EOT 2016 stats API, and the full collection is also available.
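For readers who want to check programmatically whether a particular government page has a capture, a minimal Python sketch follows. It uses the general Wayback Machine availability endpoint rather than the EOT 2016 stats API mentioned above, whose address is not given here. The helper names are our own for illustration, and the sample response is a hand-written payload for a placeholder domain (`example.gov`), not a real result; treat the response shape as an assumption to verify against the live service.

```python
import json
from urllib.parse import urlencode

# General Wayback Machine availability endpoint (not the EOT stats API).
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_url(page_url, timestamp=None):
    """Build a query URL asking for the capture closest to `timestamp`."""
    params = {"url": page_url}
    if timestamp:
        params["timestamp"] = timestamp  # YYYYMMDD or longer, e.g. "20170120"
    return AVAILABILITY_ENDPOINT + "?" + urlencode(params)


def closest_snapshot(response_text):
    """Return (snapshot_url, timestamp) from a JSON response, or None."""
    data = json.loads(response_text)
    closest = data.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"], closest["timestamp"]
    return None


# Hand-written illustrative payload; NOT an actual API result.
SAMPLE_RESPONSE = json.dumps({
    "archived_snapshots": {
        "closest": {
            "available": True,
            "url": "http://web.archive.org/web/20170120000000/http://example.gov/",
            "timestamp": "20170120000000",
            "status": "200",
        }
    }
})

if __name__ == "__main__":
    print(availability_url("example.gov", "20170120"))
    print(closest_snapshot(SAMPLE_RESPONSE))
```

To use it against the live service, fetch the URL returned by `availability_url` with any HTTP client and pass the response body to `closest_snapshot`.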

Through the EOT project’s public nomination form and through our collaboration with DataRefuge, the Environmental Data and Governance Initiative (EDGI), and other efforts, over 100,000 webpages or government datasets were nominated by citizens and preservationists for archiving. The EOT and community efforts have also garnered notable press (see our End of Term 2016 Press collection). We are working with partners to provide access to the full dataset for use in data mining and computational analysis and hosted a hackathon earlier this year to support use of the Obama White House Social Media datasets.

While the specific End of Term collection has closed, we continue our large-scale, dedicated efforts to preserve the government web. Working with the University of North Texas, we launched the Government Web & Data Archive nomination form so the public can continue to nominate public government websites and data to be archived.

Lastly, archiving government data remains a critical activity of the preservation community. You can support our role in these efforts by continuing to nominate websites, promoting the EOT project via press and outreach (contact the EOT project team for any inquiries), and donating to the Internet Archive to support our ongoing mission to provide “Universal Access to All Knowledge.”

Posted in Announcements, News | Comments Off on Over 200 terabytes of the government web archived!

TV news fact-checked: Trump’s first 100 days

To mark his 100th day in office, President Donald Trump made several public appearances and released a campaign-style political ad. Here are some fact-checked claims from those televised interviews and speeches, as well as the ad, all viewable and shareable on the TV News Archive.

Fact-checked TV news clips on the TV News Archive. Click, watch, and share: https://archive.org/details/factchecked

Fact-checking Trump’s first 100 days campaign ad

President Donald Trump put his stamp of approval on this ad, which includes several factual claims. Among them: “Fact: 500,000-plus jobs created,” and “America becoming more energy independent,” while the words “Keystone Pipeline” are shown on screen.

“Using just the two months on his watch–February and March–the U.S. economy created 317,000 jobs, not 500,000,” reported Louis Jacobson for PolitiFact.

“Samantha Gross, a fellow at the Brookings Institution’s Energy Security and Climate Initiative… notes that she doesn’t think approving the pipeline was a bad decision…  ‘We’ve made it easier and efficient to ship that crude down to the states,’ though [sic.] the pipeline, ‘but as far as changing energy independence, I don’t think so,’ she said,” Lori Robertson reported for FactCheck.org.

Claim: first time in modern era a Supreme Court justice confirmed in first 100 days (mostly true)

At the National Rifle Association’s annual meeting in Georgia on April 28, Trump said, “For the first time in the modern political era, we have confirmed a new justice in the first 100 days. The last time that happened was 136 years ago, in 1881. Now, we won’t get any credit for this, but don’t worry about it.”

This statement was rated “mostly true” by PolitiFact’s Lauren Carroll. “Trump has his history right. He is indeed the first modern president to fill an open Supreme Court seat within the first 100 days.” She went on to report, “The reality is that very few presidents are presented with the opportunity to appoint and confirm a new Supreme Court justice within their first 100 days… Trump is the only one of the group who entered office with a vacant Supreme Court seat from the start—meaning he had a full 100 days to nominate and confirm his pick.”

Claim: U.S. has a $17 billion trade deficit with Canada (no, it’s a surplus)

President Trump sat down with FOX’s Martha MacCallum for their “The First 100 Days” interview. In it, Trump said, “The trade deficit with Mexico is close to $70 billion, even with Canada it’s $17 billion trade deficit with Canada.”

The reporters at FactCheck.org, though, found that “For the second year in a row, the U.S. had a trade surplus with Canada. In 2016, the U.S. had an $8.1 billion trade surplus in goods and services with Canada, up nearly 33 percent from the $6.1 billion surplus in 2015.”

Claim: Trump has negative media coverage because reporters donated to Clinton (misleading)

The president spoke to a large crowd in Harrisburg, Pennsylvania, where he cited a study by the conservative Media Research Center talking about negative coverage of his administration. He then suggested the reason was “perhaps that’s because, according to the Center for Public Integrity, 96 percent of journalists who made donations in the last election gave them to our opponent.”

FactCheck.org reported that the Center for Public Integrity study showed “that about 430 people ‘who work in journalism’ contributed about $382,000 to the Clinton campaign through August, compared with about 50 journalists who contributed $14,000 to Trump. Notably, however, the study did not find that any journalists responsible for covering the White House, Congress or national politics made political contributions of any kind.”

Claim: people with pre-existing conditions are covered in the House GOP health care bill (unclear)

In an interview for CBS’s Face the Nation, President Trump said, “Pre-existing conditions are in the bill. And I just watched another network than yours, and they were saying, ‘Pre-existing is not covered.’ Pre-existing conditions are in the bill. And I mandate it. I said, ‘Has to be.'”

PolitiFact’s Amy Sherman reviewed and rated this claim “mostly false,” reporting “Overall, the latest proposal seems to weaken existing protections for people with pre-existing conditions, not strengthen them.”

An amendment has been proposed since then, and Glenn Kessler summarized the new proposal for the Washington Post’s Fact Checker, writing, “if the bill ever became law, much would depend on unknown policy decisions by individual states–and then how those decisions are implemented.” The House voted on this proposal yesterday, Thursday, and it passed.

How to find Trump’s 100th-day TV news appearances

To review the 100 days interviews and speeches yourself and grab the clips you want, check out this Fox interview in two parts, the Harrisburg, PA speech, the NRA address, the Face the Nation interview in two parts, the “CBS This Morning” interview in two parts, and Trump’s weekly address.

Trump’s Harrisburg address, as seen in context in the TV News Archive. Note the Trump icon at top left, showing that this program is part of the Trump Archive.

To peruse all the fact-checking work our partners have done on statements made on TV and archived in the TV News Archive, take a look at this table of more than 800 fact-checks of Donald Trump, his administration, and some congressional leaders. This collection will continue to grow as we develop the congressional collections, add more administration official statements, and integrate new statements and reporting.

To receive the TV News Archive’s email newsletter, subscribe here.

Posted in Television Archive | Comments Off on TV news fact-checked: Trump’s first 100 days