Tag Archives: The Washington Post

Vanishing Culture: Q&A with Philip Bump, The Washington Post

The following Q&A between writer Caralee Adams and journalist Philip Bump of The Washington Post is part of our Vanishing Culture series, highlighting the power and importance of preservation in our digital age. Read more essays online or download the full report now.

Philip Bump is a columnist for The Washington Post based in New York. He writes the weekly newsletter How To Read This Chart. He’s also the author of The Aftermath: The Last Days of the Baby Boom and the Future of Power in America.

Caralee Adams: What does it mean for an individual journalist to have their work preserved? Why is it important to have easy access to news stories from the past?

Philip Bump: One of the nice things about my career has been that I’ve worked for outlets that I feel confident are doing their own preservation, like The Washington Post. I’m not particularly worried about losing access to my writing. However, it’s less of a concern for me than it is for other outlets, unfortunately. It is unquestionably the case that I find the Internet Archive useful and use it regularly for a variety of things—both for its preservation of online content and collection of closed captioning for news programs.

Any recent examples of when you’ve found the Internet Archive particularly useful?

I use the search tool on closed captioning more than anything else. The other day I was trying to find an old copy of a webpage. I was writing about Donald Trump’s comments on Medal of Honor recipients. As it turns out, there is not an immediately accessible resource for when Medals of Honor were granted to members of the military. You can see aggregated—how many there are—but you can’t see who was given a medal and when they served. I actually used the Internet Archive to see how the metrics changed between the beginning of Trump’s presidency and by the end of it. I was able to see that there were medals awarded to about 11 people who served during the War on Terror, three who served in Vietnam, and one during World War II. Then, I was able to go back and double check against the Trump White House archive, which is done by the National Archives, and see the people to whom he had given this award. That’s a good example of being able to take those two snapshots in time and then compare them in order to see what the difference was to get this problem solved.

Why is it important for the public to have free public access to an archive of the news for television or print?

It’s the same reason that it’s important, in general, to have any sort of archive: it increases accountability and increases historical accuracy. The Internet Archive is essential at ensuring that we have an understanding of what was happening on the internet at a given point in time. That is not something that is constantly useful, but it is something that is occasionally extremely useful. I do a lot of work in politics and get to see what people are saying at certain points in time, which are important checks and accountability for elected officials.  The public can know what they were saying when they were running in the primary as compared with the general [election]. The Archive allows anyone to be able to get information from websites that are no longer active. If you’re looking for something and you have the old link to Gawker or the old link to a tweet, you can often [find] it archived.  The Internet Archive doesn’t capture everything—it couldn’t possibly do so. But it captures enough to generally answer the questions that need to get answered. There’s nowhere else that does that. There are other archiving sites, but none that do so as comprehensively, or none with an archive that goes back that far.

Download the full Vanishing Culture report.

Has any of your journalism vanished from the public? Do you have any examples where you’ve been looking for something and it’s been missing?

Yes. One of the challenges is that multimedia content has often, in the past, been overlooked. There are old news reports that I’ve been unable to find because they’re on video in the era before there was a lot of accessibility and transcripts. Therefore, yes, there are certainly things like that which come up with some regularity. Also, particularly in the era of 2005 to 2015, there were a lot of independent sites that had useful news reports—particularly since we’re talking about the cast of political characters that have been around in the public eye at that point in time. It’s often the case that it’s hard to track those things down. Or if you’re trying to track down the original source or verify a rumor, you might need to dip into the Archive. There are a lot of sites from that era of “bespoke” blogs that the Internet Archive often captures. 

How does limited access to historical data or previous coverage impact you as a journalist?

It is hard to say, because relatively speaking, I am advantaged by the fact that I live in this era.  If I were doing this in 1990, [I’d use] basically whatever was at the New York Public Library and on microfiche. It is far better than it used to be, but the amount of content being produced is also far larger. It is both a positive and a negative that it is far easier to do that sort of research here from my desk at home than it would possibly have been 30 years ago. In fact, I was working on a project where I relied heavily on a local newspaper in a small town in Pennsylvania that wasn’t available online. I literally had to hire someone in the town to go to the library, find [coverage from] the particular date and the local paper and to get the scans done. It cost me hundreds of dollars, but that was the only way to do it. You can see how getting these things done is problematic and challenging.

When Paramount deleted the MTV News Archive in June, there was a lot of dismay, but some say it was frivolous, disposable, and kind of meant to be thrown away. How do you feel about that?

My first writing gig online was at MTV News in college, so that actually had a personal resonance for me. I was at Ohio State in the early to mid 1990s, and I got this little internship with MTV News. I wrote one piece about this band called The Hairy Patt Band. It ended up on the MTV News website. I was very excited. I haven’t seen that in 30 years. It’s one of those things where I wondered what ever happened to that story or if it exists anywhere, in any form. So, that [news] actually had resonance. It’s a bummer. Is it as important to maintain the archives of MTV News as it is The Washington Post? I’m biased, but I would say, no. But it is still a loss of culture—and it is a unique loss of culture. This was a unique and novel form of information that was emergent in the 1990s and now is lost. In the moment, its very existence captured the culture in a way that is worth preserving.

How do you feel about the future of digital preservation of news, data, and information?

I’m more pessimistic than I used to be. I came of age with the internet. When it was new, I used to describe it as the emergence from a new dark age. We had all this information and there was no more going back. All this existed. Everything was online, and we had archives. Now, we see, in part because the scale has increased so quickly that economic considerations come into play, and all of a sudden… the internet isn’t just an endless archive anymore. There are very few places that are doing what libraries do to capture these things on microfiche or store books for the public’s benefit. There is so much of it and that becomes the problem.

Why is it important to pay attention to this issue and preserve journalism for future reporters?

It is obviously the case that we are creating information, culture, and benchmarks for society faster than we can figure out how we’re going to make sure they’re preserved. I think that’s probably always been the case, except that what’s different now is that we are more cognizant of the process of preservation and the challenges of preservation. We expect there to be this thing that exists forever. We don’t yet know how to balance the interest in having as few things be ephemeral as possible, versus the value in doing that… maybe it’s not even possible to preserve everything in the way that we would want to at scale. We have created a process by which it is possible to record and observe nearly everything, and now we’re realizing that that is potentially in conflict with our desire to also store and preserve all this information indefinitely.

Anything you’d like to add?

I think it’s worth noting that preservation is one of the few areas in which I think artificial intelligence bears some potential benefit. One of the things that I’ve long found frustrating is that The New York Times, The Washington Post, and other major news outlets, have enormous storehouses of information—not all of it textual. The New York Times must have, in its archives, photos of every square inch of New York City at some point in time over the course of the past 100 years. Artificial intelligence is a great tool for indexing and documenting. We now have tools that allow us to go deeper into our archives and extract more information from them, which I think is a positive development, and is something I’ve advocated for a long time publicly. Only with the advent of artificial intelligence does large-scale preservation become something that seems feasible. One can go through the National Archive and extract an enormous amount of information that is currently stored there in an accessible form, which saves someone from having to stumble upon a particular image. I think that is beneficial. I don’t think that necessarily solves the storage at scale issue, but it does address the fact that so much information is currently locked away and inaccessible, which is another facet of the challenge.  

About the author

Caralee Adams is a journalist based in Bethesda, Maryland. She is a graduate
of Iowa State University and received her master’s in political science at the
University of New Orleans. After working at newspapers and magazines, she
has been a freelancer covering education, science, tech and health for a
variety of publications for more than 30 years.

TV News Record: Caption analyses, plus fact-checks on wall & immigrants

A round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman.

This week we bring you analyses of cable TV news coverage and fact-checks of recent statements by President Donald Trump on immigration and his proposed wall on the border with Mexico.

Vox & Post turn TV news captions into media analysis

Vox’s Alvin Chang and The Washington Post’s Philip Bump continue to turn TV News Archive caption data, via Television Explorer, into analyses of current news. Chang analyzes cable TV network coverage of the March for Our Lives, an anti-gun violence demonstration, reporting that on Fox News, “There was a massive spike in mentions of the “Second Amendment” or “Constitution” during the peak of the march, and most of those mentions came from pundits and guests on the network.”

Source: Vox

Bump’s piece examines mentions of Hillary Clinton on cable TV news networks compared to those of Stormy Daniels, the adult entertainer involved in a legal dispute with the president. He finds that Fox News mentions Clinton the most, while CNN features more coverage of Daniels.

Source: The Washington Post

Fact-Check: We’ve started building the wall (Mostly False/Three Pinocchios)

During a press conference with the presidents of Estonia, Latvia, and Lithuania, President Donald Trump talked about his proposed border wall between the United States and Mexico: “We have to have strong borders. We need the wall. We’ve started building the wall, as you know, we have a $1.6 billion toward building the wall and fixing existing wall that’s falling down, it was never appropriate in the first place.”

The funding the president references comes from a spending bill recently passed by Congress. The omnibus “bill included $1.6 billion for some projects at the border, but none of that can be used toward the border wall promised during the presidential campaign.” For PolitiFact, Miriam Valverde rates the president’s claim “Mostly False.”

At The Washington Post’s Fact-Checker, Glenn Kessler gives the same claim “three Pinocchios”:

The White House failed miserably to achieve its objectives on funding for a border wall, receiving relative peanuts. It sought $25 billion, but ended up with just 5 percent of that. Moreover, the money came with strings attached so that it could only be used for fencing, not the “great” and “beautiful wall” promised by Trump.

In Orwellian fashion, fences have now become walls. Even then, the president has only secured enough money to pay for one-tenth of the new fence/wall he has sought.


Fact-Check: Caravans of people are coming to cross the U.S.-Mexico border (Half True)

Just after Fox News aired a segment on a caravan of people from Central America making its way through Mexico toward the United States, the president wrote on Twitter:

“Half True,” writes W. Gardner Shelby for PolitiFact: “President Trump tweeted that caravans of immigrants are coming to the Mexico-U.S. border… We confirmed that a caravan of 1,200 to 1,500 people from Central America–not caravans–was in southern Mexico, about 900 miles from the Rio Grande, when Trump tweeted. Also, accounts vary on whether all participants are bound to enter the U.S. An organizer estimated that most of the people intend to remain in Mexico.”

Reporting for FactCheck.org, Robert FarleyEugene Kiely and Lori Robertson write “Trump’s messages included muddled and inaccurate claims.” They summarize with the following bullet points:

  • Contrary to Trump’s assertion, there is no “liberal (Democrat)” law requiring the “Catch & Release” of people caught illegally crossing the border. There are court cases and laws that require some unaccompanied children, families and asylum-seekers to be released in the U.S., pending an immigration hearing. But it’s a stretch to blame those entirely on Democrats.

  • Trump said “big flows of people” are illegally entering the U.S. from Mexico “to take advantage of DACA.” In fact, current border-crossers are not eligible for the Deferred Action for Childhood Arrivals program.

  • Trump said that “caravans” of people were coming to the Southwest border and that Mexico “must stop them.” The caravan, a yearly demonstration, was organized by the activist group Pueblo Sin Fronteras, which says the people walking in the caravan have “a lot of intentions,” with some wanting to stay in Mexico. The caravan is now in southern Mexico, more than 800 miles from the U.S. border.

    Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

TV News Record: How cable TV news reports news, fact-checks on banking, trade, and public lands

A round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman.

This week, we present a Washington Post analysis of coverage of an alleged affair by the president; a Vox piece examining coverage of Andrew McCabe, the former deputy FBI director; and The Toronto Star’s use of a salient clip to illustrate a point about a presidential appointment. We also show fact-checks from FactCheck.org, PolitiFact, and The Washington Post’s Fact-Checker on claims related to banking, public lands, and trade policy.

Chicken-egg question on cable news coverage of alleged affair

CNN and MSNBC hosts and guests are talking a lot more about the alleged past affair between President Donald Trump and Stormy Daniels than Fox News is, according to Philip Bump’s latest analysis for The Washington Post using TV News Archive data via Television Explorer. 

Bump used the analysis as context to dig into a poll released by Suffolk University earlier this month: “One-fifth of Americans said that Fox News was the news or commentary source they trusted the most, a group that was primarily made up of Republicans… There’s a chicken-egg question here. Does Fox give the Stormy Daniels story a light touch because its audience is largely supportive of Trump or is Fox’s audience largely supportive of Trump because of the coverage they see on Fox? Or is it both?”

Did Fox News reporting contribute to perception of fired FBI official?

Vox’s Alvin Chang argues a connection between the firing of Andrew McCabe, former FBI deputy director, to a narrative built up over the course of months by Fox News. Using TV News Archive data via Television Explorer, Chang reports that “long before he was fired, Fox News… constantly referred to McCabe as the quintessential example of the FBI’s corruption and anti-Trump bias. They hinted that he was plotting several schemes against Trump during the election, leaking information to the press, and was bought and paid for by Hillary Clinton and Democrats.” This, he writes, allowed FOX News viewers to think it made “perfect sense for Attorney General Jeff Sessions (perhaps directed by Trump) to fire McCabe.” Chang goes on to warn, “This alternate reality is being fed into the president’s mind.”


What new presidential economic pick had to say about Canadian PM

The Toronto Star embedded a TV news clip in a piece on Trump’s pick to replace his economic advisor. Larry Kudlow, who is taking over from Gary Cohn as economic advisor, had said of U.S. trade policy:  “NAFTA is the key. And unfortunately we’re going after a major NAFTA ally, and perhaps America’s greatest ally, namely Canada. Even with this left-wing crazy guy Trudeau, they’re still our pals. They’re still our pals. Why are we going after them?” The clip has been viewed more than 112,000 times and counting.


Fact-Check: Senate banking bill a big win for Wall Street (Yes and No)

In a floor speech, Sen. Elizabeth Warren, D., Mass., said of the latest proposal to make changes to Dodd-Frank, “This bill is about goosing the bottom line and executive bonuses at the banks that make up the top one half of 1 percent of banks in this country by size. The very tippy-top.”

Manuela Tobias reported for PolitiFact: “The bill raises the bar of what is considered a big bank five-fold, which effectively relaxes the standards for large regional banks. Experts warn this also could open a door for bigger Wall Street bank giveaways.

The bill also has a few provisions affecting banks above $250 billion in assets. However, the effects would largely depend on the Federal Reserve’s interpretation of the law. The biggest banks might be able to get relaxed regulations, but then again, they might not.”


Fact-Check: Public lands proposal largest in history (False)

In a Senate hearing on the budget for the Dept. of the Interior, Interior Secretary Ryan Zinke said the president’s proposal “is the largest investment in our public lands infrastructure in our nation’s history. Let me repeat that, this is the largest investment in our public lands infrastructure in the history of this country.”

PolitiFact rates the claim false. Louis Jacobson reported: “It’s far from assured that the maximum figure of $18 billion in the proposal will ever be reached if enacted. Beyond that, though, Roosevelt’s $3 billion investment in the Civilian Conservation Corps would amount to $53 billion today, and it accounted for vastly more than the Trump proposal as a percentage of federal spending at the time.”

Fact-Check: U.S. has trade deficit with Canada (Four Pinocchios)

After a private meeting with Canadian Prime Minister Justin Trudeau, Trump defended his view about U.S.-Canada trade, tweeting, “We do have a Trade Deficit with Canada, as we do with almost all countries (some of them massive). P.M. Justin Trudeau of Canada, a very good guy, doesn’t like saying that Canada has a Surplus vs. the U.S.(negotiating), but they do … they almost all do … and that’s how I know!”

Glenn Kessler reports for The Washington Post’s Fact Checker that the president is not including services in his analysis of the trade relationship with Canada. He adds: “The president frequently suggests the United States is losing money with these deficits, but countries do not ‘lose’ money on trade deficits. A trade deficit simply means that people in one country are buying more goods from another country than people in the second country are buying from the first.” Kessler gives the claim four Pinocchios.

Eugene Kiely reports for FactCheck.org that the president’s claim that figures giving the U.S. a trade surplus with Canada are not including timber and energy is “not accurate. The Census Bureau, which is within the U.S. Department of Commerce, said its trade figures do include timber and energy and referred us to two publications that show that the agency does include timber and energy for imports and exports.”

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

TV News Record: Television Explorer 2.0, shooting coverage & more

A round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman. 

Explore Television Explorer 2.0

Television Explorer, a tool to search closed captions from the TV News Archive, keeps getting better. Last week GDELT’s Kalev Leetaru added new and improved features:

  • 163 channels are now available to search, from C-Span to Al Jazeera to Spanish language content from Univision and Telemundo.
  • Results now come as a percentage of 15 second clips, making comparisons between simpler.
  • The context word function for searches is similarly redesigned, counting a matching 15-second clip as well searching the 15 second clips immediately before and after, helping to alleviate some previous issues with overcounting.
  • You can now see normalization timelines on the site, with newly available data about the total number of 15-second clips monitored each day and hour included in your query.

Take the revamped Television Explorer for a spin.

Here’s what we found when we used the new tools to track the use of the term, “cryptocurrency.” The rapid ascent, and sometimes fall, of the value of cryptocurrencies such as bitcoin have led to rises and dips in TV news coverage as well. In May 2017, international TV news channels began to run stories featuring the term, rapidly increasing in November and peaking just last week with BBC News. Television Explorer shows that Deutsche Welle led the pack ahead of BBC News and Al Jazeera in covering cryptocurrency. Among US networks, Bloomberg uses the term more than twice as often as Deutsche Welle. A search of the term bitcoin shows a similar trajectory, with CNBC coverage spiking December 11, 2017, a few days before bitcoin hit its historic peak in value to date.

Florida high school shooting TV news coverage shows familiar pattern

Within a broader analysis of how responses to the most recent school shooting compare with others, The Washington Post’s Philip Bump used TV News Archive closed caption data using GDELT’s Television Explorer to examine the pattern of use of the term “gun control” on CNN, Fox, and MSNBC. “After the mass shooting in Las Vegas last October, a political discussion about banning ‘bump stocks’ — devices that allowed the shooter to increase his rate of fire — soon collapsed.” “So far, the conversation after Parkland looks similar to past patterns.”

Washington Post graphic

Fact-check: Trump never said Russia didn’t meddle in election (Pants on Fire!)

“I never said Russia did not meddle in the election”

Reacting to the indictments of Russian nationals by Special Counsel Robert Mueller, President Donald Trump wrote, “I never said Russia did not meddle in the election, I said, ‘It may be Russia, or China or another country or group, or it may be a 400 pound genius sitting in bed and playing with his computer.’ The Russian “hoax” was that the Trump campaign colluded with Russia – it never did!”

Fact-checkers moved quickly to investigate this claim.

The Washington Post’s Fact Checker, Glenn Kessler: “According to The Fact Checker’s database of Trump claims, Trump in his first year as president then 44 more times denounced the Russian probe as a hoax or witch hunt perpetuated by Democrats. For instance, here’s a tweet from the president after reports emerged about the use of Facebook by Russian operatives, a key part of the indictment: ‘The Russia hoax continues, now it’s ads on Facebook. What about the totally biased and dishonest Media coverage in favor of Crooked Hillary?’”

PolitiFact’s Jon Greenberg:  “Pants on Fire!” The president “called the matter a ‘made-up story,’ and a ‘hoax.’ He has said that he believes Russian President Putin’s denial of any Russian involvement. He told Time, ‘I don’t believe they (Russia) interfered.’”

Vox on Fox (& CNN & MSNBC): Mueller indictment, Florida shooting

In an analysis of Fox News, CNN, and MSNBC during the 72 hours following the announcement of the indictment of 13 Russians, Vox’s Alvin Chang used TV News Archive closed captioning data and the GDELT Project’s Television Explorer to show “how Fox News spun the Mueller indictment and Florida shooting into a defense of the president.” Chang uses the data to show that “[I]nstead of focusing on the details of the indictment itself, pundits on Fox News spent a good chunk of their airtime pointing out that this isn’t proof of the Trump administration colluding with Russia.”


TV news coverage and analysis in one place

Scholars, pundits, and reporters have used the data we’ve created here in the TV News Archive in ways that continue to inspire us, adding much-needed context to our chaotic public discourse as seen on TV.  All that content is now in one place, showcasing the work of these researchers and reporters who turned TV news data into something meaningful.

Follow us @tvnewsarchive, and subscribe to our weekly newsletter here.

TV News Record: The year in TV news visualizations

Thanks for being part of our community at the TV News Archive. As 2017 draws to a close, we’ve chosen six of our favorite visualizations using TV News Archive data. We look forward to assisting many more journalists and researchers in what will likely be an even more tumultuous news year. 

The New York Times: Mueller indictments

The New York Times editorial page used our Third Eye chyron collection to produce an analysis of TV news coverage of major indictments of Trump campaign officials by special counsel Robert Mueller: “The way each network covered the story – or avoided it – is a sign of how the media landscape has become ever more politicized in the Trump era. ”

credit: Taylor Adams, Jessia Ma, and Stuart A. Thompson, The New York Times, “Trump Loves Fox & Friends,” November 1, 2017.

FiveThirtyEight: hurricane coverage

Writing for FiveThirtyEight.com, Dhrumil Mehta demonstrated that TV news broadcasters paid less attention to Puerto Rico’s hurricane Maria than to hurricanes Harvey and Irma, which hit mainland U.S. primarily in Texas and Florida. Mehta used TV News Archive data via Television Explorer.

credit: Dhrumil Mehta, “The Media Really Has Neglected Puerto Rico,” FiveThirtyEight, September 28, 2017.

TV News Archive: face-time for lawmakers

Using our Face-o-Matic data set, we found that Sen. Majority Leader Mitch McConnell, R., Ky., gets the most face-time on cable TV news, and MSNBC features his visage more than the other networks examined. Fox News features the face of House Minority Leader Nancy Pelosi, D., Calif., more than any other cable network.

Vox:  Mueller’s credibility

Vox’s Alvin Chang used Television Explorer to explore how Fox News reports on Mueller’s credibility. This included showing how often Fox news mentioned Mueller in the context of former presidential candidate Hillary Clinton.

Alvin Chang, “A week of Fox News transcripts shows how they began questioning Mueller’s credibility,” Vox, October 31, 2017.

The Trace: coverage of shootings

Writing for The Trace, Jennifer Mascia presented findings from Television Explorer showing how coverage of shootings declines rapidly: “Two days after 26 people were massacred in a Texas church, the incident — one of the worst mass shootings in American history — had nearly vanished from the major cable news networks.”

Credit: Jennifer Mascia, “Data Shows Shrinking Cable News Cycles for This Fall’s Mass Shootings,” The Trace, December 5, 2017.

The Washington Post: What TV news networks covered in 2017

Philip Bump of The Washington Post crunched Television Explorer data to look at coverage of eleven major news stories by five national news networks. Here’s his visualization of TV news coverage of “sexual assault,” which shows how coverage increased at the end of the year as dozens of prominent men in media, politics, and entertainment were accused of sexual harassment or assault.

Philip Bump, “What national news networks were talking about during 2017, The Washington Post, December 15, 2017.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

A Year-end Message from the TV News Archive

by Katie Donnelly

Over the past extremely unpredictable election year, the Internet Archive invented new methods and tools to give journalists, researchers, and the public the power to access, scrutinize, share, and thoroughly fact-check political ads, presidential debates, and TV news broadcasts.

Our efforts were designed to help citizens better understand the patterns of political messages designed to persuade them and find factual, reliable information in what is disturbingly being seen as a “post-truth” world.

The Political TV Ad Archive project proved to be highly useful to our high-profile fact-checking partners, as well as reporters at an array of outlets including The New York Times, The Washington Post, FOX News, The Economist, The Atlantic, and more. By providing data about when, where, and how many times political ads aired on TV in key markets, the project unlocked new creative potential for data reporters to analyze how campaigns and outside groups were targeting messages to voters in different locations.

Breaking events, like political debates and speeches, also offered a chance for archived TV content to shine, allowing reporters to isolate and share clips in near-real time, and fact-checkers to harvest dubious statements for further exploration. In addition, the project’s experience with developing audio fingerprinting (through a new invention we call the Duplitron) for identifying instances of ads inspired a new use: tracking candidate debate sound bites in subsequent TV news shows.

In this way, reporters and researchers were able to analyze and report on which political statements were trending across different TV programs. This provided a way to show how political statements were trending across various networks, revealing the ideological, and agenda-setting and other editorial choices made by news producers about what issues to highlight and overlook.

screenshot-2016-12-19-13-21-14

As Roger Macdonald, director of the TV News Archive, wrote to project partners: “Citizens will increasingly hunger for sound information to inform wise electoral decisions. With our Republic being riven by increasing socio-political chaos and infectious divisions, whose magnitude has not been seen since before our Civil War, we think there are uncommon opportunities to serve citizens with the information for which they will increasingly yearn. We have an historic opportunity to thoughtfully place some grains of sand on the balance pan of reason.”

The project was supported by a generous grant from the Knight News Challenge, funded in partnership with the Knight Foundation, the Democracy Fund, the Hewlett Foundation and the Rita Allen Foundation, and received additional support from the Rita Allen Foundation, the Democracy Fund, PLCB Foundation, Craig Newmark, Christopher Buck, and others

Here is a quick look at project accomplishments:

Political TV Ad Archive

  • Total number of archived ad views, most embedded in partner sites: 2,036,063
  • Number of ads collected: 2,991
  • Political ads broadcast 364,822 times over 26 markets
  • Number of fact and source checks: 131
  • Press coverage: 156 articles

Katie Donnelly is associate director at Dot Connectors Studio, a Philadelphia-based strategy firm that has worked with the Political TV Ad Archive.

New Research Tool for Visualizing Two Million Hours of Television News

Guest post by Kalev Leetaru

Today the Internet Archive announces a new interactive timeline visualization–the Television Explorer–that lets you trace how any keyword–think “emails”, “tax returns”, “alt-right”–has been covered on U.S. television news over the past half-decade.

See the Television Explorer, a new tool for exploring TV News.

screenshot-2016-12-19-09-50-09

Over the past year and a half, the GDELT Project and the Internet Archive’s Television News Archive have worked closely together to visualize how U.S. television news has covered the contentious 2016 political campaign.

One of the tools we created was the 2016 Candidate Television Tracker, which used closed captioning to count how many times each of the presidential candidates was mentioned on television and offered a day-by-day timeline showing the ebbs and flows of who was “winning” the free media wars. (Answer: President-elect Donald Trump.) This tool was used by such media outlets as The Atlantic, The Washington Post, FiveThirtyEight, Politico and The Guardian, among many others.

Now we are adapting this tool to allow more sophisticated searches: rather than just the presidential candidates, now you can trace television news coverage of any keyword of your choosing. You can even run advanced searches that find words in conjunction with other works or phrases, such as finding mentions of Hillary Clinton that also discuss her email server. All search results are available for download via CSV and JSON export, making it possible for data journalists, researchers, and advocates to fine tune their analysis of the data.

When searching, you get back a visual timeline showing how often that word or phrase has appeared on American television news over the past half-decade. Nearly two million hours of television news totaling more than 5.7 billion words from over 150 distinct stations spanning July 2009 to present (though not all stations were monitored for the entire period) are searchable in this interface.

Unlike the Internet Archive’s Television New Archive interface, which returns results at the level of an hour or half-hour “show,” the interface here reaches inside of those six and a half years of programming and breaks the more than one million shows into individual sentences and counts how many of those sentences contain your keyword of interest. Instead of reporting that CNN had 24 hour-long shows yesterday that mentioned Donald Trump one or more times, the interface here will count how many sentences uttered on CNN yesterday mentioned his name–a vastly more accurate metric for assessing media attention.

Explore how CNN covered the presidential campaign of 2012 versus 2016 and understand just how big of a media event this year’s election really was. See precisely when Edward Snowden burst onto the scene and how Wikileaks got more coverage during the 2016 presidential election than its debut in 2010. Watch the seasonal spikes of Thanksgiving, or see how ebola received little attention, even as thousands died in Africa, becoming a topic only after the first Americans became infected.

Using the “near” search feature, plot coverage of Wikileaks that also mentioned either “Podesta,” “email,” or “emails” nearby and discover that FOX paid far more attention to the DNC and Podesta email hacks than CNN, MSNBC, CNBC or Bloomberg. In contrast, CNN focused more intensely on the Trayvon Martin shooting (Aljazeera America and Bloomberg were not yet being monitored by the Archive), while Aljazeera led coverage of the Michael Brown and Eric Garner deaths.

screenshot-2016-12-19-09-53-55

Search of term “Wikileaks” near Podesta, emails, Clinton

Search for “ivory” to see that Aljazeera America (which ceased operation in April 2016) devoted vastly more of its coverage to elephant poaching in Africa than any other monitored national network. It also paid the most attention to “Africa” and to the “refugee” crisis. On the other hand, Bloomberg has devoted much more of its time to “China” and to the economic crisis in “Greece” last year.

We look forward to seeing what people do with this new tool Please share your favorite searches on Twitter with the hashtag “#internetarchivetvsearch”. If you have any questions, please email kalev.leetaru5@gmail.com or nancyw@archive.org.

Kalev Leetaru is an independent data journalist.