Tag Archives: Face-O-Matic

Expanding the Television Archive

When we started archiving television in 2000, people shrugged and asked, “Why?  Isn’t it all junk anyway?” As the saying goes, one person’s junk is another person’s gold. From 2010-18, scholars, pundits and above all, reporters, have spun journalistic gold from the data captured in our 1.5 million hours of television news recordings. Our work has been fueled by visionary funders(1) who saw the potential impact of turning television – from news reports to political ads – into data that can be analyzed at scale. Now the Internet Archive is taking its Television Archive in new directions. In 2018 our goals for television will be: better curation in what we collect; broader collection across the globe; and working with computer scientists interested in exploring our huge data sets. Simply put, our mission is to build and preserve comprehensive collections of the world’s most important television programming and make them as accessible as possible to researchers and the general public. We will need your help.  

“Preserving TV news is critical, and at the Internet Archive we’ve decided to rededicate ourselves to growing our collection,” explained Roger MacDonald, Director of Television at the Internet Archive. “We plan to go wide, expanding our archives of global TV news from every continent. We also plan to go deep, gathering content from local markets around the country. And we plan to do so in a sustainable way that ensures that this TV will be available to generations to come.”

Libraries, museums and memory institutions have long played a critical role in preserving the cultural output of our creators. Television falls within that mandate. Indeed some of the most comprehensive US television collections are held by the Library of Congress, Vanderbilt University and UCLA. Now we’d like to engage with a broad range of libraries and memory institutions in the television collecting and curation process. If your organization has a mandate to collect television or researcher demand for this media, we would like to understand your needs and interests. The Internet Archive will undertake collection trials with interested institutions, with the eventual goal of making this work self-sustaining.

Simultaneously, we are looking to engage researchers interested in the non-consumptive analysis of television at scale, in ways that continue to respect the interests of right holders. The tools we’ve created may be useful. For instance, we hope the tools the Internet Archive used to detect TV campaign ads can be applied by researchers in new and different ways.  If your organization has interest in computing with television as data at large, we are interested in working with you.

This groundbreaking interface for searching television news, based on the closed captions associated with US broadcasts, was developed between 2009-2012.

A brief history of the Internet Archive’s Television collection:

2000 Working with pioneering engineer, Rod Hewitt, IA begins archiving 20 channels originating from many nations.

Oct. 2001 September 11, 2001 Collection established, and enhanced in 2011.

2009-2012 With funding from the Knight Foundation and many others, we built a service to allow public searching, citation and borrowing of US television news programs on DVD.

2012-2014 Public TV news library launched with tools to search, quote and share streamed snippets from television news.

2014 Pilot launched to detect political advertisements broadcast in the Philadelphia region, led to developing open sourced audio fingerprinting techniques.

2016 Political ad detection, curation, and access expanded to 28 battleground regions for 2016 elections, enabling journalists to fact check the ads and analyze the data at scale. The same tools helped reporters analyze presidential debates.  This resulted in front-page data visualizations in The New York Times, as well as 150+ analyses by news outlets from Fox News to The Economist to FiveThirtyEight.

2017-date Experiments with artificial intelligence techniques to employ facial identification, and on-screen optical character recognition to aid searching and data mining of television. Special curated collections of top political leaders and fact-check integrations.

In the run-up to the 2016 presidential elections, journalists at the NYT and elsewhere began analyzing television as data, in this case looking at the different sound bites each network chose to replay.

Embarking on a new direction also means shifting away from some of our current services. Our dedicated television team has been focusing on metadata enhancement and assisting journalists and scholars to use our data. We will be wrapping up some of these free services in the next three to four months.  We hope others will take up where we left off and build the tools that will make our collection even more valuable to the public.

Now more than ever in this era of disinformation, our world needs an open, reliable, canonical reference source of television news. This cannot exist without the diligent efforts of technologists, journalists, researchers, and television companies all working together to create a television archive open for all. We hope you will join us!

To learn more about the work of the TV News Archive outreach and metadata innovation team over the last few years, please see our blog posts.

(1) Funding for the Television Archive has come from diverse donors, including the John S. and James L. Knight Foundation, Democracy Fund, Rita Allen Foundation, craigslist Charitable Fund and The Buck Foundation.

TV News Record: Face-o-Matic on Trump, McConnell, and Pelosi; PolitiFact picks “2017 Lie of the Year”

A biweekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman.

This week we take a dive into nearly six months of Face-o-Matic facial recognition data. We also display the news clips behind PolitiFact’s top picks for lies and misstatements of the year, most of them seen and heard on TV news.

Face-o-Matic reveals cable news persistent patterns

With nearly six-months of data available, we’re finding certain persistent patterns on how cable networks make editorial choices in displaying the faces of President Donald Trump and top congressional leaders on TV screens.

First, Trump trumps the congressional leaders for on-screen face time, by many degrees of magnitude.

 

Second, of the four congressional leaders, Sen. Majority Leader Mitch McConnell, R., Ky., gets the most face-time on TV, and MSNBC features his visage more than the other networks examined. Fox News features the face of House Minority Leader Nancy Pelosi, D., Calif. more than any other cable network.

Face-o-Matic, an experimental service, developed in collaboration with the start-up Matroid, tracks the faces of selected high level elected officials on major TV cable news channels: CNN, Fox News, MSNBC, and the BBC.  Face-o-Matic data is available to the media, researchers, and the public, updated daily here. Stay tuned: we are in the process of testing new faces to add to Face-o-Matic: living past presidents and recent major political party nominees.

PolitiFact announces “2017 Lie of the Year”

Our fact-checking partner PolitiFact has announced its annual Lie of the Year: President Donald Trump’s statement in May 2017 to NBC’s Lester Holt, “This Russia thing…is a made-up story.” Below is the interview on the Internet Archive’s TV News Archive.

Writes PolitiFact editor, Angie Drobnic Holan:

In both classified and public reports, U.S. intelligence agencies have said Russian President Vladimir Putin ordered actions to interfere with the election. Those actions included the cyber-theft of private data, the placement of propaganda against particular candidates, and an overall effort to undermine public faith in the U.S. democratic process.

Trump’s statement about Russia was both the PolitiFact’s editors pick for Lie of the Year, and the top one chosen by PolitiFact’s readers in an online poll by “an overwhelming margin.” Readers also called out other fact-checks to highlight for the year, choosing from a list of ten from the editors, or writing in their own nominations.

The top vote-getter after Trump’s statement on Russia was Idaho Republican Rep. Raul Labrador’s claim that “Nobody dies because they don’t have access to health care.” “While the exact number of deaths saved by having health insurance is uncertain, the researchers we contacted agreed that the number is higher than zero–probably quite a bit higher,” writes PolitiFact reporter Louis Jacobson, who rated the claim “Pants on Fire.”

In third place was Sean Spicer’s claim, “that was the largest audience to witness an inauguration, period,” referring to the presidential inauguration in January 2017. PolitiFact at the time rated that claim “Pants on Fire”: “Spicer suggested 720,000 attended Trump’s inauguration, while organizers said they expected 700,000 to 900,000, and Trump himself estimated 1.5 million. All of those figures are less than the 1.8 million people who attended Obama’s 2009 inaugural.”

To see more video of fact-checks chosen by PolitiFact’s readers and editors as lies or misstatements of the year, see this list, which includes links both to TV news clips of public officals making statements on the air, along with links to fact-checks of those statements.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

 

 

TV News Record: Whoops, they said it again (on taxes)

A biweekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman

This week, we demonstrate GOP and Democrat talking points on taxes; display a case of mistaken facial identity; and present fact checks on the GOP tax proposal.

Whoops, they said it again

Was that House Minority Leader Nancy Pelosi, D., Calif., who said “tax cuts for the rich”? Or was that Senate Minority Leader Chuck Schumer, D., N.Y.? Wait: they both said it. Often.

Meanwhile, House Speaker Paul Ryan, R., Wis., keeps talking about tax reform being a “once in a generation opportunity,” and, coincidence!, so does Sen. Majority Leader Mitch McConnell, R., Ky. It’s a recurring theme.

These types of repeated phrases, often vetted via communication staff, are known as “talking points,” and it’s the way politicians, lobbyists, and other denizens of the nation’s capital sell policy. The TV News Archive is working toward the goal of applying artificial intelligence (AI) to our free, online library of TV news to help ferret out talking points so we can better understand how political messages are crafted and disseminated.

For now, we don’t have an automated way to identify such repeated phrases from the thousands of hours of television news coverage. However, searching within our curated archives of top political leaders can provide a quick way to check for a phrase you think you’re hearing often. Visit archive.org/tv to find our Trump archive, executive branch archive, and congressional archives, click into an archive, then search for the phrase within that archive.

Sample search results in the congressional archive

Funny, you look familiar

Wait, is this former President George W. Bush trying out a new look?

No, it’s not. This is Bob Massi, a legal analyst for Fox Business News and host of “Bob Massi is the Property Man.”  In a test run of new faces for our Face-o-Matic facial detection tool, Massi’s uncanny resemblance (minus the hair) to the former president earned him a “false positive” – the algorithm identified this appearance as Bush incorrectly.

This doesn’t get us too worried, as we still include human testers and editors in our secret sauce: we’ll retrain our algorithm to disregard photos of Massi in the TV news stream. It does point toward why we want to be very careful, particularly with facial recognition, where a private individual may be tracked inadvertently or a public official misrepresented. Our concern about developing ethical practices with facial recognition is why, for the present, we are restricting our face-finding to elected officials. We invite discussion with the greater community about ethical practices in applying AI to the TV News Archive at tvnews@archive.org.

In our current Face-o-Matic set we track the faces of President Donald Trump and the four congressional leaders in their TV news appearances. After receiving feedback from journalists and researchers, our next set will include living ex-presidents and recent major presidential party nominees: Jimmy Carter, Bill Clinton, George H.W. Bush, George W. Bush,  Barack Obama, Hillary Clinton, John McCain, and Mitt Romney. Stay tuned, while we fine tune our model.

Fact-check: everyone will get a tax cut (false)

In an interview on November 7, on Fox News’s new “The Ingraham Angle,” House Speaker Paul Ryan, R., Wis., says: “Everyone enjoys a tax cut all across the board.”

Pulling in information from the Tax Policy Center and a tax model created by the American Enterprise Institute, The Washington Post’s Fact Checker Glenn Kessler counters Ryan’s claim: “In the case of married families with children — whom Republicans are assiduously wooing as beneficiaries of their plan — about 40 percent are estimated to receive tax hikes by 2027, even if the provisions are retained.”

Ryan changed his language, according to Kessler, following an inquiry on November 8 from the Fact Checker. Now he is saying, “the average taxpayer in all income levels gets a tax cut.”

Fact-check: tax bill not being scored by CBO as is tradition (false)

In an interview on November 12 on CNN’s “State of the Union,” Senate Minority Whip Dick Durbin, D., Ill., claimed that the GOP tax plan is “not being scored by the Congressional Budget Office, as it is traditionally. It’s because it doesn’t add up.”

“Under the most obvious interpretation of that statement, Durbin is incorrect. The nonpartisan analysis for tax bills is actually a task handled by the Joint Committee on Taxation, and the committee has been actively analyzing the Republican tax bills,” reported Louis Jacobson of PolitiFact.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

TV News Record: With indictment, chyrons & captions get a graphic workout

A biweekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman

Fox News downplayed Mueller indictment, according to NYT editorial chyron analysis

In the most intensive use the Internet Archive’s Third Eye data to date, The New York Times editorial page analyzed chyron data to show how Fox News downplayed this week’s news of the indictment of former Trump campaign manager and other legal developments. The graphic-heavy opinion piece was featured at the top of the online homepage much of the day on Wednesday, Nov. 1:

Though it is far from the only possible way to evaluate news coverage, the chyron has become something of a touchstone for media analysts, being both the most obvious visual example of spin or distraction and the most shareable. Any negative coverage of the president usually prompts a flurry of tweets cataloguing the differences among networks in their chyron text. While CNN, MSNBC and the BBC are typically in alignment, Monday morning was a particularly stark example of how Fox News pushes its own version of reality.

 Read The New York Times opinion piece, and dig into the data yourself.

Captions yield insights on Mueller investigation, shooting coverage

Fox News actively tried to “plant doubt in viewers’ minds” as Mueller brought charges against former Trump campaign officials, according to an analysis of a week’s worth of closed captions by Alvin Chang of Vox News. Chang used Television Explorer, fueled by TV News Archive data, to crunch the numbers behind charts such as the one below.

And The Trace, an independent, nonprofit news organization that focuses on gun violence, used TV News Archive caption data via Television Explorer to show how TV news coverage of mass shootings declines quickly.

Face-o-Matic captures congressional leaders reactions on indictments

In the 24 hours following news breaking about the indictments, our Face-o-Matic data feed captured cable news networks’ editorial choices on how much face-time to allot to congressional leaders’ reactions. The answer: not much.

All together the four congressional leaders’ faces were shown for a total of 2.5 minutes on indictment-related reporting on screen by CNN, Fox News, and MSNBC. Ryan got the lion’s share of the attention. Much of this was devoted to airings of his photo in connection with his official statement,“[N]othing is going to derail what we are doing in Congress, because we are working on solving people’s problems.”

The image of Senate Majority Leader Mitch McConnell, K., Ky., was not featured by any network. House Minority Leader Nancy Pelosi, D., Calif., got attention only from Fox News, which featured her photo with discussion of her statement, in which she said despite the news, “we still need an outside fully independent investigation.”


Fact-check:Papadopoulos had a limited role in Trump campaign (had seat at table/not the whole story)

One of the most parsed statements this week was White House press secretary Sarah Huckabee Sanders’ claim that George Papadopolous, who pleaded guilty to lying to the FBI, had an “extremely limited” role in the campaign. “It was a volunteer position,” she said. “And again, no activity was ever done in an official capacity on behalf of the campaign.”

“Determining how important Papadopoulos was on the Trump team is open to interpretation, so we won’t put this argument to the Truth-O-Meter,” wrote Louis Jacobson, reporting for PolitiFact. Jacobson, however, laid out the known facts. For example, in March 2016, then presidential candidate Donald Trump tweeted out a photo of himself and advisors sitting at a table, saying it was a “national security meeting.” Papadopoulos is seen at the table sitting near future Attorney General Jeff Sessions. However, Jacobson also writes,“There is some evidence to support the argument that Papadopoulos was freelancing by pushing the Russia connection.”

Reviewing Sanders’ claim, as well as a Trump tweet along similar lines, Robert Farley and Eugene Kiely took a similar tack for FactCheck.org, concluding that Papadopoulos had a “seat at the table” in the campaign, but it was beyond licking envelopes and posting lawn signs:  “What we do know is that during this time — from late March to mid-August — Papadopoulos was in regular contact with senior Trump campaign officials and attended a national security meeting with Trump. We will let readers decide if this constitutes a ‘low-level volunteer.'”


Embed TV News Archive clips on web annotations

Now you can embed TV News Archive news clips when commenting and annotating the web, thanks to a new integration from Hypothes.is. From the Hypothesis.is blog:

This integration makes it easy for journalists, fact-checkers, educators, scholars and anyone that wants to relate specific text in a webpage, PDF, or EPUB to a particular snippet of video news coverage. All you need to do to use it is copy the URL of a TV News Archive video page, paste it into the Hypothesis annotation editor and save your annotation. You can adjust the start and end of the video to include any exact snippet. The video will then automatically be available to view in your annotation alongside the annotated text.

See a live example of the integration in this annotation with an embedded news video of Senator Charles Schumer at a news conference over a post that checks the facts in one of his statements.

“This integration means that one of the world’s most valuable resources — the news that the Internet Archive captures across the world everyday — will be able to be brought into close context with pages and documents across the web,” said Hypothesis CEO Dan Whaley. “For instance, a video of a politician making an actual statement next to an excerpt that claims the opposite, or a video of a newsworthy event next to a deeper analysis of it.”

Please take Hypothes.is for a spin and let us know what you think: tvnews@archive.org.

Follow us @tvnewsarchive, and subscribe to our biweekly newsletter here.

History is happening, and we’re not just watching

  1. Which recent hurricane got the least amount of attention from TV news broadcasters?
    1. Irma
    2. Maria
    3. Harvey
  2. Thomas Jefferson said, “Government that governs least governs best.”
    1. True
    2. False
  3. Mitch McConnell shows up most on which cable TV news channel?
    1. CNN
    2. Fox News
    3. MSNBC

Answers at end of post.

The Internet Archive’s TV News Archive, our constantly growing online, free library of TV news broadcasts, contains 1.4 million shows, some dating back to 2009, searchable by closed captioning. History is happening, and we preserve how broadcast news filters it to us, the audience, whether it’s through CNN’s Jake Tapper, Fox’s Bill O’Reilly, MSNBC’s Rachel Maddow or others. This archive becomes a rich resource for journalists, academics, and the general public to explore the biases embedded in news coverage and to hold public officials accountable.

Last October we wrote how the Internet Archive’s TV News Archive was “hacking the election,” then 13 days away. In the year since, we’ve been applying our experience using machine learning to track political ads and TV news coverage in the 2016 elections to experiment with new collaborations and tools to create more ways to analyze the news.

Helping fact-checkers

Since we launched our Trump Archive in January 2017, and followed in August with the four congressional leaders, Democrat and Republican, as well as key executive branch figures, we’ve collected some 4,534 hours of curated programming and more than 1,300 fact-checks of material on subjects ranging from immigration to the environment to elections.

 

The 1,340 fact-checks–and counting–represent a subset of the work of partners FactCheck.orgPolitiFact and The Washington Post’s Fact Checker, as we link only to fact-checks that correspond to statements that appear on TV news. Most of the fact-checks–524–come from PolitiFact; 492 are by FactCheck.org, and 324 from The Washington Post’s Fact Checker.

We’re also proud to be part of the Duke Reporter’s Lab’s new Tech & Check collaborative, where we’re working with journalists and computer scientists to develop ways to automate parts of the fact-checking process.  For example, we’re creating processes to help identify important factual claims within TV news broadcasts to help guide fact-checkers where to concentrate their efforts. The initiative received $1.2 million from the John S. and James L. Knight Foundation, the Facebook Journalism Project and the Craig Newmark Foundation.

See the TrumpUS Congress, and executive branch archives and collected fact-checks.

TV News Kitchen

We’re collaborating with data scientists, private companies and nonprofit organizations, journalists, and others to cook up new experiments available in our TV News Kitchen, providing new ways to analyze TV news content and understand ourselves.

Dan Schultz, our senior creative technologist, worked with the start-up Matroid to develop Face-o-Matic, which tracks faces of selected high level elected officials on major TV cable news channels: CNN, Fox News, MSNBC, and BBC News. The underlying data are available for download here. Unlike caption-based searches, Face-o-Matic uses facial recognition algorithms to recognize individuals on TV news screens. It is sensitive enough to catch this tiny, dark image of House Minority Leader Nancy Pelosi, D., Calif., within a graphic, and this quick flash of Senate Minority Leader Chuck Schumer, D., N.Y., and Senate Majority Leader Mitch McConnell, R., Ky.

The work of TV Architect Tracey Jaquith, our Third Eye project scans the lower thirds of TV screens, using OCR, or optical character recognition, to turn these fleeting missives into downloadable data ripe for analysis. Launched in September 2017, Third Eye tracks BBC News, CNN, Fox News, and MSNBC, and collected more than four million chyrons captured in just over two weeks, and counting.

Download Third Eye data. API and TSV options available.

Follow Third Eye on Twitter.

Vox news reporter Alvin Chang used the Third Eye chyron data to report how Fox News paid less attention to Hurricane Maria’s destruction in Puerto Rico than it did to Hurricanes Irma and Harvey, which battered Florida and Texas. Chang’s work followed a similar piece by Dhrumil Mehta for FiveThirtyEight, which used Television Explorer, a tool developed by data scientist Kalev Leetaru to search and visualize closed captioning on the TV News Archive.

 

FiveThirtyEight used TV News Archive captions to create this look at how cable networks covered recent hurricanes.

CNN’s Brian Stelter followed up with a similar analysis on “Reliable Sources” October 1.

We’re also working with academics who are using our tools to unlock new insights. For example, Schultz and Jaquith are working with Bryce Dietrich at the University of Iowa to apply the Duplitron, the audiofingerprinting tool that fueled our political ad airing data, to analyze floor speeches of members of Congress. The study identifies which floor speeches were aired on cable news programs and explores the reasons why those particular clips were selected for airing. A draft of the paper was presented in the 2017 Polinfomatics Workshop in Seattle and will begin review for publication in the coming months.

What’s next? Our plans include making more than a million hours of TV news available to researchers from both private and public institutions via a digital public library branch of the Internet Archive’s TV News Archive. These branches would be housed in computing environments, where networked computers provide the processing power needed to analyze large amounts of data. Researchers will be able to conduct their own experiments using machine learning to extract metadata from TV news. Such metadata could include, for example, speaker identification–a way to identify not just when a speaker appears on a screen, but when she or he is talking. Metadata generated through these experiments would then be used to enrich the TV News Archive, so that any member of the public could do increasingly sophisticated searches.

Going global

We live in an interdependent world, but we often lack understanding about how other cultures perceive us. Collecting global TV could open a new window for journalists and researchers seeking to understand how political and policy messages are reported and spread across the globe. The same tools we’ve developed to track political ads, faces, chyrons, and captions can help us put news coverage from around the globe into perspective.

We’re beginning work to expand our TV collection to include more channels from around the globe. We’ve added the BBC and recently began collecting Deutsche Welle from Germany and the English-language Al Jazeera. We’re talking to potential partners and developing strategy about where it’s important to collect TV and how we can do so efficiently.

History is happening, but we’re not just watching. We’re collecting, making it accessible, and working with others to find new ways to understand it. Stay tuned. Email us at tvnews@archive.org. Follow us @tvnewsarchive, and subscribe to our weekly newsletter here.

Answer Key

  1. b. (See: “The Media Really Has Neglected Puerto Rico,” FiveThirtyEight.
  2. b. False. (See: Vice President Mike Pence statement and linked PolitiFact fact-check.)
  3. c. MSNBC. (See: Face-O-Matic blog post.)

Members of the TV News Archive team: Roger Macdonald, director; Robin Chin, Katie Dahl, Tracey Jaquith, Dan Schultz, and Nancy Watzman.

Face-o-Matic data show Trump dominates – Fox focuses on Pelosi; MSNBC features McConnell

For every ten minutes that TV cable news shows featured President Donald Trump’s face on the screen this past summer, the four congressional leaders’ visages were presented  for one minute, according an analysis of Face-o-Matic downloadable, free data fueled by the Internet Archive’s TV News Archive and made available to the public today.

Face-o-Matic is an experimental service, developed in collaboration with the start-up Matroid, that tracks the faces of selected high level elected officials on major TV cable news channels: CNN, Fox News, MSNBC, and the BBC. First launched as a Slack app in July, the TV News Archive, after receiving feedback from journalists, is now making the underlying data available to the media, researchers, and the public. It will be updated daily here.

Unlike caption-based searches, Face-o-Matic uses facial recognition algorithms to recognize individuals on TV news screens. Face-o-Matic finds images of people when TV news shows use clips of the lawmakers speaking; frequently, however, the lawmakers’ faces also register if their photos or clips are being used to illustrate a story, or they appear as part of a montage as the news anchor talks.  Alongside closed caption research, these data provide an additional metric to analyze how TV news cable networks present public officials to their millions of viewers.

Our concentration on public officials and our bipartisan tracking is purposeful; in experimenting with this technology, we strive to respect individual privacy and extract only information for which there is a compelling public interest, such as the role the public sees our elected officials playing through the filter of TV news. The TV News Archive is committed to doing this right by adhering to these Artificial Intelligence principles for ethical research developed by leading artificial intelligence researchers, ethicists, and others at a January 2017 conference organized by the Future of Life Institute. As we go forward with our experiments, we will continue to explore these questions in conversations with experts and the public.

Download Face-o-Matic data here.

We want to hear from you:

What other faces would you like us to track? For example, should we start by adding the faces of foreign leaders, such as Russia’s Vladimir Putin and South Korea’s Kim Jong-un? Should we add former President Barack Obama and contender Hillary Clinton? Members of the White House staff? Other members of Congress?

Do you have any technical feedback? If so, please let us know what they are by contacting tvnews@archive.org or participating in the GitHub Face-o-Matic page.

Trump dominates, Pelosi gets little face-time

Overall, between July 13 through September 5, analysis of Face-o-Matic data show:

  • All together, we found 7,930 minutes, or some 132 hours, of face-time for President Donald Trump and the four congressional leaders. Of that amount, Trump dominated with 90 percent of the face-time. Collectively, the four congressional leaders garnered 15 hours of face-time.
  • House Minority leader Nancy Pelosi, D., Calif., got the least amount of time on the screen: just 1.4 hours over the whole period.
  • Of the congressional leaders, Senate Majority Leader Mitch McConnell’s face was found most often: 7.6 hours, compared to 3.8 hours for House Speaker Paul Ryan, R., Wis.; 1.7 hours for Senate Minority Leader Chuck Schumer, D., N.Y., and 1.4 hours for Pelosi.
  • The congressional leaders got bumps in coverage when they were at the center of legislative fights, such as in this clip of McConnell aired by CNN, in which the senator is shown speaking on July 25 about the upcoming health care reform vote. Schumer got coverage on the same date from the network in this clip of him talking about the Russia investigation. Ryan got a huge boost on CNN when the cable network aired his town hall on August 21.

Fox shows most face-time for Pelosi; MSNBC, most Trump and McConnell

The liberal cable network MSNBC gave Trump more face-time than any other network. Ditto for McConnell. A number of these stories highlight tensions between the senate majority leader and the president. For example, here, on August 25, the network uses a photo of McConnell, and then a clip of both McConnell and Ryan, to illustrate a report on Trump “trying to distance himself” from GOP leaders. In this excerpt, from an August 21 broadcast, a clip of McConnell speaking is shown in the background to illustrate his comments that “most news is not fake,” which is interpreted as “seem[ing] to take a shot at the president.”

MSNBC uses photos of both Trump and McConnell in August 12 story on “feud” between the two.

While Pelosi does not get much face-time on any of the cable news networks examined, Fox News shows her face more than any other. In this commentary report on August 20, Jesse Waters criticizes Pelosi for favoring the removal of confederate statues placed in the Capitol building. “Miss Pelosi has been in Congress for 30 years. Now she speaks up?” On August 8, “Special Report With Bret Baier” uses a clip of Pelosi talking in favor of women having a right to choose the size and timing of her family as an “acid test for party base.”

Example of Fox News using a photo of House Minority Leader Nancy Pelosi to illustrate a story, in this case about a canceled San Francisco rally.

While the BBC gives some Trump face-time, it gives scant attention to the congressional leaders. Proportionately, however, the BBC gives Trump less face-time than any of the U.S. networks.

On July 13 the BBC’s “Outside Source” ran a clip of Trump talking about his son, Donald Trump, Jr.’s, meeting with a Russian lobbyist.

For details about the data available, please visit the Face-O-Matic page. The TV News Archive is an online, searchable, public archive of 1.4 million TV news programs aired from 2009 to the present.  This service allows researchers and the public to use television as a citable and sharable reference. Face-O-Matic is part of ongoing experiments in generating metadata for reporters and researchers, enabling analysis of the messages that bombard us daily in public discourse.

 

McConnell, Schumer, Ryan, Pelosi fact-checked clips featured in new TV News Archive collections

Today the Internet Archive’s TV News Archive unveils growing TV news collections focused on congressional leadership and top Trump administration officials, expanding our experimental Trump Archive to other newsworthy government officials. Together, all of the collections include links to more than 1,200 fact-checked clips–and counting–by our national fact-checking partners, FactCheck.org, PolitiFact, and The Washington Post‘s Fact Checker.

These experimental video clip collections, which contain more than 3,500 hours of video, include archives focused on Senate Majority Leader Mitch McConnell, R., Ky.; Sen. Minority Leader Charles (“Chuck”) Schumer, D., N.Y.; House Speaker Paul Ryan, R., Wis.; and House Minority Leader, Nancy Pelosi, D., Calif., as well as top Trump officials past and present such as Secretary of State Rex Tillerson and former White House Press Secretary Sean Spicer.

Download a csv of fact-checked video statements or see all the fact-checked clips.

Visit the U.S. Congress archive.

Visit the Executive Branch archive.

Visit the Trump Archive.

We created these largely hand-curated collections as part of our experimentation in demonstrating how Artificial Intelligence (AI) algorithms could be harnessed to create useful, ethical, public resources for journalists and researchers in the months and years ahead. Other experiments include:

  • the Political TV Ad Archive, which tracked airings of political ads in the 2016 elections by using the Duplitron, an open source audio fingerprinting tool;
  • the Trump Archive, launched in January;
  • Face-O-Matic, an experimental Slack app created in partnership with Matroid that uses facial detection to find congressional leaders’ faces on TV news. Face-O-Matic has quickly proved its mettle by helping our researchers find clips suitable for inclusion in the U.S. Congress Archive; future plans include making data available in CSV and JSON formats.
  • in the works: TV Architect Tracey Jaquith is experimenting with detection of text in the chyrons that run on the bottom third of cable TV news channels. Stay tuned.

Red check mark shows there’s a fact-check in this footage featuring House Minority Leader Nancy Pelosi, D., Calif. Follow the link below the clip to see the fact-check, in this case by The Washington Post’s Fact Checker.

At present, our vast collection of TV news –1.4 million shows collected since 2009–is searchable via closed-captioning. But closed captions, while helpful, can’t help a user find clips of a particular person speaking; instead, when searching a name such as “Charles Schumer” it returns a mix of news stories about the congressman, as well as clips where he speaks at news conferences, on the Senate floor, or in other venues.

We are working towards a future in which AI enrichment of video metadata will more precisely identify for fact-checkers and researchers when a public official is actually speaking, or some other televised record of that official making an assertion of fact. This could include, for example, camera footage of tweets.

Such clips become a part of the historical record, with online links that don’t rot, a central part of the Internet Archive’s mission to preserve knowledge. And they can help fact-checkers decide where to concentrate their efforts, by finding on-the-record assertions of fact by public officials. Finally, these collections could prove useful for teachers, documentary makers, or anybody interested in exploring on-the-record statements by public officials.

For example, here are two dueling views of the minimum wage, brought to the public by McConnell and Schumer.

In this interview on Fox News in January 2014, McConnell says, “The minimum wage is mostly an entry-level wage for young people.” PolitiFact’s Steve Contorno rated this claim as “mostly true.” While government statistics do show that half of the people making the minimum wage are young, 20 percent are in their late 20s or early 30s and another 30 percent are 35 or older. Contorno also points out that it’s a stretch to call these jobs “entry-level,” but rather are “in the food or retail businesses or similar industries with little hope for career advancement.”

Schumer presents a different assertion on the minimum wage, saying on “Morning Joe” in May 2014 that with a rate of $10.10/hour “you get out of poverty.” PolitiFact’s Louis Jacobson rated this claim as “half true”: “Since the households helped by the $10.10 wage account for 46 percent of all impoverished households, Schumer is right slightly less than half the time.”

These new collections reflect the hard work of many at the Internet Archive, including Robin Chin, Katie Dahl, Tracey Jaquith, Roger MacDonald, Dan Schultz, and Nancy Watzman.

As we move forward, we would love to hear from you. Contact us with questions, ideas, and concerns at tvnews@archive.org. And to keep up-to-date with our experiments, sign up for our weekly TV News Archive newsletter.

 

TV News Record: McCain returns to vote, Spicer departs

A weekly round up on what’s happening and what we’re seeing at the TV News Archive by Katie Dahl and Nancy Watzman. Additional research by Robin Chin.

Last week, Sean Spicer left his White House post and Anthony Scaramucci, the new communications director, made his mark; Sen. John McCain, R., Ariz., returned to the Senate floor to debate–and cast a deciding vote on–health care reform; and fact-checkers examined claims about Trump’s off-the-record meeting with Russian President Vladimir Putin, and more.

McCain shows up in D.C. – and on Face-O-Matic

Last week, after we launched Face-O-Matic, an experimental Slack app that recognizes the faces of top public officials when they appear on TV news, we received a request from an Arizona-based journalism organization to track Sen. John McCain, R., Ariz.. Soon after we added the senator’s visage to Face-O-Matic, we started getting the alerts.

News anchors talked about how McCain’s possible absence because of his brain cancer diagnosis could affect upcoming debates and votes on health care.

Reporters gave background on how the Senate has dealt with absences due to illness in the past.

Pundits discussed McCain’s character, and his daughter provided a “loving portrait.” Then coverage shifted to report the senator’s return to Washington, and late last night his key no vote on the “skinny” health care repeal.



White House: Spicer out, Scaramucci in 

After Sean Spicer resigned as White House communications director, Fox News and MSNBC offered reviews of his time at the podium.

On Fox News, Howard Kurtz introduced Spicer as someone “long known to reporters as an affable spokesman; he became the president’s pit bull,” and went on to give a run-down of his controversial relationship with the press. The conclusion, “He lasted exactly, six months.”

MSNBC offered a mashup of some of Spicer’s most famous statements. These include: “This was the largest audience to ever witness an inauguration, period, both in person and around the globe,” and “But you had a – you know, someone who is as despicable as Hitler who didn’t even sink to using chemical weapons.”

Late this week, Ryan Lizza published an article in The New Yorker based on a phone call he received from the new White House communications director, Anthony Scaramucci, in which the new White House communications director used profanity to describe other members of the White House staff he accused of leaking information. That article soon became fodder for cable TV.



Schumer, Ryan weigh in on Mueller

As Special Counsel Robert Mueller widens his investigation into Russian interference in U.S. elections, speculation is running high on TV news that President Donald Trump might fire him.

Fox News ran a clip of Senate Minority Leader Chuck Schumer, D., NY., saying, “I think it would cause a cataclysm in Washington.”

MSNBC ran a radio clip from House Speaker Paul Ryan, R., Wis.:  “I don’t think many people are saying Bob Mueller is a person who is a biased partisan. We have an investigation in the House, an investigation in the Senate, and a special counsel which sort of depoliticizes this stuff and gets it out of the political theater.”



Fact-check: Transgender people in the military would lead to tremendous medical costs and disruption (lacks context)

In a series of tweets this week, President Trump wrote, “After consultation with my Generals and military experts, please be advised that the United States Government will not accept or allow… Transgender individuals to serve in any capacity in the U.S. Military. Our military must be focused on decisive and overwhelming… victory and cannot be burdened with the tremendous medical costs and disruption that transgender in the military would entail. Thank you.”

For FactCheck.org, Eugene Kiely reported, “Although Trump described the cost as ‘tremendous,’ RAND estimated that providing transition-related health care would increase the military’s health care costs for active-duty members ‘by between $2.4 million and $8.4 million annually.’ That represents an increase of no more than 0.13 percent of the $6.27 billion spent on the health of active-duty members in fiscal 2014.”



Fact Check: Nixon held meetings with heads of state without an American interpreter (true)

Speaking on “The Rachel Maddow Show,” Ian Bremmer, president of the Eurasia Group, said:  “Apparently, President Nixon used to do it because he felt, didn’t really trust the State Department, at that point, providing the translators and didn’t necessarily want information getting out, leaking, that he would want to keep private.”

“True,” wrote Joshua Gillan for PolitiFact: “Presidential historians, historical accounts and Nixon’s own memoir show this was the case. But it’s notable that even in the example most comparable to Trump’s meeting with Putin, when Nixon used only a Soviet translator during two meetings with Brezhnev, official records of the meeting exist.”



Fact-check: Allowing insurers to sell plans across state lines will mean premiums go down 60-70% (no evidence)

Not long before the Senate took up health care reform, President Donald Trump said “We’re putting it [allowing insurers to sell plans across state lines] in a popular bill, and that will come. And that will come, and your premiums will be down 60 and 70 percent.”

FactCheck.org’s Lori Robertson reported the “National Association of Insurance Commissioners — a support organization established by the country’s state insurance regulators — said the idea that cross-state sales would bring about lower premiums was a ‘myth.’”



Fact-Check: When the price for oil goes up, it goes up, and never goes down (false)

In an interview Sunday about the new Democratic Party national agenda, Senate Minority Leader Chuck Schumer, D., N.Y., said, “We have these huge companies buying up other big companies. It hurts workers and it hurts prices. The old Adam Smith idea of competition, it’s gone. So people hate it when their cable bills go up, their airline fees. They know that gas prices are sticky. You know … when the price for oil goes up on the markets, it goes right up, but it never goes down.”

For PolitiFact, Louis Jacobson reported, “This comment takes a well-known phenomenon and exaggerates it beyond recognition. While experts agree that prices tend to go up quickly after a market shock but usually come down more slowly once the shock is resolved, this phenomenon only occurs on a short-term basis – a couple of weeks in most cases.”

To receive the TV News Archive’s email newsletter, subscribe here.

TV News Record: adventures with Face-O-Matic

A weekly round up on what’s happening at the TV News Archive by Katie Dahl and Nancy Watzman

This week we bring you adventures with Face-O-Matic; fact-checks on President Donald Trump’s legislative record and on health care reform; and we follow the TV on use of the term “lies” and “lying.”

Here’s a face, there’s a face, everywhere a face…

Face-O-Matic, our new experimental Slack app that finds faces of political leaders on major national cable networks, has given us a whole new perspective on how imagery is used in news production. Sure, Face-O-Matic picks up clips of President Donald Trump, Senate Majority Leader Mitch McConnell, R., Ky., and others speaking, whether on the floor, at press conferences, or at a luncheon.

However, often these elected officials’ faces are used to illustrate a point a news anchor is making, or in footage without audio, sometimes as a floating head somewhere on the screen, or as part of a tweet. Face-O-Matic even picks up faces in a crowd.

Face-O-Matic can help find frequently re-aired clips.

Face-O-Matic can find even images that are only briefly displayed on the screen.

Face-O-Matic finds both still and video of Trump in a single clip.

Please take Face-O-Matic on a spin and share your feedback with us, tvnews@archive.org. This blog post explains how it fits into our overall plan to turn TV news into data. To install,  for now you’ll need to ask your Slack team administrator or owner to set it up. The administrator can click on the button below to get started. Visit Slack to learn how to set up or join a Slack team. Questions? Contact Dan Schultz, dan.schultz@archive.org.

Fact-check: Trump has signed more bills than any president ever (wrong)

News cameras captured Trump saying, “We’ve signed more bills — and I’m talking about through the legislature — than any president ever.” (A moment later, he commented that he doesn’t “like Pinocchios,” referring to The Washington Post’s Fact Checker rating system.)

Glenn Kessler, reporting for that same fact-checking site, explained why Trump is not, in fact, besting his predecessors in the White House when it comes to bill signing. But, he refrains from stating how many Pinocchios the president had earned: “Tempted as we are to give the president Pinocchios for his statement, he seemed to be speaking off the cuff and was operating on outdated information from his first 100 days. We don’t play gotcha here at The Fact Checker, and we appreciate that he added a caveat. He certainly appeared to pause for a moment and wonder if he was right. For Trump, that’s a step in the right direction…But he’s way off the mark and actually falling behind in legislative output.”



Fact-check: “bushel” of Pence claims on health care reform (range from “twists the facts” to “false”)

Also writing for The Washington Post’s Fact Checker, Michelle Ye Hee Lee checked a number of statements Vice President Mike Pence made about the Senate health care reform bill during an appearance at the National Governors Association.

These included, for example, the claim, “I know Governor Kasich isn’t with us, but I suspect that he’s very troubled to know that in Ohio alone, nearly 60,000 disabled citizens are stuck on waiting lists, leaving them without the care they need for months or even years.”

Lee wrote that this claim is false: “[T]here’s no evidence the wait lists are tied to Medicaid expansion. We previously gave four Pinocchios to a similar claim….The expansion and wait list populations are separate, and expansion doesn’t necessarily affect the wait list population….Whether people move off the wait list depends on many factors, such as how urgent their needs are, how long they’ll need services and whether the states have money to pay for them. Many times, a slot opens up only if someone receiving services moves out of the state or dies.”



Follow the TV

There’s been much controversy in news gathering circles about when, whether, and how to invoke the word “lie” when reporting on public officials. One of our archivists, Robin Chin, has noticed a number of prominent uses of the term by commentators in recent TV news coverage.

For example, here’s Shepard Smith on Fox News on July 14 saying, “Jared Kushner filled out his form. I think it’s an F-86 saying who he met with and what he had done… He went back and added 100 names and places. None of these people made it… Why is it lie after lie after lie? … My grandmother used to say when first we practice to — oh, what a tangled web we weave when first we practice to deceive. The deception, Chris, is mind boggling.”

And here’s Tom Brokaw on July 16 on NBC’s “Meet the Press,” saying: “Certainly there are atmospherics here that call to mind Watergate, the kind of denial of the obvious and the petty lying that is going on. But at the same time, Watergate, I like to think, was there by itself and this president is entangling himself in that kind of discussion they we’re having here today when it’s not in the interest of anyone, most of all this country, when we have so many issues before us. It’s got to get cleaned up.

On July 17, on “CNN: Tonight With Don Lemon,” here is David Gergen saying: “Other presidents succeed at this by just being straightforward about the facts. And it’s gone on for so long and so duplicitous and so much double speak that you begin to wonder, this is quite intentional. This may be quite intentional. You create a fog bank of lies and uncertainties and vagueness and create so many different details that people just sort of say, the hell with that, I don’t want to watch this… My sense is that a lot of Americans are starting to tune out…”

Search captions for terms you are interested in at the TV News Archive. For trends, try the Television Explorer, built by data scientist Kalev Leetaru, and powered by TV News Archive data, which can provide quick visualizations of terms broken down by network.

To receive the TV News Archive’s email newsletter, subscribe here.