From Fake News to Open Data: Studying the Histories of Digital Media Using the Wayback Machine

As scholars of digital media studies, Liliana Bounegru and Jonathan Gray say the Internet Archive preserves artifacts that are integral to their work.

Jonathan Gray and Liliana Bounegru

The two academics work at King’s College London in the Department of Digital Humanities, where Bounegru is a lecturer in digital media and Gray is a senior lecturer in critical infrastructure studies. They are both interested in studying how media has changed with digital technology. The Internet Archive’s collection has been useful as they examine the history of the web, trends in and the evolution of websites, and changes in technology, society and culture.

In one study of online myths and disinformation, the researchers used the Wayback Machine to understand how the tracker signatures (snippets of code that embed ads and analytics on a website) of viral “fake news” sites changed over time. As the websites were blacklisted from major ad networks, the researchers looked up archived versions to follow how the sites’ ad-based money-making practices changed. This project was completed in collaboration with BuzzFeed News, which published an article about the findings and analytical techniques.
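To make the idea of a tracker signature concrete, here is a minimal sketch in Python of how identifiers could be pulled out of archived page HTML and compared between snapshots. It is an illustration only, not the researchers’ actual method, which the article does not describe. The two patterns (classic Google Analytics `UA-` property IDs and AdSense `ca-pub-` publisher IDs) are assumed examples of common tracker formats, and the function names `extract_tracker_ids` and `diff_snapshots` are hypothetical.

```python
import re

# Assumed example patterns; real studies may track many more signature types.
TRACKER_PATTERNS = {
    "google_analytics": re.compile(r"\bUA-\d{4,10}-\d{1,4}\b"),
    "adsense": re.compile(r"\bca-pub-\d{10,20}\b"),
}


def extract_tracker_ids(html):
    """Return the set of (tracker_type, identifier) pairs found in one page."""
    found = set()
    for name, pattern in TRACKER_PATTERNS.items():
        for match in pattern.findall(html):
            found.add((name, match))
    return found


def diff_snapshots(snapshots):
    """Compare tracker IDs across snapshots in chronological order.

    `snapshots` maps a Wayback-style timestamp (YYYYMMDDhhmmss) to the HTML
    captured at that time. Returns a list of (timestamp, added, removed).
    """
    changes = []
    previous = set()
    for timestamp in sorted(snapshots):
        current = extract_tracker_ids(snapshots[timestamp])
        changes.append((timestamp, current - previous, previous - current))
        previous = current
    return changes
```

In this sketch, an identifier that shows up in the `removed` set between two snapshots would be a prompt to look more closely at what happened to the site’s ad arrangements in that period, not proof of a blacklisting.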

This investigation builds on work that Bounegru and Gray did with First Draft, a nonprofit that works with journalists to support investigations around misinformation. They analyzed the tracker signatures of mainstream news sites alongside those of junk news sites to understand their different monetization and audience economics practices.

As a result of their investigations, the researchers created A Field Guide to Fake News that explores how digital methods can be used to study false viral news, political memes, and trolling practices. “It became widely used by a network of hundreds of media organizations and fact checking groups as well as for training people doing investigative work on disinformation,” Gray said. Together with other collaborators at the Public Data Lab which they co-founded, Bounegru and Gray wrote a paper in New Media & Society about the threat of misleading junk news on social, economic and political life and the questions that it raises about social media and online content sharing platforms.

Gray has long been interested in the politics of open and public data and is writing a book on the subject. This involves tracing how open data policies and practices have developed around the world, and he said it’s been valuable to be able to search and analyze open data websites through the Wayback Machine. As part of research for the book he published an article in Data & Policy, from Cambridge University Press, about the rise of data portals as online devices for making data public.

“In the case of data portals such as data.gov.uk we see a shift from more sociable and experimental design approaches aiming to surface questions, engage communities and support cultures of socially oriented invention to more muted, minimal expert facing infrastructures,” said Gray. “It could be considered a certain kind of success for open data advocates that portals have become so established and institutionalized, but also suggests that maybe there’s less interest in being inclusive, accessible, responsive or thoughtful in reaching communities that may be less technically oriented or those who don’t already know what they are looking for or what kinds of data is likely to be found.”

In working with their students, both Bounegru and Gray share ways that the Internet Archive can be useful for research. Through hands-on research activities with the Wayback Machine they explore how it can show how web content, user interfaces and web categories change. It can even provide evidence of broader societal change, such as how political views have shifted over time. The Archive can reveal large-scale changes and allow researchers, journalists, students and community groups to gain a richer appreciation of digital media history.
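A hands-on exercise of this kind could start by listing one archived capture per year for a site and then opening each capture side by side. The sketch below builds a query for the Wayback Machine’s CDX server and turns its JSON rows into snapshot URLs. It is an assumed workflow, not the one used in the class. The helper names `build_cdx_query` and `snapshot_urls` are hypothetical, and data.gov.uk is used only because it is the portal mentioned above. No network request is made here; fetching the query URL is left to the reader.

```python
from urllib.parse import urlencode

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(url, from_year, to_year):
    """Build a CDX query URL that asks for roughly one capture per year."""
    params = {
        "url": url,
        "output": "json",             # rows come back as JSON arrays
        "from": str(from_year),
        "to": str(to_year),
        "fl": "timestamp,original",   # only the fields needed below
        "filter": "statuscode:200",   # keep successful captures only
        "collapse": "timestamp:4",    # collapse on the 4-digit year
    }
    return CDX_ENDPOINT + "?" + urlencode(params)


def snapshot_urls(cdx_rows):
    """Turn parsed CDX JSON rows (first row is the header) into snapshot URLs."""
    if not cdx_rows:
        return []
    header, *rows = cdx_rows
    ts_index = header.index("timestamp")
    original_index = header.index("original")
    return [
        f"https://web.archive.org/web/{row[ts_index]}/{row[original_index]}"
        for row in rows
    ]
```

For example, `build_cdx_query("data.gov.uk", 2010, 2020)` gives a URL whose JSON response can be parsed and passed to `snapshot_urls` to get a year-by-year reading list of the portal’s home page.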

Added Bounegru: “We use the Internet Archive a lot. It is an essential tool for our research.”

Slide on how the Wayback Machine is being used, from Bounegru and Gray’s “web histories” class, part of a digital methods course at King’s College London.

6 thoughts on “From Fake News to Open Data: Studying the Histories of Digital Media Using the Wayback Machine”

  1. Anna Harrow

    This post highlights the instrumental role of the Internet Archive and its Wayback Machine in digital media studies. Scholars Liliana Bounegru and Jonathan Gray, from King’s College London, emphasize the Archive’s significance in preserving web artifacts, aiding research on web history, trends, and technology evolution. Their study on fake news and disinformation showcases the innovative use of the Wayback Machine to analyze tracker signatures and monetary shifts on blacklisted websites. The collaboration with BuzzFeed News and First Draft underscores the practical application of their research. Their creation of the “A Field Guide to Fake News” and contributions to media literacy are commendable. Sharing the Archive’s value with students and exploring web content evolution exemplifies their dedication to education. Lastly, Gray’s upcoming book on open data politics emphasizes the continued relevance and impact of open data initiatives.

  2. Andrew W

    The article contains a link, “A Field Guide to Fake News”, that goes to that publication’s web page. That page contains a Download button which results in a form if one wishes to be notified when a textbook being developed is finally available. The dialog says it isn’t necessary to fill out the form to download “A Field Guide to Fake News” but the field guide doesn’t download nor is there any other link to click to allow the download.

  3. Sam

    Finding disinformation by researching western official narratives over time, and comparing them with alternative (“junk”) narratives? Official narratives prevail over time? Alternative (“junk”) narratives disappear, over time? Isn’t that the modus operandi? Isn’t that the clipping work of the hidden censor, through YouTube, social media, etc.?
    Circular reasoning is the prerogative of scholarly research in the western brainy world, from archaeology, through biology, chemistry to… contemporary political news.
    It is called (contrived) confirmation bias, for the plebeian mind.

  4. Sheri

    Great article! It got me thinking, and please hear me out… who oversees the content within the archive and is it editable? Has it ever been edited, and if so why? Does the archive receive any funding, and I mean any funding – including grants, from the US Gov’t? If so, what are the terms of received money?

    I’ve used the archive for a while. Back in the ’90s, I had a personal website about a rare cancer that ranked very high on Google (non-monetized). Now, I can’t find the website anywhere. It went poof. Gone. Same goes for some basic well known info I was looking for on Steve Jobs. Only reason why this particular piece of info would NOT be found, was because it needed to be completely disappeared. If that’s the case, then the archive has been altered. If the archive has been altered then it can’t be used for fact checking.
