Author Archives: Wendy Hanamura

DWeb Camp: Exploring Governance & AI

Posted on August 22, 2023 by Wendy Hanamura

Written contributions by Val Elefante, Jenny Fan, Dazza Greenwood, Cent Hosten, Ronen Tamari, Joshua Tan, Riley Wong, and Jacky Zhao

The Metagovernance Project (aka Metagov) returned to DWeb Camp for the second year in a row, this year as a DWeb Sponsor, supporting the event by curating some of the camp’s governance and AI sessions. In this post, we hear from Josh Tan, co-curator of the AI track, and from the Metagov governance researchers who helped co-create the governance track.

To get a sense of our work, watch this video documenting our Redwood Parliament program at DWeb Camp 2022.

AI Meets the Decentralized Web

What does the DWeb community talk about when they talk about AI? Perhaps more mysteriously, what brings an AI company like OpenAI out to the woods outside of San Francisco to talk about the decentralized web?

At this year’s DWeb Camp, Metagov worked with OpenAI, the Internet Archive, and the Foresight Institute to curate a selection of AI speakers and workshops. The programming featured presentations by Aza Raskin (Center for Humane Technology), Jason Kwon (OpenAI), Che Chang (OpenAI), Rosie Campbell (OpenAI), Doc Searls, Stephen Hood (Mozilla), Philip Rosedale (Second Life), and many, many others. The planning was led by Allison Duettmann of Foresight and Joshua Tan of Metagov, with critical support from Wendy Hanamura of the Internet Archive.

One of the key questions raised concerned the challenges and risks of open-source AI. For example, in Aza Raskin’s picture of possible AI futures, open-source might lead us to a future where everyone, everywhere has access to the intelligence needed to design viruses, imitate public figures, or manipulate elections. Yet, in a conversation on open-source AI models featuring Stephen Hood from Mozilla, James Baicoianu from Stability AI, Philip Rosedale, and Qianqian Ye, everyone agreed that “the cat is out of the bag” when it comes to open-source AI. Open-source AI is already here, and it’s not going away.

We didn’t necessarily come away with a conclusion so much as a better sense of the question. From Josh’s closing remarks: “I honestly wrestle with this. I honestly do not know, and it feels weird, it feels very weird to be a student of the legends who built the open internet and ask, should [AI] be open? It reminds me of a question we ask ourselves as a liberal society—is it possible to be too open as a society? Do open societies ultimately bring about their own downfalls?”

Governance at DWeb

Can We Trust Our Fellow “Digital Citizens”?

This session, led by Metagov contributor Jenny Fan, was a round table discussion around the provocation: can civic responsibilities for online “citizens” exist in an analogous way to how civic duties exist in offline communities? As one participant quoted, “The scarcest resource is legitimacy,” and appropriately, the conversation was framed in the context of the dearth of legitimate forms of community governance and content moderation for online communities. Though participants were not primarily governance researchers, we ended up with a comprehensive and thought-provoking survey of existing projects in this space.

We broke down the challenges of online “citizenship” around identity, reputation, intrinsic/extrinsic motivation, the issues of delegating trust to other users, and how the level of effort required correlates with online community engagement. Participants mentioned references as wide-ranging as existing political science research (liquid democracy, quadratic voting, radical markets), Web 2-adjacent projects (Periscope, Twitter Community Notes), Web 3-adjacent projects (Klairos, Nouns DAO’s zero knowledge voting, and one participant’s experience IRL at Zuzalu’s pop-up community), and more. In particular, users highlighted the challenges of shifting typically extrinsic motivators for civic behavior to intrinsic motivation, given the cost-incentive structure of the internet. As one participant aptly put it, “The offline world is full of sticks, but the internet only has carrots.”

D20 Governance Playthrough

D20 Governance is a project focused on exploring modular governance through unstable communication environments and simulations. It aims to estrange the quotidian act of communication as a way of revealing ways in which interactions in online communities are infrastructurally prefigured by forms and norms of linguistic interoperability and implicit feudalism. D20 Governance aims to surface this revelation as a way of foregrounding the metagoverning architectures that order online communications, and catalyze experiences that empower communities to imagine and form more creative, flexible, experimental, and intentional patterns of self-governance. The current iteration of D20 Governance takes form as a Discord bot, and extends the composable governance mapping tool, CommunityRule. The D20 Governance working group is led by Janita Chalam, Val Elefante, Hazel, and Cent Hosten, and is supervised by Metagov research director Ellie Rennie.

For DWeb Camp we ran our first playtest with a group of eight campers placed into a “Build A Community” simulation where they had to name their community, decide on an animating purpose, and decide on their first action. The playtest had participants eloquently reciting Shakespearean renditions of their LLM-transformed posts and revolting against consensus as a decision-making mechanism. Stay tuned for future playtest announcements in the newsletter.

Let Us Imagine A Communally-Owned Internet

This year at DWeb Camp, Jacky Zhao and Spencer Chang hosted a session asking campers to gather their collective imaginations and dreams for what a communally-owned internet could look like. Collectively, the group had a lot of dystopian fiction and a lot of reminiscing, but not a lot of forward-looking dreams for the web. Dreaming, to us, felt like an important piece of fiction that rallies people to articulate a vision they want to make a reality. In hosting this session, we recalled Ruha Benjamin: “to see things as they really are, you must imagine them for what they might be”.
The session focused on circulating 5 sheets of paper, each with a question on it:

What do you wish the Internet evoked for you?
What would co-owning digital spaces look like?
What is your digital neighborhood?
Where have you felt agency online?
What is/was your favorite place on the internet?

Each question was meant to evoke certain modes of questioning. In the discussion, the group spent a significant amount of time discussing the feeling that life on the internet feels like living on rented ground and an overwhelming feeling that we have no agency over our digital environments anymore. Some reminisced over Minecraft and building their own forums and webrings. Others wondered why modern platforms like Facebook or Twitter no longer have these affordances. The group closed by wondering how to give people the ability to be architects of their own digital homes again.

Reclaiming agency and the ability to communally construct our digital spaces starts with people willing to dream and fight for it. In many ways, this session (and DWeb Camp as a whole) felt like a gathering of people who haven’t given up on the inherent good of the internet and are fighting for this future.

(excerpted from a longer reflection)

Design Charrette on LLM LLC Governance Rules

The session, led by law.MIT.edu’s Dazza Greenwood, focused on an ongoing open-source project developing an algorithmically managed LLC using LLM technology. This is similar to the Wyoming DAO LLC approach insofar as there is a role for “algorithmically managed” LLCs, but there is no smart contract, blockchain, or decentralization involved. Rather, the algorithmic manager is an LLM operating according to “constitutional rules” encoded into the software running the manager operations and communications. The current codebase is designed as a Discord bot with email integration and is being tested and iterated against a handful of legal and business use cases. The Metagov-related aspect of this project is the architectural component where a set of rules governing the behavior and actions of the LLM LLC are specified. A DWeb Camp breakout group discussed the project overall and read aloud the current version of the Constitutional Rules as the starting point for an engaging and constructive conversation and light design charrette. For more information, see the current code base and this demo presentation of the project given to the Wyoming legislature.

Challenges and Triumphs in Community Self-Governance

This session, led by Metagov researcher Val Elefante, began with an overview of Metagov’s frameworks and tools including implicit feudalism, modular politics, CommunityRule, and a demo of CollectiveVoice. It was then followed by a rapid-fire collective brainstorm of challenges that communities face when it comes to online governance. Responses included: scale (from small to larger communities, from “not serious” to “serious” decisions), too many proposals, loss of institutional memory due to platform switching, not easy to experiment, and not many available models.

The group then brainstormed ways of solving some of these governance problems using modular governance frameworks including: a randomly-selected jury system for voting on proposals, organization and summary of relevant information for easy decision-making, improved deliberation formats, and using tech to facilitate in-person governance.

Qualitative Governance

How can community governance frameworks incorporate holistic, cooperative, and emergent processes? How can community governance embrace differing needs and wants, encourage agency, and promote whole group purpose and wellness?

Facilitated by cooperative governance researcher Riley Wong, this Qualitative Governance session sought to co-create possible answers to these questions and more by naming and observing qualities of effective governance; describing the emotional experience of how effective governance feels; identifying and speculating about practices that create these experiences; and ideating ways to integrate and experiment with these practices within our own communities.

For some, effective governance was described as transparent, creative, honest, flowing, participatory, inclusive, resilient, signal boosting, and fun. It can feel energizing, activating, safe, emergent, warm, playful, joyful, caring, compassionate, holonic, empowering, euphoric, and open-hearted. Practices that can create this experience may include shared rituals, reflection, personal check-ins, “yes, and…”s, trust and relationship building, acknowledging consent, shared maintenance, space for tension processing, voluntary flows, ownership, mini-juries for direct democracy, and dancing. Integration of these practices may involve playing, prioritizing, ceding power, building trust, and celebrating stories.

Feeling safe as a necessary foundation for navigating differences, feeling seen and heard by others, building trust in the community, and keeping “epistemological humility” were also overarching themes and discussion points throughout the session. Follow-up discussions highlighted personal experiences of governance where community members felt valued, heard, safe, and trusting, and therefore empowered to take on more risk and responsibility.

Tech for Listening to Each Other Online

In this session, led by Metagov member Ronen Tamari, participants reflected on the dynamics of (figuratively) “speaking” vs “listening” in online spaces such as social media. We have lots of tools for speaking, enabling us to effortlessly broadcast our opinions to wide audiences. On the other hand, listening feels under-served: we lack tools to help sift through noise and distractions on social media and end up doom-scrolling or wandering aimlessly across platforms.

What would better tech for listening look like?

We did some embodied listening exercises to get a better sense for what listening in the real world involves. We then tried to apply the insights we gained to listening in the social media context; what does empathetic and active listening feel like online, and how can we create a shared sense of reality beyond reality-distorting algorithmic echo chambers?

Brainstorming together was a delight (“One of my fav events from the whole weekend”, as one participant wrote to us); we covered a lot of topics (and whiteboards), from AI and human-powered curation to the design of new tools, norms, and rituals. We shared contact details to keep the listening conversation going as we left the luxury of intimate shared physical spaces behind and headed back to our noisy digital metropolises.

–

Bring Your Family to DWeb Camp

Posted on May 5, 2023 by Wendy Hanamura

If you’ve ever been to a typical tech event, full of neon-lit booths and cavernous main stage talks, you know it’s just about the last place you’d want to bring your kids.

So why would we make children such an integral part of DWeb Camp? It harks back to some of our core principles:

At DWeb Camp we don’t think you should have to choose between spending time with your family and growing professionally. We realize that building a better online world takes many kinds of people, imaginations, and skills.

So at DWeb Camp, it makes sense to us to have children at the front and center. They are the reason we are building a better web in the first place.

So please bring your entire family to DWeb Camp 2023. We’ve created some great family packages to make that more affordable. At the center of our Family program is its curator, Andi Wong. She’s an arts educator, an ocean advocate, a storyteller, and a historian. Andi weaves all these elements together to create a magical experience for the children and families at DWeb Camp.

“I like to get to know the kids and know what THEY are interested in,” explains Andi. “The goal is to get to know each other well enough so they can form a community. We try to introduce them to things in nature so they’ll understand there are all these invisible forces and flows they may not have thought about.”

What can your family expect at DWeb Camp?

Educator-led program just for kids
Indigenous storytellers exploring creation myths
Juggling lessons with flow artists
Daily lessons in animal kung fu from a wonderful Sifu
Exploring soundscapes (rain, ocean, river, forest) with the Del Sol Quartet
Open Play with clay, cardboard, string, paint — materials you can recreate at home
Archery, rock climbing, hiking, and a swimming hole
Scavenger hunts to help understand decentralized technologies
Evening talent show, game night, campfires with s’mores
Stargazing with an astronomer and a concert under the night sky
Sunset movies where you can drop off the kids
Giant puppet-making & a puppet parade at the end of Camp

Andi says there’s room for everyone from “babes in arms” to tweens and teens, plus their parents, who also form a close cohort. She weaves a rich curriculum drawn from the skills of the campers themselves. So artists, dancers, storytellers, coders: what do you have to share with our youngest campers?

At Camp Navarro, accommodations range from private cabins to glamping tents with comfy mattresses and linens for 3-4 people. Or you can bring your own tent or RV. The showers are hot and the flush toilets are clean. So make DWeb Camp a family affair this year, and discover the flows of nature, technology, community, and your own perfect family flow.

GITCOIN Grants: Donate a Few Tokens, Defend a Public Treasure

Posted on June 8, 2022 by Wendy Hanamura

CALLING ALL COMMUNITY MEMBERS:

In just a few months, the lawsuit Hachette v. Internet Archive will be heard in court. In 2020, four of the world’s largest publishers sued our non-profit library to stop us from digitizing books and lending them for free to the public. The publishers and the corporations who own them, including News Corp and Bertelsmann, are demanding $20 million in damages and that we destroy 1.4 million digitized books. What’s really at stake? The right of all libraries to own, digitize and lend books of any kind. (Here’s what Harvard’s copyright advisor has to say about the consequences of our case.) Starting today, make a small donation through Gitcoin and have an enormous impact for the defense of Internet Archive, through Gitcoin’s quadratic funding.

Today, Gitcoin Grant Round 14 opens, supporting advocacy groups around the world. When you donate even $1 worth of crypto to the Internet Archive, it can result in $3-400+ from the matching pool. Quadratic funding rewards the number of community members who give, along with the amount. So many small donations can really have an enormous impact.
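
To make the arithmetic behind that claim concrete, here is a minimal sketch of the textbook quadratic funding formula, in which a project's share of the matching pool grows with the square of the sum of the square roots of its donations. The function name, project names, and sample amounts are all hypothetical, and this is not Gitcoin's actual calculation, which may apply further adjustments that are not modeled here.

```python
import math

def quadratic_match(contributions, pool):
    """Split a matching pool using the textbook quadratic funding formula.

    contributions: dict mapping a project name to a list of donation amounts.
    pool: total matching funds to distribute.
    Illustrative only; real grant rounds may use additional rules.
    """
    raw = {}
    for project, amounts in contributions.items():
        sqrt_sum = sum(math.sqrt(a) for a in amounts)
        # Unscaled match: (sum of square roots)^2 minus what donors already gave.
        raw[project] = sqrt_sum ** 2 - sum(amounts)
    total = sum(raw.values())
    if total == 0:
        return {project: 0.0 for project in raw}
    # Scale the unscaled matches so they add up to the available pool.
    return {project: pool * r / total for project, r in raw.items()}

# Hypothetical round: both projects raise $100, but from very different crowds.
example = {
    "many_small_donors": [1.0] * 100,  # 100 people give $1 each
    "one_large_donor": [100.0],        # 1 person gives $100
}
matches = quadratic_match(example, pool=1000.0)
```

In this toy round, the project backed by 100 one-dollar donors receives the entire pool, while the project with a single $100 donor receives none of it, even though both raised the same amount directly. That is the sense in which the number of people giving matters along with the amount.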

This is an example of the matching funds allotted in a previous Gitcoin Grant round.

HOW TO DONATE:

First you’ll need to create or log in to your GitHub account.
Use that account to sign in to Gitcoin.
Choose one or both of our Gitcoin grants here:

Internet Archive
DWeb Camp Fellowships

You’ll need a crypto wallet like MetaMask or Rainbow Wallet with some Ethereum or other tokens.
Select how much you want to donate. (For example: .003 ETH = about $5.00 US)
Do you want to also add some money to the matching pool? Be sure to set an amount in that field as well.
Hit the “I’m Ready to Checkout” button.
In the drop-down menu, pick Standard Checkout, Polygon, or zkSync.
Connect and log in to your crypto wallet to pay.
BONUS: You can verify your identity by creating a Gitcoin Passport via Ceramic to maximize the matching funds (up to 150%).
The more people who give, the greater the percentage of the matching pool we receive.

Checkout module for the Gitcoin Grant 14 Advocacy Round.

Thank you for taking these steps to unleash huge support for the Internet Archive, helping us pay the millions of dollars in legal fees we have already incurred. Your support helps ensure the Wayback Machine, Open Library, and all our games, concerts, books and films will be available to you for free for a very long time.

Filecoin Foundation Grants 50,000 FIL to the Internet Archive

Posted on April 1, 2021 by Wendy Hanamura

Amidst the speculative boom for NFTs and cryptocurrencies, one decentralized technology foundation is taking the long view by investing in deep history and the far future.

Today, the Filecoin Foundation announced a 50,000 FIL grant to the Internet Archive – the largest single donation in the digital library’s 25-year history.

“Holy Crow! This is a big deal,” said Brewster Kahle, the Internet Archive’s founder. “And what are we going to do with it? We’re going to invest it in making the Internet Archive more decentralized, so that our digital history is available from thousands of computers, not just a few. The idea is to make a robust and private Internet that has a history that will persist over decades and maybe centuries.”

Filecoin is a decentralized storage system designed to preserve humanity’s most important information. The creators of Filecoin envisioned an independent foundation that would serve as the long-term governance body for the Filecoin ecosystem. In awarding the grant to the Internet Archive, Filecoin Foundation board chair, Marta Belcher, stressed the two organizations’ “common goal of preserving the web and fostering its future.”

It was back in 2015 that Protocol Labs’ founder, Juan Benet, first visited the Internet Archive to share his vision for an academic conference dedicated to preserving “humanity’s greatest treasures using decentralized storage.” Building on these conversations, the Internet Archive organized the Decentralized Web Summit in 2016 in San Francisco, the first gathering of its kind. Back then, a decentralized web was mostly a concept, with little working code.

Decentralized technologists, Trent McConaghy of Ocean and Juan Benet of Protocol Labs at the 2016 Decentralized Web Summit at the Internet Archive in San Francisco.

Since 2016, the Internet Archive has worked with several decentralized tech startups to create a decentralized prototype of the digital library. When the Filecoin mainnet took off in 2020, public domain audiobooks and films from the Internet Archive were stored on Filecoin servers. Together, the two organizations created the Filecoin Archives, a community-led project to curate, disseminate and preserve important open-access information that is often at risk of being lost.

“It’s wonderful to see Filecoin come of age. We started six years ago by putting out a call to make a Decentralized Web, a web that would serve us better than the current web–one that is now starting to be dominated by just a few tech behemoths. Can we make a game with many winners?” asked Kahle. “Filecoin has made a huge step forward by deploying decentralized storage at the exabyte level. That’s very different from AWS (Amazon Web Services). It has many participants, not just one player. And its protocols are open-source. We want to see more technologies like this. This was the original vision of the Decentralized Web that the Internet Archive was hoping for five, six years ago. And it’s starting to come to fruition and Filecoin is a leader in that area.”

Although purveyors of cryptocurrencies are often accused of being driven only by short-term gain, in this group Kahle sees a different motivation. “This donation by the Filecoin Foundation is significant financially for the Internet Archive, but I’d say it’s a more interesting one than that,” said the Internet Hall of Fame engineer. “It’s a donation by a new generation of technologists that are building interesting new technologies…bringing the Archive along with it to make it so that history is preserved, that the Internet Archive makes it into this next generation. That is an interesting thing! You don’t often see that. But the Filecoin Foundation, Filecoin and IPFS, and Juan Benet himself have always been interested in preserving history and how history can be woven into the present and the future of these technologies.”

Meet Eliza Zhang, Book Scanner and Viral Video Star

Posted on February 9, 2021 by Wendy Hanamura

The glass rises and falls. Quickly and efficiently, a woman turns the pages to the rhythmic beep of the cameras. She never misses a beat.

In its first 48 hours, this tweet about book scanning at the Internet Archive went viral, reaching 7.7 million people. More than 1.5 million people viewed the video, liking it 70,000 times and retweeting it 24,000 more. At the center of it all sits Eliza Zhang, a book scanner at the Internet Archive’s headquarters in San Francisco since 2010. When I asked Eliza what she likes about her job, she replied, “Everything! I find everything interesting. I don’t feel it is boring. Every collection is important to me.”

Eliza, a college graduate from southern China, immigrated to the United States in 2009, seeking a new life and new opportunities. She landed in San Francisco in the midst of an economy-crushing recession. But through a city program called JobsNOW, the Internet Archive hired Eliza and scores of other job seekers, training them to digitize, quality control, and upload metadata for books, newspapers, periodicals and manuals. Often our digitizing staff are making these analog texts available online for the first time.

Eliza Zhang in front of the Scribe (featured in the viral video) that she has operated for more than a decade.

Raising the glass with a foot pedal, adjusting the two cameras, and shooting the page images are just the beginning of Eliza’s work. Some books, like the Bureau of Land Management publication featured in the video, have myriad fold-outs. Eliza must insert a slip of paper to remind her to go back and shoot each fold-out page, while at the same time inputting the page numbers into the item record. The job requires keen concentration.

If this experienced digitizer accidentally skips a page, or if an image is blurry, the publishing software created by our engineers will send her a message to return to the Scribe and scan it again.

Brittle, delicate fold-outs, like this page from “Early London theatres” (1894), make digitization a time-intensive task best handled by a human operator.

Listening to 70s and 80s R & B while she works, Eliza spends a little time each day reading the dozens of books she handles. The most challenging part of her job? “Working with very old, fragile books. The paper is very thin. I always wear rubber fingertips and sometimes gloves when I scan newspapers, because of the ink,” she explained.

Tweets Spark a New Interest in Digitization

Eliza is one of about 70 Scribe operators at the Internet Archive, working in digitization centers embedded in libraries across the United States, United Kingdom, and Canada. The operations are led by Elizabeth MacLeod, who manages our remote operations, and Andrea Mills, who is stationed at the University of Toronto, with support from managers and operators in each center.

“We try to meet libraries where they are,” said MacLeod, who manages remote operations from her home office in North Carolina. “From digitizing a few shipments a year at one of our regional centers to setting up and staffing full-service digitization within the library itself, we have a flexible approach to our library partnerships.”

Across Twitter, another common question arose: “Why hasn’t this job been automated?” To many, the repetitive act of turning the pages in a book and photographing them seems like the natural task for a robot. In fact, some 20 years ago, we tested commercial book scanners that feature a vacuum-powered page-turning arm. It turns out those automated scanners didn’t really work well for brittle books, rare volumes, and other special collections—the kinds of material our library partners ask us to digitize.

Scribe operators and staff at Internet Archive’s former digitization center in San Francisco, ca. 2011.

“Clean, dry human hands are the best way to turn pages,” said Mills, from her socially-distanced office at the University of Toronto. In her 15 years on the job, she has worked with hundreds of librarians to hone our digitization operations, balancing our need to preserve the original pages with minimal impact during the imaging process. “Our goal is to handle the book once and to care for the original as we work with it,” Mills explained.

So what does it take to be a Scribe operator? “It takes a level of zen,” wrote Brewster Kahle, founder and digital librarian of the Internet Archive, responding to one of the many threads about the video that popped up on Reddit. “It takes concentration and a love of books. For those who love working with books and libraries, it fits well.”

As for the hardware used for digitization, like much at the Internet Archive, the equipment is engineered and purpose-built for the job. In the viral video, Eliza is operating the original Scribe machine, designed more than 15 years ago, and Scribe software that was developed in-house and refined continuously over years of operation. “The variation in books makes [automation] difficult to do quickly and without damage,” Kahle elaborates. “We do not disbind the books, which also makes automation more difficult.”

18,000 Books and Climbing

In the decade Eliza has been working with the Internet Archive, she has scanned more than 3 million pages, 14,000 foldouts, and 18,000 items (mostly books).

And what about all the sudden social media attention? Eliza shrugs. She’s never been on Twitter before. “My goal is to guarantee zero errors,” she said. “I want to give our readers a satisfying experience.”

Digitize With Us

The Covid-19 pandemic has both created higher demand for digital content and shuttered some of our scanning centers for health and safety. We have reopened following local and national health guidelines and continue to engage with new libraries on their digitization projects.

If your library is interested in learning more about the Internet Archive’s digitization services, visit https://archive.org/scanning, and contact us at digitallibraries@archive.org.

What if you could wander the library stacks…online?

Posted on December 21, 2020 by Wendy Hanamura

Open Library Explorer is an experimental new interface that allows patrons to search our shelves of 4+ million books.

Introducing the new Open Library Explorer

As a student at the University of Waterloo, whenever Drini Cami felt stressed, he’d head to the library. Wandering through the stacks, flipping through 600-page volumes about quantum mechanics or the properties of prime numbers never failed to calm him down. And the best thing? “I would always leave the library having discovered something new—usually a variety of new things,” Cami explained. “This is something I haven’t been able to replicate at a digital library like Open Library.” What Drini longed for was the ability to discover new books serendipitously, browsing bookshelves organized by a century of librarians. But unlike most readers, Drini Cami wields a superpower: he is a designer and software developer at the Internet Archive.

Enter the Open Library Explorer, Cami’s new experiment for browsing more than 4 million books in the Internet Archive’s Open Library. Still in beta, Open Library Explorer is able to harness the Dewey Decimal or Library of Congress classification systems to recreate virtually the experience of browsing the bookshelves at a physical library. Open Library Explorer enables readers to scan bookshelves left to right by subject, up and down for subclassifications. Switch a filter and suddenly the bookshelves are full of juvenile books. Type in “subject: biography” and you see nothing but biographies arranged by subject matter.
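
As a rough illustration of the shelving idea (not Open Library Explorer's actual code), the sketch below groups a few made-up records by Dewey Decimal main class and orders each shelf by call number, the way books would sit left to right on a physical shelf. The titles, call numbers, and the `build_shelves` helper are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample records: (title, Dewey Decimal number). Illustrative only.
BOOKS = [
    ("Quantum Mechanics Primer", "530.12"),
    ("Prime Numbers", "512.7"),
    ("A Life in Letters", "920"),
    ("Intro to Algebra", "512"),
]

def build_shelves(books):
    """Group books by Dewey hundreds class, sorted by call number within each class."""
    shelves = defaultdict(list)
    for title, dewey in books:
        number = float(dewey)
        main_class = int(number) // 100 * 100  # e.g. 530.12 -> 500
        shelves[main_class].append((number, title))
    # Order the shelves by class, then the books on each shelf by call number.
    return {
        main_class: [title for _, title in sorted(items)]
        for main_class, items in sorted(shelves.items())
    }

shelves = build_shelves(BOOKS)
```

Here the mathematics and physics titles land together on the 500 shelf in call-number order, while the biography sits alone on the 900 shelf. Browsing "up and down for subclassifications" would correspond to grouping on finer digits of the same number.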

Why recreate a physical library experience in your browser?

Now that classrooms and libraries are once again shuttered, families are turning online for their educational and entertainment needs. With demand for digital books at an all-time high, the Open Library team was inspired to give readers something closer to what they enjoy in the physical world. Something that puts the power of discovery back into the hands of patrons.

Escaping the Algorithmic Bubble

One problem with online platforms is the way they guide you to new content. For music, movies, or books, Spotify, Netflix and Amazon use complicated recommendation algorithms to suggest what you should encounter next. But those algorithms are driven by the media you have already consumed. They put you into a “filter bubble” where you only see books similar to those you’ve already read. Cami and his team devised the Open Library Explorer as an alternative to recommendation engines. With the Open Library Explorer, you are free to dive deeper and deeper into the stacks. Where you go is driven by you, not by an algorithm.

Zoom out to get an ever expanding view of your library

Change the setting to make your books 3D, so you can see just how thick each volume is.

Cool New Features

By clicking on the Settings gear, you can customize the look and feel of your shelves. Hit the 3D options and you can pick out the 600-page books immediately, just by the thickness of the spine. When a title catches your eye, click on the book to see whether Open Library has an edition you can preview or borrow. For more than 4 million books, borrowing a copy in your browser is just a few clicks away.

Ready to enter the library? Click here, and be sure to share feedback so the Open Library team can make it even better.

After Searching for a Decade, Legendary Hollywood Research Library Finds a New Home

Posted on December 15, 2020 by Wendy Hanamura

Over more than 50 years, Lillian Michelson built one of Hollywood’s most famous libraries for film research.

[Press: Hollywood Reporter]

Need to know what an igloo really looks like? How about a Siberian hut? Or the inside of a 15th-century jail? For 50 years in Hollywood, generations of filmmakers would beat a path to the Michelson Cinema Research Library, where renowned film researcher Lillian Michelson could hunt down the answer to just about any question. She was the human card catalogue to a library of more than one million books, photos, periodicals and clippings. But ever since Lillian retired a decade ago, the Michelson Cinema Research Library has been languishing in cold storage, looking for a home. Today it has found one. Lillian Michelson, 92, announced that she is donating her library and life’s work to the Internet Archive. For its part, the nonprofit digital library vows to preserve her collection for the long term and digitize as much of it as possible, making it accessible to the world.

“I feel as if a fantasy I never, never entertained has been handed to me by the universe, by fate,” mused the legendary film researcher. “The Internet Archive saved my library in the best way possible. I hope millions of people will use it [to research] space, architecture, costumes, towns, cities, administration, foreign countries… the crime business! Westerns! That’s what is amazing to me, that it will be open to everybody.”

Internet Archive founder, Brewster Kahle, explained why his organization was willing to accept the entire Michelson collection and keep it intact: “A library is more than a collection of books. It is the center of a community. For decades, the Michelson Cinema Research Library informed Hollywood—and we want to see that continue. Many organizations wanted pieces of the collection, but I think the importance of keeping it together is so it can continue to help inspire global filmmakers to make accurate and compelling movies.”

Samuel Goldwyn Studios, circa 1938, where the Michelson Cinema Research Library was housed for many decades.

With $20,000 borrowed against her husband Harold’s life insurance policy, Lillian Michelson purchased the reference library in 1969. Over the next half-century, the Michelson Cinema Research Library had many homes. From the Samuel Goldwyn Studios it moved to the American Film Institute, then to Paramount Studios, and finally to Zoetrope Studios at the invitation of director, Francis Ford Coppola. Michelson later received an offer via Jeffrey Katzenberg to move the Michelson Cinema Research Library to the newly opened DreamWorks Pictures, where it remained until Lillian’s retirement due to health reasons 19 years later.

The Michelson Cinema Research Library includes 5,000+ books dating back to the early 1800s, periodicals, 30,000+ photographs, and 3,000+ clipping files. In storage they filled some 1600 boxes on 45 pallets, enough to fill more than two 18-wheel tractor trailers. Its contents have now been moved for long-term preservation to the Internet Archive’s physical archive in Richmond, California.

In September 2020, Internet Archive Founder & Digital Librarian, Brewster Kahle, was on hand at the Internet Archive’s Physical Archive in Richmond, CA to accept the 1600 boxes of books, photos, clippings, and memorabilia from the Michelson Cinema Research Library. Michelson’s books were then shipped to one of the Internet Archive’s scanning centers to be digitized and ultimately made accessible to the public.

For six decades, Michelson’s research informed scores of Hollywood films, including The Right Stuff, Rosemary’s Baby, Scarface, Fiddler on the Roof, Full Metal Jacket, The Graduate and The Birds.

Harold & Lillian Michelson fueled the creativity of scores of directors, from Alfred Hitchcock to Mel Brooks, and their influence can be traced through countless Hollywood films.

Bringing this historic Hollywood design resource back to life—a largely digital life—can make it a global design resource for art directors, designers, filmmakers and researchers in search of information and visual inspiration.

“Lillian Michelson opened my eyes to the importance of a research library to all aspects of motion picture production. At a time when the rich and deep research libraries created and maintained by the motion picture studios were being ‘given away’ or otherwise destroyed, Lillian was a beacon of light guiding us to consider them as treasure.”
—Academy Award-winning director, Francis Ford Coppola

“Harold and Lillian: A Hollywood Love Story” by director Daniel Raim chronicles the couple who became Hollywood’s “secret weapons,” empowering generations of filmmakers and designers to create their most iconic work.

The story of her long and creative union with renowned storyboard artist Harold Michelson was told in Harold and Lillian: A Hollywood Love Story, a 2015 documentary produced and directed by Daniel Raim and currently streaming on Netflix. (To honor this devoted Hollywood couple, DreamWorks Pictures named the king and queen in Shrek 2 Harold and Lillian.)

Lillian Michelson will preside over a virtual ribbon cutting, panel discussion, and a screening of the documentary on Wednesday, January 27 from 4-6:30 PM Pacific time. There, she will unveil the first phase of her new digital library, available to the world via the Internet Archive’s digital platform, at https://archive.org/details/michelson. Sign up for the screening event here.

What Information Should we be Preserving in Filecoin?

Posted on October 22, 2020 by Wendy Hanamura

The folks at Protocol Labs love their rockets. And outer space. And exploration.

So when Filecoin, their cryptocurrency-fueled decentralized storage network, launched recently, it was no surprise they called it Filecoin Liftoff. In the payload of that Filecoin rocket are treasures from the Internet Archive:

14,613 Free Audiobooks from LibriVox (20 terabytes worth)
17,000+ Films from the Prelinger Archive (17 terabytes worth)

For 15 years, LibriVox has harnessed a global army of volunteers, creating 14,200 free public domain audiobook projects in 100 different languages. Where else can you listen to Jules Verne’s 20,000 Leagues Under the Sea in French, Spanish, English, German or Dutch…for free? Now, phrases of Shakespeare, Poe, Joyce and Dante will be stored across the Filecoin mainnet, broken into packets to be reconstituted when needed—perhaps in a new century.

The same destiny awaits the home movies, stock footage, educational and amateur films in the public domain, lovingly curated by the Prelinger Archives founder, Rick Prelinger. He encourages creatives to download and reuse these videos, creating countless new works like this one by musician Jordan Paul:

Now filmmakers and connoisseurs can sleep easier, knowing that a new, distributed copy of those films lives in the Filecoin network (along with the main copy and multiple backups in the Internet Archive’s repositories).

So what’s next, Filecoin explorers?

Today, Protocol Labs and the Internet Archive are happy to announce the Filecoin Archives, a new community project to curate, disseminate and preserve important open access information often at risk of being lost. You can get involved in so many ways: by nominating information to be stored, uploading it to the Internet Archive, or preserving the data as a Filecoin node while earning Filecoin for sharing your storage capacity.

What information should we be preserving? Please tell us!

How about 166,000 public domain books (60 terabytes) from the Library of Congress? Including 2100 texts about Abraham Lincoln and slavery?

Or Open Access Journal articles? (The Internet Archive has collected 9.1 million of them.)

It takes a host of global voices with diverse viewpoints to ensure that humanity’s most precious knowledge is represented online and preserved. So we need to hear from you. What open access information or datasets are you interested in preserving?

Between now and November 5, please send us your ideas and vote on the others. We will gather your suggestions, add our own, and publish the list from which we will select information to preserve across a global network of Filecoin nodes.

How to send us your suggestions

Look for the tweet from @JuanBenet and reply to it with:

The Name of the Dataset.
The size in GB or TB.
An HTTP or @IPFS link to the data.
Why it matters.
#FilecoinArchives

Bonus points if the data is already stored in the Internet Archive or if you upload it there. Vote for ideas by retweeting them and please help us spread the word!

Juan Benet presents his early vision at the 2016 Decentralized Web Summit at the Internet Archive in San Francisco.

In 2015, a young developer named Juan Benet wandered into the Internet Archive headquarters. He painted a picture of a decentralized stack, something he now calls Web3, where the storage, transport and other layers would be distributed across many machines. Together with the DWeb community, we have imagined a web with our values written into the code: values such as privacy, security, reliability, and control over one’s own identity. With the launch of Filecoin’s mainnet, a piece of that new web is perhaps within reach.

Now it’s up to us to make sure the payload includes humanity’s most important knowledge.

Advertising powers the Web. What if it just doesn’t work?

Posted on October 19, 2020 by Wendy Hanamura

On October 14, the Internet Archive presented a book talk with author Tim Hwang, NYT tech reporter Kashmir Hill, and technologist Desigan Chinniah, discussing Hwang’s new book, “Subprime Attention Crisis.”

Is the Ad-Tech model powering the Internet really just the next financial bubble?

That is the question at the heart of a significant new book by Internet researcher Tim Hwang: Subprime Attention Crisis: Advertising and the Time Bomb at the Heart of the Internet. If you don’t already know Tim, he’s a polymath: former Google AI policy wonk, lawyer, polemicist. In other words, just the kind of thinker we think you should know. Watch the video of a virtual book event with Tim here:

Subprime Attention Crisis makes the case that the core advertising model driving Google, Facebook, and many of the most powerful companies on the internet is—at its heart—a multibillion dollar financial bubble. Drawing parallels to the 2008 subprime mortgage crisis, Tim shines a spotlight on the lack of transparency, flawed incentives, and outright fraud that keep this machine running.

On October 14, the Internet Archive hosted a talk with the author and New York Times technology reporter Kashmir Hill. Their discussion tackled:

Why data-driven, online advertising may be much, much less effective than it looks
The long-term impact of the COVID-19 recession on the media and online ads
Whether or not the giants of Big Tech are already “too big to fail”

This discussion focused not only on the problems of advertising, but also on the future, and how we might be able to transition to a better, more financially robust internet. Joining the discussion was Desigan Chinniah, who co-leads Grant for the Web—a $100 million fund launched by Coil, Mozilla, and Creative Commons to spur open standards and new economic models for the web beyond advertising.

NOTE: We urge you to purchase a copy of Tim’s new book, Subprime Attention Crisis, via our local bookseller, The Booksmith. The first 50 purchasers will receive an autographed copy.

On Bookstores, Libraries & Archives in the Digital Age

Posted on October 7, 2020 by Wendy Hanamura

The following was a guest post by Brewster Kahle in Against The Grain (ATG). See the original article from September 28, 2020 on the ATG website here.

Back in 2006, I was honored to give a keynote at the meeting of the Society of American Archivists, when the president of the Society presented me with a framed blown-up letter “S.” This was an inside joke about the Internet Archive being named in the singular, Archive, rather than the plural Archives. Of course, he was right, as I should have known all along. The Internet Archive had long since grown out of being an “archive of the Internet”—a singular collection, say of web pages—to being “archives on the Internet,” plural. My evolving understanding of these different names might help focus a discussion that has become blurry in our digital times: the difference between the roles of publishers, bookstores, libraries, archives, and museums. These organizations and institutions have evolved with different success criteria, not just because of the shifting physical manifestation of knowledge over time, but because of the different roles each group plays in a functioning society. For the moment, let’s take the concepts of Library and Archive.

The traditional definition of a library is that it is made up of published materials, while an archive is made up of unpublished materials. Archives play an important function that must be maintained—we give frightfully little attention to collections of unpublished works in the digital age. Think of all the drafts of books that have disappeared once we started to write with word processors and kept the files on fragile computer floppies and disks. Think of all the videotapes of lectures that are thrown out or were never recorded in the first place.

Bookstores: The Thrill of the Hunt

Let’s try another approach to understanding distinctions between bookstores, libraries and archives. When I was in my 20’s living in Boston—before Amazon.com and before the World Wide Web (but during the early Internet)—new and used bookstores were everywhere. I thought of them as catering to the specialized interests of their customers: small, selective, and only offering books that might sell and be taken away, with enough profit margin to keep the store in business. I loved them. I especially liked the used bookstore owners—they could peer into my soul (and into my wallet!) to find the right book for me. The most enjoyable aspect of the bookstore was the hunt—I arrived with a tiny sheet of paper in my wallet with a list of the books I wanted, would bring it out and ask the used bookstore owners if I might go home with a bargain. I rarely had the money to buy new books for myself, but I would give new books as gifts. While I knew it was okay to stay for awhile in the bookstore just reading, I always knew the game.

Libraries: Offering Conversations not Answers

The libraries that I used in Boston—MIT Libraries, Harvard Libraries, the Boston Public Library—were very different. I knew of the private Boston Athenæum but I was not a member, so I could not enter. Libraries for me seemed infinite, but still tailored to individual interests. They had what was needed for you to explore and if they did not have it, the reference librarian would proudly proclaim: “We can get it for you!” I loved interlibrary loans—not so much in practice, because it was slow, but because they gave you a glimpse of a network of institutions sharing what they treasured with anyone curious enough to want to know more. It was a dream straight out of Borges’ imagination (if you have not read Borges’ short stories, they are not to be missed, and they are short. I recommend you write them on the little slip of paper you keep in your wallet.) I couldn’t afford to own many of the books I wanted, so it turned off that acquisitive impulse in me. But the libraries allowed me to read anything, old and new. I found I consumed library books very differently. I rarely even brought a book from the shelf to a table; I would stand, browse, read, learn and search in the aisles. Dipping in here and there. The card catalog got me to the right section and from there I learned as I explored.

Libraries were there to spark my own ideas. The library did not set out to tell a story as a museum would. It was for me to find stories, to create connections, have my own ideas by putting things together. I would come to the library with a question and end up with ideas. Rarely were these facts or statistics—but rather new points of view. Old books, historical newspapers, even the collection of reference books all illustrated points of view that were important to the times and subject matter. I was able to learn from others who may have been far away or long deceased. Libraries presented me with a conversation, not an answer. Good libraries cause conversations in your head with many writers. These writers, those librarians, challenged me to be different, to be better.

Staying for hours in a library was not an annoyance for the librarians—it was the point. Yes, you could check books out of the library, and I would, but mostly I did my work in the library—a few pages here, a few pages there—a stack of books in a carrel with index cards tucked into them and with lots of handwritten notes (uh, no laptops yet).

But libraries were still specialized. To learn about draft resisters during the Vietnam War, I needed access to a law library. MIT did not have a law collection and this was before Lexis/Nexis and Westlaw. I needed to get to the volumes of case law of the United States. Harvard, up the road, had one of the great law libraries, but as an MIT student, I could not get in. My MIT professor lent me his ID that fortunately did not include a photo, so I could sneak in with that. I spent hours in the basement of Harvard’s Law Library reading about the cases of conscientious objectors and others.

But why was this library of law books not available to everyone? It stung me. It did not seem right.

A few years later I would apply to library school at Simmons College to figure out how to build a digital library system that would be closer to the carved words over the Boston Public Library’s door in Copley Square: “Free to All.”

Archives: A Wonderful Place for Singular Obsessions

When I quizzed the archivist at MIT, she explained what she did and how the MIT Archives worked. I loved the idea, but did not spend any time there—it was not organized for the busy undergraduate. The MIT Library was organized for easy access; the MIT Archives included complete collections of papers, notes, ephemera from others, often professors. It struck me that the archives were collections of collections. Each collection faithfully preserved and annotated. I think of them as having advertisements on them, beckoning the researcher who wants to dive into the materials in the archive and the mindset of the collector.

So in this formulation, an archive is a collection, archives are collections of collections. Archivists are presented with collections, usually donations, but sometimes there is some money involved to preserve and catalog another’s life work. Personally, I appreciate almost any evidence of obsession—it can drive toward singular accomplishments. Archives often reveal such singular obsessions. But not all collections are archived, as it is an expensive process.

The cost of archiving collections is changing, especially with digital materials, as is cataloging and searching those collections. But it is still expensive. When the Internet Archive takes on a physical collection, say of records, or old repair manuals, or materials from an art group, we have to weigh the costs and the potential benefits to researchers in the future.

Archives take the long view. One hundred years from now is not an endpoint, it may be the first time a collection really comes back to light.

Digital Libraries: A Memex Dream, a Global Brain

So when I helped start the Internet Archive, we wanted to build a digital library: a “complete enough” collection, and “organized enough” that everything would be there and findable. A Universal Library. A Library of Alexandria for the digital age. Fulfilling the memex dream of Vannevar Bush (do read “As We May Think”), of Ted Nelson’s Xanadu, of Tim Berners-Lee’s World Wide Web, of Danny Hillis’ Thinking Machine, Raj Reddy’s Universal Access to All Knowledge, and Peter Russell’s Global Brain.

Could we be smarter by having people, the library, networks, and computers all work together? That is the dream I signed on to.

I dreamed of starting with a collection—an Archive, an Internet Archive. This grew to be a collection of collections: Archives. Then a critical mass of knowledge complete enough to inform citizens worldwide: a Digital Library. A library accessible by anyone connected to the Internet, “Free to All.”

ABOUT THE AUTHOR

Brewster Kahle, Founder & Digital Librarian, Internet Archive

A passionate advocate for public Internet access and a successful entrepreneur, Brewster Kahle has spent his career intent on a singular focus: providing Universal Access to All Knowledge. He is the founder and Digital Librarian of the Internet Archive, one of the largest digital libraries in the world, which serves more than a million patrons each day. Creator of the Wayback Machine and lender of millions of digitized books, the Internet Archive works with more than 800 library and university partners to create a free digital library, accessible to all.

Soon after graduating from the Massachusetts Institute of Technology where he studied artificial intelligence, Kahle helped found the company Thinking Machines, a parallel supercomputer maker. He is an Internet pioneer, creating the Internet’s first publishing system called Wide Area Information Server (WAIS). In 1996, Kahle co-founded Alexa Internet, with technology that helps catalog the Web, selling it to Amazon.com in 1999. Elected to the Internet Hall of Fame, Kahle is also a Fellow of the American Academy of Arts and Sciences, a member of the National Academy of Engineering, and holds honorary library doctorates from Simmons College and University of Alberta.

Internet Archive Blogs

A blog from the team at archive.org