When professor Jason Luther wants students in his Intro to Writing Arts class to learn about multimodal composition, he has them go to the Internet Archive for inspiration.
Students peruse 78rpm records going back to the early 20th century to find just the right one for their assignment. There is no lack of material with more than 300,000 recordings from 1898 through the 1950s preserved. They are available to the public because of the collaborative Great 78 Project.
Although the students are enrolled at Rowan University in New Jersey, many are participating remotely from their homes this year because of the pandemic, and the materials are conveniently available digitally to them from anywhere.
“If the [Great 78 Project] didn’t exist, I don’t think I would have this curriculum at all,” said Luther, assistant professor for Writing Arts in the Ric Edelman College of Communication & Creative Arts at Rowan. “What I really like is the research challenge. It’s really powerful. So many times students have recovered the lost histories of these songs.”
Luther developed the project in 2018 as part of the “Technologies and Future of Writing” module in the writing course. Students have just eight classes to complete the 1-3 minute podcasts, in which they learn to master a mix of audio tools and editing skills using Audacity and WordPress. The course covers issues of compatibility and ownership, along with instruction on the economy of writing like a critic about lyrics and culture. For one recent class session, he invited Liz Rosenberg of the Archive to be a guest speaker and talk about the organization’s work and the Great 78 Project.
In the future, Luther said he would like to find more ways to incorporate some of the Archive’s collection into his curriculum. For instance, he may have students use primary source documents from independent publishers over time to craft something tangible, such as an actual history from those materials that could be passed along. “That’s one of the neat things about accretion,” he said. “We have the creativity, but then there’s also documents on the Archive that are helping us understand the 78s themselves. It’s such a vast resource.”
Incorporating materials from the Internet Archive into your course curriculum is easy. Each semester we hear from instructors doing so worldwide. Let us know how you are weaving Internet Archive media into your classes by writing to us at firstname.lastname@example.org.
The Internet Archive has reached a new milestone: 2 million. That’s how many modern books are now in its lending collection—available free to the public to borrow at any time, even from home.
“We are going strong,” said Chris Freeland, a librarian at the Internet Archive and director of the Open Libraries program. “We are making books available that people need access to online, and our patrons are really invested. We are doing a library’s work in the digital era.”
The lending collection is an encyclopedic mix of purchased books, ebooks, and donations from individuals, organizations, and institutions. It has been curated by Freeland and other librarians at the Internet Archive according to a prioritized wish list that has guided collection development. The collection has been purpose-built to reach a wide base of both public and academic library patrons, and to contain books that people want to read and access online—titles that are widely held by libraries, cited in Wikipedia and frequently assigned on syllabi and course reading lists.
“The Internet Archive is trying to achieve a collection reflective of great research and public libraries like the Boston Public Library,” said Brewster Kahle, digital librarian and founder of the Internet Archive, who began building the diverse library more than 20 years ago.
“Libraries from around the world have been contributing books so that we can make sure the digital generation has access to the best knowledge ever written,” Kahle said. “These wide ranging collections include books curated by educators, librarians and individuals, that they see are critical to educating an informed populace at a time of massive disinformation and misinformation.”
Everyday about 3,500 books are digitized in one of 18 digitization centers operated by the Archive worldwide. While there’s no exact way of identifying a singular 2 millionth book, the Internet Archive has chosen a representative title that helped push past the benchmark to highlight why its collection is so useful to readers and researchers online.
On December 31, The dictionary of costume by R. Turner Wilcox was scanned and added to the Archive, putting the collection over the 2 million mark. The book was first published in 1969 and reprinted throughout the 1990s, but is now no longer in print or widely held by libraries. This particular book was donated to Better World Books via a book bank just outside of London in August 2020, then made its way to the Internet Archive for preservation and digitization.
As expected from the title, the book is a dictionary of terms associated with costumes, textiles and fashion, and was compiled by an expert, Wilcox, the fashion editor of Women’s Wear Daily from 1910 to 1915. Given its authoritative content, the book made it onto the Archive’s wish list because it is frequently cited in Wikipedia, including on pages like Petticoat and Gown.
Now that the book has been digitized, Wikipedia editors can update citations to the book and include a direct link to the cited page. For example, users reading the Petticoat page can see that page 267 of the book has been used to substantiate the claim that both men & women wore a longer underskirt called a “petticote” in the fourteenth century. Clicking on that reference will take users directly to page 267 in The dictionary of costume where they can read the dictionary entry for petticoat and verify that information for themselves.
An additional reason why this work is important is that there is no commercial ebook available for The dictionary of costume. This book is one of the millions of titles that reached the end of its publishing lifecycle in the 20th century, so there is no electronic version available for purchase. That means that the only way of accessing this book online and verifying these citations in Wikipedia—doing the kind of research that students of all ages perform in our connected world—is through a scanned copy, such as the one now available at the Internet Archive.
Donations play an important role
Increasingly, the Archive is preserving many books that would otherwise be lost to history or the trash bin.
In recent years, the Internet Archive has received donations of entire library collections. Marygrove College gave more than 70,000 books and nearly 3,000 journal volumes for digitization and preservation in 2019 after the small liberal arts college in Detroit closed. The well-curated collection, known for its social justice, education and humanities holdings, is now available online at https://archive.org/details/marygrovecollege.
Just like The dictionary of costume, many of the books supplied for digitization come to the Archive from Better World Books. In its partnership over the past 10 years, the online book seller has donated millions of books to be digitized and preserved by the Archive. Better World Books acquires books from thousands of libraries, book suppliers, and through a network of book donation drop boxes (known as “book banks” in the UK), and if a title is not suitable for resale and it’s on the Archive’s wish list, the book is set aside for donation.
“We view our role as helping maximize the life cycle and value of each and every single book that a library client, book supplier or donor entrusts to us,” said Dustin Holland, president and chief executive officer of Better World Books. “We make every effort to make books available to readers and keep books in the reading cycle and out of the recycle stream. Our partnership with the Internet Archive makes all this possible.”
The Archive provides another channel for customers to find materials, Holland added.
“We view archive.org as a way of discovering and accessing books,” said Holland. “Once a book is discoverable, the more interest you are going to create in that book and the greater the chance it will end up in a reader’s hands as a new or gently used book.”
Having books freely available for borrowing online serves people with a variety of needs including those with limited access to libraries because of disabilities, transportation issues, people in rural areas, and those who live in under-resourced parts of the world.
Sean, an author in Oregon said he goes through older magazines for design ideas, especially from cultures that he wouldn’t be exposed to otherwise: “It gives me a wider understanding of my small place in the global historical context.” One parent from San Francisco said she uses the lending library to learn skills like hand drawing to draw characters and landscapes to interact deeper with her child.
The need for information is more urgent than ever.
“We are all homeschoolers now. This pandemic has driven home how important it is to have online access to quality information,” Kahle said. “It’s gratifying to hear from teachers and parents that are now given the tools to work with their children during this difficult time.”
Kahle’s vision is to have every reference in Wikipedia be linked to a book and for every student writing a high school report to have access to the best published research on their subject. He wants the next generation to become authors of the books that should be in the library and the most informed electorate possible.
Adds Kahle: “Thank you to all who have made this possible – all the funders, all the donors, the thousands who have sent books to be digitized. If we all work together, we can do another million this year.”
Authorized readers have special access to millions of digitized books through the Internet Archive’s program, connecting patrons with print disabilities to a vast digital library.
Doug Wilson says he’s a bit of a “bookaholic.” As senior pastor of a nondenominational church in West Covina, California, he surrounds himself with books at his office and study at home.
“I’m a voracious reader and love learning,” said Wilson, who recalls taking a wagon to the library every week as a kid and bringing back a stack of books. “I find life intriguing.”
The 64-year-old said his vision has never been great, but within the last year noticed it was worsening. Wilson was struggling to read print with small font sizes or in low light. An avid user of the Internet Archive, he learned about the Archive’s program for users with print disabilities, which allows authorized users to skip waitlists for the ebook collection and download protected EPUBs and PDFs. Wilson applied for the program and was granted access, with great results.
“It’s so helpful to be able to have multiple resources open at once on my computer,” said Wilson, who looks up material online from ancient thought to contemporary theology for his sermons. “It’s been wonderful to find something on just about anything.”
Signing up for the program was easy and fast, said Wilson. Students and researchers associated with a university can obtain access through their university library or student success center. For those outside higher ed, the process is run by Jessamyn West at the Vermont Mutual Aid Society, who receives requests through an online form from patrons around the world. People who qualify for the program include those with blindness, low-vision, dyslexia, brain injuries and other cognition problems who need extra time to interact with materials. Since October 2018, West has welcomed more than 5,600 users into the program.
“It makes a real difference to people’s lives,” said West. “Especially nowadays when many people are stuck at home and working with limited resources, having a world of accessible books available to them opens doors and expands horizons. I’ve seen people checking out books on drawing and painting, books about art history and comparative religions, and just a lot of fiction. The collection is truly extensive.”
With expanded access to digitized books, Wilson said he has been reconnecting with works written by many of his mentors through the digital theological collections. “It’s been a profound gift to discover those books that have been influential in my life. Getting access has been another way to be encouraged and mentored even from a distance,” said Wilson, adding he is in a season of life when many of those people are passing away. “The service has been so generous and a supplement to my own library.”
Being able to find the exact resource he needs from home and late at night is a convenience that Wilson said he values. Wilson has enjoyed books from Marygrove Library, a collection full of religious and social justice materials that was recently donated and made available online.
In an era with competing forms of information and disinformation, Wilson said the Internet Archive is important. “Wisdom is hard to come by,” said Wilson. “We are barraged with data in our culture. It can be hard to ferret out what’s real. To have access to actual information and works that have stood the test of time is a godsend.”
To learn more about the Internet Archive’s program for users with print disabilities, and to verify eligibility, please visit the program web page.
Rik Nemanick believes in the power of mentoring in the workplace. As an author, corporate consultant, and university instructor, he explains to business leaders and students how a mentor can bring the best out in others.
“A mentor is different from a teacher who imparts knowledge,” Nemanick says. “A good mentor broadens someone’s perspective and opens doors. It’s about challenging someone’s thinking and creating a relationship.”
“I want my message out there. I saw the Internet Archive as a way to make it more available to more people,” Nemanick says of his recent donation to the Controlled Digital Lending program. “The book sitting on Amazon or a shelf doesn’t get anyone engaged as much as if it’s available at the library.”
One of the first things that Nemanick says he did when the book was published was to donate a copy to Washington University Library in St. Louis. He wanted it available for students in his executive education graduate courses in leadership, mentoring, and human resource metrics so they could learn the concepts he advocates.
Through his work, Nemanick says he wants to challenge the way people think about mentoring and offer practical ideas. Often people enter their careers with certain, narrow expectations and a mentor can be critical with the workplace adjustment. “A mentor can help someone find their way in their profession,” he says. “My hope is that people can find their fit more easily with the information in my book.”
Nemanick says he does not worry about his book being hurt by library lending through Controlled Digital Lending.
“This is a respectful way to get your message heard. A fair number of authors just want people to read what they have written,” he says. “It’s just one more avenue to make sure it gets into people’s hands.”
There’s little doubt that both learning and work require a high degree of technology use. As schooling continues to move online in response to COVID-19, students are expected to be able to access, process, manipulate, and interpret digital content. This has brought to light a significant skill that separates successful learners from those who struggle: digital fluency. Digital fluency is a step above “digital literacy.” Learners now need to know much more than just the basics of navigating the internet, writing an email, and making their way around common productivity applications like spreadsheets. Digital fluency includes skills such as using technology tools for collaboration, marshaling online resources to solve a problem, and evaluating the accuracy of a source.
Despite the “whiz-kid” reputation of Generation Z, an alarming number of high school students lack the appropriate level of digital fluency. This set of skills is part of a larger group of key work and learning aptitudes called 21st Century Skills. A lack of digital fluency can harm students’ futures as they progress into college and careers where these skills are necessary.
Fortunately, having students complete assignments with the aid of the Internet Archive’s digital library can help build digital fluency. Students and teachers can use Internet Archive as a collaborative tool for sharing books and digital content across remote teams or classrooms, removing the physical barriers of access to books and collaborators. They can use digital libraries like Internet Archive to conduct research for assignments, with access to 20th-century texts that aren’t available from other sources. Finally, they can cross-reference sources to evaluate the accuracy of material they may find elsewhere on the internet.
Other features of Internet Archive’s digital library promote digital fluency for students as well. For example, the site includes advanced search and sorting features that are commonly used on research websites. It is critical for students to understand how to use the right keywords to find what they need, as well as how to find the most recent (or oldest) material, particular authors or publications, etc. On Internet Archive, this can be done from the advanced search options in the left toolbar. Sorting by the number of views, title, date published or the creator is available by clicking the appropriate header at the top of the search results. Even when you have the material you are looking for, you need to know how to find the specific content within it. You can do this at Internet Archive by using the search box in the upper right corner when a particular book is open on the screen.
Nearly a quarter of the way into the 21st century, learning is changing rapidly and digital fluency is becoming increasingly important for students. Tools such as Internet Archive’s digital library can help students develop these skills through activities like team collaboration, online research, and verifying sources. With multiple features that support learning in the classroom or remotely, teachers and students should consider Internet Archive a valuable resource for their work and learning.
Matt Poland is founder of MAP Consulting, an educational consulting firm specializing in workforce development.
By Michelle Swanson, an educator and national educational specialist from Eugene, OR
While education leaders and classroom teachers have discussed the growing issue of the Digital Divide for years, its severity has become painfully clear as classrooms have been forced online during school closures. The results of distance learning show low levels of engagement and progress for students from homes lacking internet access and devices. In addition, students facing the digital gap tend to have fewer books at home and live in communities struggling to keep libraries open. The pandemic has brought these serious equity issues to the forefront.
Closing the technology divide by ensuring that every student has a personal learning device and reliable internet access at home is a critical first step. Districts looking for guidance on 1:1 initiatives should look to ISTE’s definition of equitable technology access that makes up one of their “essential conditions” for supporting all learners.
Once students have a computer and WiFi, school leaders can look to the Internet Archive’s digital collections as one part of a multi-pronged strategy to address learner equity. Specifically, these online resources can be used to target issues of resource access, instructional rigor, and special needs access.
Resource Access Considerations
Book and Library Access By supplementing their onsite collections with online access to the Internet Archive’s Open Library, schools can extend a wealth of resources to all learners in a digital learning space that is open 24/7. Books can either be borrowed for one hour or two weeks, depending on availability.
Diverse Resource Access Open Library offers diverse reading materials to represent its readers. It includes books from the curated #1000 Black Girl Books list created by Marley Dias, a young girl determined to find books with main characters that looked like her. The collection includes books for younger readers like Karen Katz’ The Colors of Us and Chris Cleave’s Little Bee for older students.
Instructional Rigor Considerations
Standards-Aligned Books Providing rigorous instruction is an important equity strategy. To support high quality teaching and learning, the Internet Archive collections include texts suggested by the Common Core framework. For example, beginning readers can borrow books for reading aloud like Pat Mora’s Tomas and the Library Lady, while middle schoolers can explore Mildred Taylor’s Roll of Thunder, Hear My Cry and high schoolers can tackle In the Time of Butterflies by Julia Alvarez.
Grade-Level Appropriate Books The Internet Archive’s digital libraries include grade-level appropriate collections that teachers can use to ensure that all students are appropriately challenged with complex and quality texts.
Special Needs Access
Read Aloud and Print Disabled Books For students who need or prefer to listen to and visualize the plot of a story, Open Library provides a read aloud option. When viewing a borrowed book online, students can click on the audio speaker icon and choose their preferred reading speed. For those who are visually impaired and have special software, print disabled books have been formatted through DAISY. These tools can support school efforts to employ a Universal Design for Learning approach.
Print Disabled Collection To make access to print disabled books across the collection even easier to find, the Internet Archive has curated a Books for People with Print Disabilities section. Over 1.5M books are accessible through this page and cover the wide range of topics available in the broader library from History and Science to Children’s literature.
Working toward educational equity should be core to the mission of every school. By supporting resource access, instructional rigor, and special needs access, tools like the Internet Archive’s digital libraries can help schools move toward this essential goal.
The following blog post was written by freelance writer Caralee Adams about the Internet Archive’s Library Leaders Forum, held on October 23 at San Francisco Public Library.
As enthusiasm grows for making library collections more accessible, the Internet Archive hosted an event to build a community of practice around Controlled Digital Lending (CDL). A diverse group gathered for the 2019 Library Leaders Forum Oct. 23 to share stories and strategies for libraries to expand their reach by lending out digital books based on their physical collections.
Why is this important?
“At the Internet Archive, we have a strong belief that everyone deserves to learn. We want to offer up the greatest digital library the Internet has ever seen to the world for free,” said Chris Freeland, Director of the Internet Archives’ Open Libraries program. “We think that everyone, regardless of where they live, should have ready access to a great library. More importantly, we think it should be available on phones and mobile devices that people turn to today. We want to make sure they have access to vetted, trusted information that’s held in libraries.”
The mantra of CDL: “Own one, loan one.” The idea is that a library can make a choice of lending either a physical copy or a digital version of a book.
The Internet Archive has been doing CDL since 2011, beginning with the Boston Public Library. Now two dozen other libraries of all sizes in the U.S. and Canada have embraced the model. Librarians from some of those institutions spoke about their passion for the practice at the forum.
The meeting provided an overview of the legal issues, policy considerations, and examples of CDL in action. The appeal to library leaders gathered was to endorse CDL, join Open Libraries, donate books to the Internet Archive for scanning, and volunteer to help with a new serials project.
Helping libraries see what’s possible
Michael Lambert, City Librarian at the San Francisco Public Library, which hosted the event, shared his institution’s experience as an early partner with the Internet Archive on Open Libraries and CDL. Beginning with city government documents and historical materials, SFPL created an entire scanning department. To date, the library has digitized 13,000 books and documents with the Internet Archive, which have received over 7.5 million views. Since November 2018, SFPL has donated 30,000 copyrighted books to the Internet Archive as part of its community distribution program.
“Having this alternative virtual lending site as an option has been great,” Lambert said. ”Librarians have been able to confidently weed excess, outdated materials from our collection, secure in knowledge that the books will not disappear, but rather have a new life where people around the world can read and research the materials that SFPL has meticulously collected over the decades.”
The Internet Archive embodies library values: persistence, comprehensiveness and accessibility, said Lambert. “The Archive has become a crucial part of the broad library information eco-system,” he said. “They have provided examples that have challenged traditional libraries. The Internet Archive helps other libraries see what’s possible.”
What Internet Archive Founder Brewster Kahle hopes is possible is digitization will allow more online sources to be linked to books, providing people trust information.
“If Wikipedia is the encyclopedia of the Internet, we are trying to build the library of the Internet,” Kahle explained at the forum. “Let’s make it really easy for people to go deeper.”
So far, the Internet Archive has turned 122,000 references on Wikipedia to digitized book links through its online library. Still, a century of books is missing after 1923 because of copyright laws. Kahle called on libraries to help fill that gap.
As part of that strategy, the Internet Archive is trying to institutionalize CDL, a practice that has been successfully working in a handful of libraries for eight years with no negative pushback. Yet, it has not been widely embraced. Kahle appealed to libraries to endorse CDL and donate books for scanning to address the larger goal of universal access to knowledge.
Framing the approach
The forum hosted experts to explain the legal underpinnings of CDL and discuss how the concept fits into the overall push to level the playing field for access to information.
Lila Bailey of the Internet Archive moderated a conversation with Kyle Courtney, Copyright Advisor at Harvard University, David Hansen, Associate University Librarian at Duke University, and Michelle Wu, Associate Dean for Library Services and Professor of Law at the Georgetown Law Library in Washington, D.C.
They have written a paper spelling out how libraries can practice CDL within the confines the fair use doctrine in current copyright law. Copyright law established in 1976 and dating back to 1950 does not reflect the digital reality today and it should allow flexibility for libraries to lend out one book at a time – no matter what the format – digital or print, they maintain.
To garner broad support for the concept of CDL, John Bergmayer of the nonprofit, Public Knowledge, spoke about the need to build relationships with lawmakers and educate them on the issue. This summer, he led a group engaged in CDL to The Hill in Washington, D.C. to brief members of Congress and their aides on the importance of expanding access to library materials through CDL.
“You have to make a project matter to the politicians,” explained Bergmayer. In the case of CDL, it’s about outlining the benefits of providing access to rural patrons, protecting materials from damage from disasters, saving libraries money, and helping K-12 school libraries, among others. “You want to get people to do the right thing for their reasons, not your reason — and show how your issue affects voters.”
Heather Joseph, Executive Director of the Scholarly Publishing & Academic Resources Coalition (SPARC), said CDL fits into the larger open agenda that advocates for unrestricted access to research. “It’s a vision based on opportunity,” said Joseph. “An old tradition and a new technology have converged to make possible an unprecedented public good.”
Now more than ever, in an era of “fake news,” and “alternative facts,” free, immediate access to high-quality vetted, source material is crucial for scholars, scientists, students, journalists, policymakers – everyone, she said.
“CDL is a pragmatic, incremental step towards open that operates in a way that’s respectful of libraries current operations and of copyright. It moves the needle towards open,” said Joseph. “CDL can contribute to collective movement towards a full vision of open access to knowledge.”
Opening Doors for Students
Making digital books more widely available to students has the potential for remedying inequities in education. Nationwide, public school districts have lost 20 percent of their libraries and librarians in recent years. Lisa Petrides, founder of the non-profit Institute for the Study of Knowledge Management in Education, has embraced CDL as a model to build a Universal School Library (USL) and connect students – particularly from under-resources schools — to relevant materials that increasingly are digital.
“CDL holds the potential to broaden access to knowledge in public schools in a way that schools haven’t even begun to tap,” said Petrides, who is trying to curate an inclusive collection of 15,000 high-quality digitized books. “We are taking an equity lens in terms of diversity.”
The Detroit School of Arts will be piloting USL and Librarian Karen Lemmons said she was excited to be able to offer her high school students books they can access while they are on the go. “This might give them an opportunity to read in between practices. They can pull out their phone and read a few pages. It’s mobile and flexible,” said Lemmon, noting that reading is closely linked to student achievement. “Our students really do want to be the best.”
Lemmons said she wants to be a model for other urban schools. “We want to be a driving force to get other libraries involved,” said Lemmons. “This is a data-driven district and we will need data to show reading more makes a difference in student performance.”
When the prestigious Phillips Academy in Andover, Massachusetts, recently was doing a $20 million renovation to its library, the Internet Archive approached it about digitizing their collection. The library already had its books packed on pallets, but instead of storing them decided to have them all scanned, explained Michael Barker, Director of Academy Research, Information and Library Services.
“We had this very well-intentioned idea to create a space for learners of the 21st century. It’s all good. It is a space of immense privilege. But it takes a vision to think well beyond our campus to say that belongs to every learner. That opportunity is to digitize the entire collection – that’s why we are all in,” said Barker of the school’s decision to participate in CDL “It goes to the heart of what Phillips was founded on. This school is for youth from every quarter and we try to live out that ideal as a private school for a public purpose.”
Next, Barker said he would like to see peer prep schools join the CDL model to further expand access to schools without the same resources.
CDL in Action
As the first library to use the CDL approach, the Boston Public Library recently extended its offerings by scanning its historic Alice Jordan Collection of 250,000 children’s books that were in storage. It has also digitized city directories, cookbooks and other fact-based documents in its catalog. Recently, it got permission from Boston-based publisher Houghton Mifflin to digitize its entire trade collection that is housed at BPL.
Expanding its CDL involvement, BPL’s Tom Blake challenged participants to bring another partner library next year to the forum.
“This the first time, I feel like it’s less about digitization and scanning and more about us, as librarians, leveraging not just our collections, but our historical collection policies with each other,” said Blake, who has been attending the library leaders forum for 10 years.
In discussing how to improve the CDL process, meeting participants suggested adjusting the amount of time users checked out titles and allowing for short-term loans. Perhaps smarter return and wait-list notifications could be developed to encourage faster processing of books. Others said re-branding Digital Rights Management (DRM) software with a different moniker to that would be more appealing to librarians.
In Sonoma County, California, Geoffrey Skinner said its 14 public library branches have just starting to participate in CDL. It first scanned documents in the history and genealogy library, then digitized its specialized wine library.
“We are doing a massive weed of our closed stacks. By taking those material to the Internet Archive, we will have digital access back,” said Skinner. Having library materials online will benefit many of the county’s rural users who otherwise travel far to access the physical books and provide access for print-disabled patrons.
Justin Gardner, Special Collections Librarian at the American Printing House for the Blind in Louisville, Kentucky, said digitizing 9,000 books in its collection has preserved rare and fragile documents, including books autographed by Helen Keller. Also, being located in Kentucky, it gives people interested in their materials from anywhere.
“We are becoming the go-to place for visual impairment materials,” said Gardner. Now these research documents are in an accessible form for people who have visual impairments and have never been able to read these materials before they were digitized.
At the forum, Mike Buschman of the Washington State Library announced that the Chief Officers of State Library Agencies (COSLA) voted to endorse CDL. “It feels like it’s entering a new, good phase – a traction phase,” he said.
Kahle emphasized the need for CDL to be a community project and build a deeper collection. “We have to brave up,” he said. “We just act in good faith. We aren’t pirates. We are trying to do the right thing.”
Chief Librarian and CEO at the Hamilton Public Library in Canada Paul Takala said his institution is an enthusiastic supporter of CDL. With a long history of innovation, moving forward with digitizing is the right move – despite the technical challenges – to make information more accessible to patrons, he said.
“Deeper collaboration is needed. It’s hard to get adequate resources,” said Takala. “As a library community, we are generally risk adverse. When we talk about CDL, I think we need to take a more balanced view….If we make what’s available in our community to other communities – and others make their collections available – then everyone wins.”
Dale Askey, Vice Provost at the University of Alberta, said he liked Takala’s challenge to pull more Canadian institutions past their risk aversion to embrace CDL. “It’s great to see people aligning behind these principles and taking this to scale,” said Askey, whose university has scanned an historic collection of education materials with zero negative impact. “There is a strong history and impulse at the university to do things with maximum benefit to the largest possible community.”
Princeton Theological Seminary is piloting CDL and it has created a secure area in its library for the physical collection, so that when a digital copy is checked out that the physical copy will reside there. Participating the program has great potential benefits for the seminary’s reach, according to Managing Director of the Library Evelyn Frangakis.
“The PTS comprehensive theological collection is in high demand and the CDL library allows increased accessibility to all users, including those with various print disabilities,” said Frangakis. “I think CLD is gaining momentum. That’s really heartening for broad access to the materials that we are able to contribute to this program. It’s going to continue to grow.”
Ross Mounce, Director of Open Access Programmes at Arcadia, a charitable fund of Lisbet Rausing and Peter Baldwin in London, said he was encouraged by participation in the forum and said action points were clear and institutions can choose their level of engagement.
“It’s nice seeing things moving forward. At the end of the day, it just makes sense,” said Mounce of CDL. “If you own a physical copy of a book, you should be able to loan a digital version of it. Libraries should be able to lend books.”
Added Wu of Georgetown: “I’m delighted there has been a lot more buy in in recent years. The voices and the participants are much more diverse. Libraries [like Phillips] are willing to go all in and that’s remarkable. It is true that if we get more of those, I think we will see a true movement across the nation.”
Every month, we look over the total download counts for all public items at archive.org. We sum item counts into their collections. At year end 2014, we found various source reliability issues, as well as overcounting for “top collections” and many other issues.
archive.org public items tracked over time
To address the problems we did:
Rebuilt a new system to use our database (DB) for item download counts, instead of our less reliable (and more prone to “drift”) SOLR search engine (SE).
Changed monthly saved data from JSON and PHP serialized flatfiles to new DB table — much easier to use now!
Fixed overcounting issues for collections: texts, audio, etree, movies
Fixed various overcounting issues related to not unique-ing <collection> and <contributor> tags (more below)
Fixes to character encoding issues on <contributor> tags
We now track *all collections*. Previously, we only tracked items tagged:
For items we are tracking <contributor> tags (texts items), we now have a “Contributor page” that shows a table of historical data.
Graphs are now “responsive” (scale in width based on browser/mobile width)
The Overcount Issue for top collection/mediatypes
In the below graph, mediatypes and collections are shown horizontally, with a sample “collection hierarchy” today.
For each collection/mediatype, we show 1 example item, A B C and D, with a downloads/streams/views count next to it parenthetically. So these are four items, spanning four collections, that happen to be in a collection hierarchy (a single item can belong to multiple collections at archive.org)
The Old Way had a critical flaw — it summed all sub-collection counts — when really it should have just summed all *direct child* sub-collection counts (or gone with our New Way instead)
So we now treat <mediatype> tags like <collection> tags, in terms of counting, and unique all <collection> tags to avoid items w/ minor nonideal data tags and another kind of overcounting.
… and one more update from Feb/1:
We graph the “difference” between absolute downloads counts for the current month minus the prior month, for each month we have data for. This gives us graphs that show downloads/month over time. However, values can easily go *negative* with various scenarios (which is *wickedly* confusing to our poor users!)
Here’s that situation:
A collection has a really *hot* item one month, racking up downloads in a given collection. The next month, a DMCA takedown or otherwise removes the item from being available (and thus counted in the future). The downloads for that collection can plummet the next month’s run when the counts are summed over public items for that collection again. So that collection would have a negative (net) downloads count change for this next month!
Here’s our fix:
Use the current month’s collection “item membership” list for current month *and* prior month. Sum counts for all those items for both months, and make the graphed difference be that difference. In just about every situation that remains, graphed monthly download counts will be monotonic (nonnegative and increasing or zero).
More than doubling the number of books available to print disabled people of all ages, today the Internet Archive launched a new service that brings free access to more than 1 million books — from classic 19th century fiction and current novels to technical guides and research materials — now available in the specially designed format to support those who are blind, dyslexic or are otherwise visually impaired.
“Every person deserves the opportunity to enhance their lives through access to the books that teach, entertain and inspire,” said Brewster Kahle, founder and Digital Librarian of the Internet Archive. “Bringing access to huge libraries of books to the blind and print disabled is truly one of benefits of the digital revolution.”
Kahle also announced that the Internet Archive will be investing in the growth of its virtual bookshelf by funding the digitization of the first 10,000 books donated. Individuals and organizations are welcome to donate their favorite book or a collection of books. Books in all languages welcome. To donate books visit: http://openlibrary.org/bookdrive
Dr. Marc Maurer, President of the National Federation of the Blind, said: “Blind people must have access to repositories of digital information if we are to reach our goal of becoming full and equal participants in society. Access to the books that have been scanned by the Internet Archive in a format accessible to the blind will be another step toward that goal. We are excited about continuing to work with Internet Archive to make access to more books a reality.”
The 1 million+ books in the Internet Archive’s library for print disabled, are scanned from hard copy books then digitized into DAISY — a specialized format used by blind or other persons with disabilities, for easy navigation. Files are downloaded to devices that translate the text and read the books aloud for the user to enjoy. To access books visit: http://openlibrary.org/subjects/accessible_book
Jessie Lorenz is a 31 year old woman who was born blind and is the Associate Director of at the Independent Living Resource Center in San Francisco. She believes, “Knowledge is power – and like everyone else, blind and print disabled people need equitable access to books to help them be innovative, productive, contributing community members.”
Older books are available from the Internet Archive’s unencrypted DAISY library and modern books can be accessed by “qualified users” through their NLS key — an encrypted code provided by the Library of Congress’ National Library Service for the Blind and Physically Handicapped (NLS), that is dedicated to providing materials to the print disabled. Currently, over 800,000 people in the US are registered with the Library of Congress as being print disabled.
As of today, the Internet Archive offers over one million books for print disabled people. Other large libraries for the print disabled including NLS, Bookshare.org, and Reading for the Blind & Dyslexic.
“This demonstrates why having open and public access to published works is so important,” said Kahle.
Ben Foss, President of Headstrong, an advocacy group for people with dyslexia said, “As dyslexic and print-disabled students scramble to complete their end-of-year research papers and projects, beginning today, there is a great new library of resources that will expand the tools these young people need to be successful in school and in life.”
By leveraging automated scanning and conversion processes, Internet Archive technicians can conduct a cost-efficient scan of more than one thousand books per day. Books are scanned at sites located in San Francisco, New York, Los Angeles and other major cities in five countries. Most of the older scanned books have been reformatted for the print-disabled from broad digitizing projects. Scanned physical books came from the collections of over 150 libraries, most of which are in the Open Content Alliance, but others as well. The funding of those scanning projects is coming from foundations, corporations and governments.
Most of the older books have been scanned from library collections, with newer books having been donated to the Internet Archive by companies such as the online bookseller Alibris, libraries and individuals.
The print disabled collection of books are now available through the Archive’s new Open Library site (www.openlibrary.org), which serves as a gateway to information about millions of hardcopy books and more than 1 million electronic books.
The Internet Archive will continually increase the number books it makes available. They are currently seeking donations of books and ebooks from individuals, libraries and publishers. The Archive is announcing today its commitment to fund the scanning and automatic processing of the first 10,000 donated books. Any organization or individual that would like to make particular books or collections available are encouraged to donate them by sending them to the Internet Archive. For donations of large collections please contact the Internet Archive. Financial support is also welcome to expand the program.
The Internet Archive is a 501(c)(3) non-profit organization that was founded to build an Internet library. Its missions are to offer universal access to all knowledge and provide specialized services for adaptive reading and information access for the blind and other persons with disabilities. Internet Archive, 300 Funston Avenue, San Francisco California 94118, email@example.com, www.archive.org, www.openlibrary.org, +1-415-561-6767.
“. . . (the Internet Archive) serves as custodian of much of NASA’s current and legacy digital imagery records. In addition, IA will help digitize NASA’s historically significant, analog images for inclusion on the Web site, enabling digital archiving with the National Archives and greater public access to these records via the IA Website.”
“Strictly on its own initiative, IA recently began to capture NASA’s publicly posted social media content. NASA is considering exploration of how this activity might be leveraged for records management purposes.”