| CARVIEW |
Select Language
HTTP/2 200
server: GitHub.com
content-type: text/csv; charset=utf-8
last-modified: Tue, 02 Apr 2024 15:57:45 GMT
access-control-allow-origin: *
strict-transport-security: max-age=31556952
etag: W/"660c2af9-2494"
expires: Thu, 15 Jan 2026 23:30:29 GMT
cache-control: max-age=600
content-encoding: gzip
x-proxy-cache: MISS
x-github-request-id: 55CD:7ED56:5601:7ECC:6969763A
accept-ranges: bytes
age: 0
date: Thu, 15 Jan 2026 23:20:29 GMT
via: 1.1 varnish
x-served-by: cache-bom-vanm7210087-BOM
x-cache: MISS
x-cache-hits: 0
x-timer: S1768519230.617475,VS0,VE223
vary: Accept-Encoding
x-fastly-request-id: 155f6a2a39860292272b8db26128d1d8d4ed6970
content-length: 3807
quote,name,date
Know how optical character recognition (OCR) output is produced in your digitization workflows. What software is used? What formats are created? What levels of accuracy are produced? Where is it stored? Is it available for user download?,,
"Create an inventory of full-text collections managed by your institution. Document rights status, license status, discoverability, and downloadability. Ask the question: are we offering optimal access for computational use of the full-text? How can we make it better?",,
Migrating a legacy digital collection to a new system or platform? Take the opportunity to make the content accessible to researchers that have computational projects in mind.,,
"Interview the archivist, librarian, or curator responsible for a digital collection to document data provenance and decisions made in the course of collection processing and digitization. Work to make this information publicly available.",,
"Inventory your data holdings. Just make a simple list. And then commit to keeping it up to date, and watch it grow.",,
Add new fields to the collection management database to indicate and describe data components. ,,
"Survey your digital collections to identify characteristics -- good metadata, open access, good OCR, high usage, relevance to a high-profile academic program or research area at the institution -- which lend themselves to high impact as data.",,
Recognize and identify the things you need to do differently than have been done for physical collection objects.,,
"Find out if your digital collection database or access platform has an API available for querying by the public. If it does not, see if it is possible to develop one. If it does, determine if it is actively used. If it is actively used, see if you can reach out to users and ask about their usage! ",,
Talk to a colleague responsible for systems that provide networked access to digital collections about possible approaches to facilitate download of collection data in bulk.,,
Add a terms of use to your archival finding aids.,,
Read the language of your organization’s collection deed of gift or purchase agreement to evaluate whether it allows for providing access to collection content in the form of data.,,
Review your digital collections metadata and evaluate the rights statements and license statements in terms of consistently and clarity. Are you able to adopt rightsstatements.org?,,
Socialize CAD as something that can be supported by units and staff across the library. Identify some champions across the organization and people who have skills or positions to do the work.,,
Talk to people responsible for research data management to encourage planning for data preservation and other considerations that make it possible for others to reuse the data in the future.,,
"Review your institution’s mission statement or strategic plan documentation, and consider if and how collections as data activities are aligned with and support it.",,
Share sample projects with community partners to give them an idea of how their collections can be used and be relevant to new ways of conducting scholarship. ,,
Network with people who work with data and have the skills or knowledge you need to get your work done.,,
"Identify barriers and limitations to what services you can offer support, and talk with colleagues about creative, feasible solutions to overcome them.",,
"Publish or present on ""Wikidata for librarians,"" including case studies of libraries working with Wikidata to expand discovery of collections.",,
"Read up on IIIF (for example, check out this useful tutorial) and determine what hurdles to implementation exist at your institution. Then talk to relevant folks about what it would take to overcome them.",,
Read the resources in the Always Already Computation project's Zotero library.,,
Develop a workshop focused on the use of data in and about collections; shop it to department faculty and incorporate it into research orientations for faculty and students.,,
Mentor a liaison interested in learning a data science skill who is well positioned to identify datasets and data support needs amongst their researchers.,,
"Conduct user testing of your library’s main discovery environment, with the goal of understanding how easy or hard it is for a researcher to find the available data collections.",,
Develop a portal page with a site map specifically for discovering collections at your institution available for computational use and related support services.,,
Begin tracking demand for and use of data in and about your collections. ,,
"For a collection that cannot be made available openly on the web, investigate if your organization is able to support mediated access to the data, such as through an offline or encrypted workstation.",,
"Prepare and provide datasets that are intentionally useful, in terms of size and complexity, for teaching in semester- or quarter-long classes.",,
"For classes that draw directly on library collections and generate data, ask the students to submit their data products back to library, through the institutional repository. Normalize the process of giving back and augmenting the collections with data. This may work particularly well for collections that are institutionally or regionally focused.",,
Identify a faculty member who does computational analysis for their own research and find a way to transfer or replicate the tools and approaches they use to apply them to a library collections-as-data use case.,,
"If you offer an API to your repository, evaluate the public-facing documentation to see if it is clear, current, accurate, and discoverable by researchers. ",,
"Publish documentation about how to find, use, and interpret collections as data in multiple places including blogs, README files, and LibGuides.",,
"A dataset should always be accompanied by a README plain text file that documents basic, important information about the data. Make READMEs part of your data documentation practice. Develop one or more templates that can be used by librarians and researchers.",,
"Make an effort to make existing OCR output generated from past scanned text collections projects more available for computational analysis, such as through bulk download.",,
"When planning your next digitization project, incorporate additional steps for preparing content files, OCR or transcription text, and metadata for bulk access. Document the key issues and decision points you encounter as you evolve and expand your digitization workflows.",,
Talk to colleagues involved in taking in deposits to your institutional repository or research data repository about a process for encouraging and accepting contributions back from users of data in your collections.,,
Gain the support of administration by following and supporting the work of third-party research groups like OCLC that help bolster and highlight the trends in the development of collections as data.,,
"Provide a resource that shows a data user how to cite a dataset, and that shows a data creator how to format a preferred citation for an original dataset and a derivative dataset.",,
Ask a subject specialist at your institution if faculty or students are requesting data about or derived from library collections.,,
"Take a public services librarian, curator, or archivist out for coffee to talk about collections as data. Ask what they are hearing from faculty, students, and other users about computational use of collections. Try to lower barriers to computational use. ",,
"Investigate how your library is collecting, managing, and making email archives accessible. Consider whether a collections as data approach will serve your institution's goals.",,
"Start small. Start with a research question, and choose projects that have promise to be generalizable for use by future scholars such that the investment is worth the level of commitment. No one-offs!",,
Start with a prototype/proof of concept. It's fine if your CAD project does not integrate with institutional repository or formalized infrastructure. ,,
"Collaborate with subject specialists or instruction librarians to ask scholars about interest in computational data in and about collections. Compile their ideas to make a case, and build a team for the next opportunity to pursue one of them.",,
"Be thoughtful and strategic about allocating scarce resources to collection digitization projects. Consider prioritizing projects that produce outcomes that are reusable (derivative datasets) and repeatable (processes, tools, workflows) that can benefit your department and your users again and again.",,
"Explore what it would take for your organization to contribute subject data to Wikidata, drawing on a local collection and then incorporating the Wikidata links into your local discovery environment.",,
Test how data gathered in a crowdsourcing project can be associated with the existing source object data and can also serve as stand-alone dataset.,,
Use your favorite search engine to find information about APIs provided by museums and read about the various ways that data about museum collections can be analyzed to discover new insights.,,
"Keep tabs on the projects emerging in the Collections as Data: Part to Whole project, funded by the Mellon Foundation. They are bound to point a way forward for us all!",,