archiving and digital preservation (dp)

archivebox

23 May 2020Last Commit6446 (2112/yr)Github Stars77Issues

▶️ Quickstart | Demo | Github | Documentation | Info & Motivation | Community | Roadmap

ArchiveBox takes a list of website URLs you want to archive, and creates a local, static, browsable HTML clone of the content from those websites (it saves HTML, JS, media files, PDFs, images and more).

You can use it to preserve access to websites you care about by storing them locally offline. ArchiveBox imports lists of URLs, renders the pages in a headless, authenticated, user-scriptable browser, and then archives the content in multiple redundant common formats (HTML, PDF, PNG, WARC) that will last long after the originals disappear off the internet. It automatically extracts assets and media from pages and saves them in easily-accessible folders, with out-of-the-box support for extracting git repositories, audio, video, subtitles, images, PDFs, and more.

ckan

23 May 2020Last Commit2639 (309/yr)Github Stars361Issues

CKAN is the world’s leading open-source data portal platform. CKAN makes it easy to publish, share and work with data. It's a data management system that provides a powerful platform for cataloging, storing and accessing datasets with a rich front-end, full API (for both data and catalog), visualization tools and more. Read more at ckan.org.

See the CKAN Documentation for installation instructions.

If you need help with CKAN or want to ask a question, use either the ckan-dev mailing list, the CKAN chat on Gitter, or the CKAN tag on Stack Overflow (try searching the Stack Overflow and ckan-dev archives for an answer to your question first).

archivesspace

22 May 2020Last Commit207 (26/yr)Github Stars57Issues

Built for archives by archivists, ArchivesSpace is the open source archives information management application for managing and providing web access to archives, manuscripts and digital objects.

The latest technical documentation is managed in a separate GitHub repository ArchivesSpace tech-docs and is published along with the API documentation and architecture notes, at http://archivesspace.github.io/archivesspace/.

ArchivesSpace is released under the Educational Community License, version 2.0. See the COPYING file for more information.

archivematica

21 May 2020Last Commit207 (27/yr)Github Stars95Issues

By Artefactual

Archivematica is a web- and standards-based, open-source application which allows your institution to preserve long-term access to trustworthy, authentic and reliable digital content. Our target users are archivists, librarians, and anyone working to preserve digital objects.

You are free to copy, modify, and distribute Archivematica with attribution under the terms of the AGPLv3 license. See the LICENSE file for details.

Thank you for your interest in Archivematica! For more details, see the contributing guidelines