🚜 Combine

Overview

Combine is a Django application to facilitate the harvesting, transformation, analysis, and publishing of metadata records by Service Hubs for inclusion in the Digital Public Library of America (DPLA).

The name "Combine", pronounced /kämˌbīn/, is a nod to the combine harvester used in farming famous for, "combining three separate harvesting operations - reaping, threshing, and winnowing - into a single process" Instead of grains, we have metadata records! These metadata records may come in a variety of metadata formats, various states of transformation, and may or may not be valid in the context of a particular data model. Like the combine equipment used for farming, this application is designed to provide a single point of interaction for multiple steps along the way of harvesting, transforming, and analyzing metadata in preperation for inclusion in DPLA.

Documentation

Documentation is available at Read the Docs.

Installation

Combine has a fair amount of server components, dependencies, and configurations that must be in place to work, as it leverages Apache Spark, among other applications, for processing on the backend. There are a couple of deployment options.

Docker

A GitHub repository Combine-Docker exists to help stand up an instance of Combine as a series of interconnected Docker containers.

Server Provisioning with Vagrant and/or Ansible

To this end, use the repository, Combine-playbook, which has been created to assist with provisioning a server with everything neccessary, and in place, to run Combine. This repository provides routes for server provisioning via Vagrant and/or Ansible. Please visit the Combine-playbook repository for more information about installation.

Name		Name	Last commit message	Last commit date
Latest commit History 1,468 Commits
combine		combine
core		core
docs		docs
inc		inc
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
manage.py		manage.py
pyspark_shell.sh		pyspark_shell.sh
requirements.readthedocs.txt		requirements.readthedocs.txt
requirements.txt		requirements.txt
runcelery.sh		runcelery.sh
runconsole.sh		runconsole.sh
runserver.sh		runserver.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

🚜 Combine

Overview

Documentation

Installation

Docker

Server Provisioning with Vagrant and/or Ansible

About

Uh oh!

Releases 21

Packages

Uh oh!

Contributors 6

Uh oh!

Languages

Uh oh!

License

Uh oh!

MI-DPLA/combine

Folders and files

Latest commit

History

Repository files navigation

🚜 Combine

Overview

Documentation

Installation

Docker

Server Provisioning with Vagrant and/or Ansible

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 21

Packages 0

Uh oh!

Contributors 6

Uh oh!

Languages

Packages