Who's Using What: DPLA

During the month of September we are presenting special "Who's Using What?" interviews with the development teams from three of the world's leading aggregators, protectors, and advocates for digital cultural heritage: DigitalNZ, DPLA, and our very own Europeana. For any developer or institution currently working on, or planning to manage, such massive digital collections and successfully re-present them to the global public, these three development teams combined hold an unprecedented amount of expertise, and they do it all while using open source software. Read here what tools these teams are favouring, what they've made for the world to improve upon and use, and what they hope to see in the future. You can read our previous piece with Chris McDowall and the development team from DigitalNZ here, as well as Bram Lohman and the Europeana Development Team here. Up next, though: Mark Matienzo and the DPLA development team!

Mark Matienzo and the DPLA

The DPLA has been making waves in the digital heritage sector since its official beginning in 2013. Its goal isn’t much different from Europeana’s: providing easy digital access to a large aggregated corpus of digitised heritage materials. DPLA built its Metadata Application Profile (MAP) on Europeana’s EDM and has subsequently collaborated with Europeana on International Rights Statements and on putting them into action. But what drew me to the DPLA was something I heard at EuropeanaTech 2015 in Paris. There, DPLA Director Dan Cohen said in his wonderful presentation that the DPLA development team names all their tools after goats because “they’re curious and consume everything”. Well, those sounded like developers I needed to talk to. So without further ado, read our interview with Mark Matienzo from the DPLA.

Mark Matienzo is the Director of Technology for the Digital Public Library of America. Prior to joining DPLA, Matienzo worked as an archivist and technologist specialising in born-digital materials and metadata management at institutions including the Yale University Library, The New York Public Library, and the American Institute of Physics. Matienzo received an MSI from the University of Michigan School of Information and a BA in Philosophy from the College of Wooster, and was the first awardee (2012) of the Emerging Leader Award of the Society of American Archivists.

What open source tools are you currently working with?

Most of the current development work we’re doing is built using Ruby on Rails, the PostgreSQL database system, and Apache Solr or Elasticsearch for indexing. For our metadata projects, such as to support ingestion and metadata enhancement, we’ve been using Apache Marmotta as our triple store, and ActiveTriples to help with modeling RDF objects in Ruby code. We’re excited to start doing more with Blacklight and Hydra as well. We also use Ansible for deployment and automation, and Redmine for tracking issues.

What open source tools have you used in the past to develop larger applications?

We’ve used a variety of tools, including Omeka, Akara, and some of the same tools mentioned previously.

What are you currently developing?

All of DPLA’s current public projects are available on our GitHub account. Most of our work has been going towards Heiðrún and Krikri, parts of our new metadata ingestion system that has been under development since October 2014. We’re also just beginning a project called "Hydra in a Box", with Stanford University and DuraSpace, to develop a turnkey repository system built on the Hydra framework.

What would you like to see developed?

Our largest need is for high-performance storage and querying solutions for RDF data. We’re particularly excited about the work going into Linked Data Fragments, but we’re hoping that more implementations will get under way. We’d also like to see more convergence around developing shared metadata enhancement tools, or common APIs for developing them, such as for geocoding and for alignment of terms with authority files and vocabularies.

For more information about the DPLA, read Joris Pekel’s interview with Dan Cohen from EuropeanaTech 2015.