Posted on Friday August 7, 2015

Gregory Markus

EuropeanaTech Community Manager, Netherlands Institute for Sound and Vision

Who's Using What - Chris McDowall and DigitalNZ Developer Profile

During the month of September we will present special "Who's Using What?" interviews with the development teams from three of the world's leading aggregators, protectors, and advocates for digital cultural heritage: DigitalNZ, DPLA, and our very own Europeana.

For any developer or institution currently managing, or planning to manage, such massive digital collections and successfully re-present them to the global public, these three development teams together hold an unprecedented amount of expertise, and they do it all while using open source software. Read here what tools these teams are favouring, what they've made for the world to improve upon and use, and what they hope to see in the future. Up first, Chris McDowall and the development team from DigitalNZ!

Chris McDowall and DigitalNZ

The National Library of New Zealand's DigitalNZ team makes New Zealand's digital content available to find, share and use. Like Europeana, DigitalNZ aggregates metadata from many digital collections so it is available to search and also encourages the development of new discovery experiences through an open API. The team works with the collections from a wide range of organisations across the cultural sector, community groups, businesses and media organisations. It also brings in metadata relevant to New Zealand from Europeana and DPLA. DigitalNZ is almost eight years old and its most recent development is the release of Supplejack, an open source tool for aggregating, searching and sharing metadata records. Supplejack collects metadata about millions of items from hundreds of data sources. The tool transforms messy data, creates a unified search index and makes the reconciled metadata widely available via an open API data service.

Chris McDowall is the manager of DigitalNZ Systems. He looks after the day-to-day running of the DigitalNZ infrastructure and coordinates feature development.

What open source tools are you currently working with?

Here is a list of the main open source tools we use. The true list would be much longer.

LANGUAGES / WEB FRAMEWORKS

  • Ruby
  • Rails
  • jQuery
  • SASS

DATABASES / DATA STORES / SEARCH

SERVERS

MISC

OUR HARVESTING / SEARCH PLATFORM

What open source tools have you used in the past to develop larger applications?

The tools mentioned above as well as a bunch that appear top of mind …

and lots of others I can’t think of right now ;)

What are you currently developing?

We are working on our open source metadata harvesting, aggregation and search platform called Supplejack. Supplejack powers the DigitalNZ service. Its main purpose is to make it easy to aggregate heterogeneous data at scale and provide ways to surface that data so it is more useful. From a data management perspective there are several things you can do with Supplejack:

  1. Define a common data schema that incoming data should map to
  2. Create search and database indexes that conform to your chosen schema
  3. Script instructions for extracting and mapping data from many different data sources
  4. Set up validation rules for your data harvesting activity
  5. Schedule data harvests to run at whatever frequency you like
  6. Run enhancement scripts to improve the quality or completeness of harvested data
  7. Deliver a public API of the standardised data
  8. Monitor API key activity and set query throttle rates
  9. View collected data on a demo website

Supplejack was designed to provide assurance about the quality of data management activities when working at scale.
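
To make a few of these steps more concrete, here is a minimal sketch of steps 1, 3, 4 and 6: a common schema, per-source mapping scripts, a validation rule and an enhancement pass. It is written in Python purely for illustration and is not Supplejack's own API or configuration syntax. Every name in it (`Record`, `map_library`, `map_museum`, `validate`, `enhance`, `harvest`) and the sample data are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


# Step 1 (hypothetical schema): every harvested item is mapped to these fields.
@dataclass
class Record:
    title: str
    source: str
    year: Optional[int] = None
    subjects: List[str] = field(default_factory=list)


# Step 3: one small mapping function per data source, each coping with
# that source's own field names and formats.
def map_library(raw: dict) -> Record:
    date = raw.get("Date")
    return Record(
        title=raw["Title"].strip(),
        source="library",
        year=int(date[:4]) if date else None,
        subjects=[s.strip() for s in raw.get("Subjects", "").split(";") if s.strip()],
    )


def map_museum(raw: dict) -> Record:
    return Record(
        title=raw["name"].strip(),
        source="museum",
        year=raw.get("year_made"),
        subjects=list(raw.get("tags", [])),
    )


# Step 4: validation rules return a list of problems (empty means valid).
def validate(record: Record) -> List[str]:
    errors = []
    if not record.title:
        errors.append("title is required")
    if record.year is not None and not (0 < record.year <= 2100):
        errors.append("year out of range")
    return errors


# Step 6: an enhancement pass that normalises and de-duplicates subjects.
def enhance(record: Record) -> Record:
    record.subjects = sorted({s.lower() for s in record.subjects})
    return record


def harvest(
    raw_items: List[dict], mapper: Callable[[dict], Record]
) -> Tuple[List[Record], List[Tuple[dict, List[str]]]]:
    """Map, validate and enhance items; invalid items are kept aside with their errors."""
    accepted, rejected = [], []
    for raw in raw_items:
        record = mapper(raw)
        errors = validate(record)
        if errors:
            rejected.append((raw, errors))
        else:
            accepted.append(enhance(record))
    return accepted, rejected


# Made-up sample data from two differently shaped sources.
library_items = [
    {"Title": " Map of Wellington ", "Date": "1890-01-01", "Subjects": "Maps; Wellington"},
    {"Title": "  ", "Date": "1901"},  # fails validation: blank title
]
museum_items = [{"name": "Kete", "year_made": 1920, "tags": ["Weaving", "weaving"]}]

accepted, rejected = harvest(library_items, map_library)
more_accepted, _ = harvest(museum_items, map_museum)
```

Keeping the source-specific logic in small mapper functions means the validation and enhancement code only ever sees the common schema. That is one plausible way to keep hundreds of heterogeneous sources manageable. Scheduling, indexing and the public API (steps 2, 5 and 7 to 9) are left out of this sketch.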

What would you like to see developed?

Oh, that is a big question. My immediate answer is better and simpler tools for querying/exploring and visualising linked data.
