LUX Dataset

Providing users with a more straightforward and connected online collection landscape at Yale is essential to enhance collections use and research.  The resolution and reconciliation of variant metadata entities with external authorities, using best practices for production of Linked Open Data, is one critical element of this work. The resulting normalized index of metadata entities will support search and discovery within LUX: Yale Collections Discovery.  Additionally, the dataset exists as a tool for advanced research and data analysis. 

The dataset is comprised of metadata from five distinct internal sources:

  • Yale University online catalog (ORBIS)
  • Archives at Yale
  • Yale Center for British Art collections
  • Yale Peabody Museum of Natural History collections
  • Yale University Art Gallery collections

In addition, the LUX dataset uses internally-assigned identifiers to create alignments through local harmonization, reconcilation with external authorities, and enrichment (new data) from external sources.   In this regard, the 2019 grant awarded by the Andrew W. Mellon Foundation for metadata reconciliation and harmonization work at Yale has been transformational.  

Mellon Foundation Logo


  • Development of first generation schema to crosswalk data from each of the partner units into a shared index
  • Iterative improvement of unit data mapping to schema
  • Identification of local harmonization opportunities (for enhanced search and discovery)
  • Proof-of concept identifies data model limitations
  • Development of LUX Linked Data (LD) model, based on LinkedArt/CIDOC-CRM
  • Work underway within each unit to add identifiers for Agents (people/groups), Places, and eventually Concepts
  • Units transforming their data to JSON-LD for LUX
  • Design and prototyping of a data enrichment pipeline for LUX
  • Collaborative effort to identify reparative work to be be done within units and across the dataset

Background about Linked Data, and Knowledge Graph Discovery and LUX:

For more information about Yale’s Cultural Heritage metadata work, please contact: Rob Sanderson or Larry Gall