Neil Thomson (n.thomson(at)nhm.ac.uk)
| Wouter Addink | (wouter(at)eti.uva.nl) | Technical - NCD Toolkit |
| Carol Butler | (butlercr(at)si.edu) | Terminology |
| Markus Döring | (m.doering(at)BGBM.org) | Technical - RDF |
| Doug Holland | (doug.holland(at)mobot.org) | Data mapping |
| Barbara Mathé | (mathe(at)amnh.org) | Data mapping |
| Connie Rinaldo | (crinaldo(at)oeb.harvard.edu) | Documentation |
| Larry Speers | (lspeers(at)gbif.org) | GBIF liaison |
| Günter Waibel | (Guenter_Waibel(at)notes.rlg.org) | Resource organiser |
The Group is developing and supporting a data standard, called Natural Collections Description (NCD) for describing entire collections of natural history materials. Examples include collections of specimens, observations data, original artwork, photographs and those from the many voyages of discovery. Collection description records contain information about the collection, access and usage of the collection and where to get more detailed information.
There are valuable collections that have no information stored in any database and many do not have a presence on the Internet. Such collections are easily overlooked by researchers as they become more reliant on the Web. A brief descriptive record as defined by the NCD standard can act as the “business card” for a collection, providing enough information to identify and locate it.
A component of the NCD standard will be used as the “business card” of organisations in the area of natural history and biodiversity research. NCD records will allow an organisation to be identified from its code. NCD will also provide information on any other relevant codes.
The Convener would be pleased to hear from anyone, and particularly:
Natural Collections Description (NCD) is a data standard for describing collections of natural history materials at the collection level; one NCD record describes one entire collection.
NCD is a lightweight resource description standard specific to natural history. NCD is between a general resource discovery standards such as Dublin Core (DC) and rich collection description standards such as the Encoded Archival Description (EAD). It will be possible to extract a Dublin Core record from an NCD record for use with general resource discovery systems or. It is also possible to use an NCD record as the seed for a richer collection description.
The NCD standard covers any type of natural history collection, including specimens, original artwork, archives, observations, library materials, datasets, photographs or mixed collections such as those that result from expeditions and voyages of discovery.
Collection descriptions are electronic records that document the holdings of organisations as groups of items. Such descriptions complement the more traditional item-level records describing a single specimen or a library book. Each collection record describes one entire collection, including information on the extent and purpose of the collection, conditions of access and use and who to contact for more information.
A collection is loosely defined as any group of things that have something in common. That "something in common" can be defined by the basic questions that users ask when accessing collections - who, what, where and when. Examples of collections include:
In natural history museums, for example, the best known collections are the specimens, the library and the archives. Exhibitions, paintings and sculptures are also collections. The electronic equivalents include datasets and databases (which are collections of item-level records) and the thematic sections of Web-sites.
Three main areas are served by collection-level description records:
Collections of natural history material can be large. Consequently detailed item-level descriptions often take a long time to generate. A collection-level record can ensure that knowledge about a collection can be rapidly revealed.
It is also useful to hold the details that are common to the whole collection once, rather than being repeated with the record for each item.
Relating collections that are in museums, libraries, archives and other organisations (cross-domain resources) is a priority for many governments. By adopting the same standard for collections in each domain, it is possible to search across all collections, regardless of domain or location.
"Virtual collections" are another way to link resources. Some organisations divide collections between departments for curatorial purposes. Researchers would need to contact each department individually to assess the complete collection. Similarly, some collections, such as those of Darwin, have been dispersed throughout several organisations. These collections may be re-united in a virtual sense, using collection descriptions for each component.
Collection descriptions are verbose, and therefore complement the rows of raw data that are produced by item-level databases.
A collection description record can be created for a collection whether the items in that collection have their own records in a database, or not. Where a database containing item-level details exists, a link can be provided to that database to provide more detail.
If the collection does not have an item-level database, producing a collection description reduces the chances of that collection being overlooked by researchers using the Web for resource discovery.
Collection descriptions enable a broad perspective. A set of collection descriptions could serve a variety of additional purposes for any organisation:
Collection descriptions can serve to prevent loss of data that is in a physical form or data in a format that is nearing technological obsolescence.
Creating collection descriptions for datasets that includes format information will help to act as an early warning so that data can be transferred to a current format. Data then becomes part of a digital preservation programme, rather than a digital archaeology project.
Collection description records act as a convenient place to store information volunteered by visitors.
Re-use of data is important. Collection description records can be created with the use of existing published or unpublished resources such as printed catalogues, exhibition planning documents or archival finding aids. Conversely, data recorded in the collection description records can be used to produce exhibit labels.
The data can also be provided to external initiatives, some of which wish to merge data from several sources to provide regional coverage of biodiversity collections.
The first steps in standardising collection-level description began with a European Union Framework V project known as BioCASE - the Biodiversity Collections Access Service for Europe. This project ran from November 2001 until early 2005 (more on this at the BioCASE project website.)
An RLG Programs working group of mainly North American natural history librarians and archivists known as RAVNS (Resources AVailable in Natural Sciences) has made the BioCASE metadata standard more generally applicable, rather than dealing only with specimen collections it is now a standard that covers any type of natural history collection. RAVNS includes representatives from the following institutions:
Discussion at the TDWG 2004 meeting in Christchurch, New Zealand, concluded that a standard for describing natural history collections would fit well with the suite of data standards being developed on behalf of the Global Biodiversity Information Facility (GBIF).
The Collections Descriptions Interest Group brings together work on collections descriptions being carried out for the European Union Framework VI programme known as SYNTHESYS and the work performed by RAVNS under the auspices of RLG with that carried out by TDWG members.
Natural Collections Descriptions (NCD) covers all types of collections of natural history material; specimens, original artwork, photographs, archives, published material or a mixture.
The Interest Group is developing NCD for use with RDF to ensure that it integrates with TDWG's common development architecture. The standard enables the aggregation of collections descriptions from many sources and facilitates resource discovery - particularly for collections that do not have a Web presence.
NCD is a standard between the general resource discovery standards such as Dublin Core, and the rich collections description standards such as EAD. Mappings enable the extraction of a Dublin Core record from an NCD record or, conversely, filling out an NCD record into an EAD record.