Type
Guideline

Data standards in Dandjoo

Summary

Making biodiversity data more searchable and usable.

Hierarchy
Part of Dandjoo

About data standards

Dandjoo brings data from many different people and organisations together. This means that when data arrives at BIO for ingestion it may contain a variety of different fields and be structured quite differently from submission to submission.

Incoming data needs to be standardised so it can be represented consistently in Dandjoo and be combined and searched by data users. Dandjoo does this by allowing key fields in data submitted to be mapped to widely recognised biodiversity data standards.

 

The Darwin Core standard

Darwin Core is the first biodiversity data standard to be incorporated into Dandjoo, and BIO is currently reviewing other standards (such as VegX) for inclusion in future releases.

Darwin Core is an internationally recognised standard that supports management and sharing of biodiversity data, and defines a glossary of terms in a flat structure to represent taxon, occurrences, specimens, and samples.

Resources:

 

The Dublin Core standard

The Dublin Core Metadata Initiative (DCMI) is specifically based around record-level definitions, with some terms integrated into the widely used Darwin Core standard. Links to documentation for the expanded Dublin Core standard are provided below.

Resources:

 

The Australian Biodiversity Information Standard (ABIS)

The Australian Biodiversity Information Standard (ABIS) is a standard being specifically developed for biodiversity data exchange in Australia. It combines many established standards (including Darwin Core, and TERN Ontology) and is being designed to be machine readable.

ABIS is introducing standards related to surveys and projects, and are used in Dandjoo’s new Systematic Survey Data release.

Resources:

Data fields used in Dandjoo

When users export search results from Dandjoo, the export will contain the following data fields shown in the table below.

Some fields may be blank for some records - this means that the submitter who provided a record did not collect or upload data for that field, or may not have mapped the field during data submission. However, all submitters must provide certain core fields to show an organism’s name, and to indicate where and when it was observed.

 

Data standardDandjoo FieldData Field Property NameData Field Property Description
N/ARecord IDRecord_IDThe persistent unique ID assigned to each record in Dandjoo
Darwin CoreData typedwc:eventTypeThe nature of the dwc:event, in this case refers to either the occurrence or systematic survey data submission pathway 
Darwin CoreLatitudedwc:decimalLatitudeThe geographic latitude (in decimal degrees, using the WGS84 (EPSG:4326) system) of the geographic centre of a Location. Positive values are north of the Equator, negative values are south of it. Legal values lie between -90 and 90, inclusive.
Darwin CoreLongitudedwc:decimalLongitudeThe geographic longitude (in decimal degrees, using the WGS84 (EPSG:4326) system) of the geographic centre of a Location. Positive values are east of the Greenwich Meridian, negative values are west of it. Legal values lie between -180 and 180, inclusive.
Darwin CoreDate observed/collecteddwc:eventDateThe date-time on which an Event occurred. For occurrences, this is the date-time when the event was observed. Not suitable for a time in a geological context.
Darwin CoreField Scientific Namedwc:scientificNameThe full scientific name. When forming part of an Identification, this should be the full name, including the lowest level taxonomic rank that can be determined. (Note: In Dandjoo, this is the taxonomic name originally provided by the data submitter, after curation to address any errors.)
Darwin CoreRecognised Scientific Namedwc:acceptedNameUsageThe full name, with authorship and date information if known, of the currently valid (zoological) or accepted (botanical) taxon. (Note: In Dandjoo, this is BIO’s determination of the most recently-known current name of the organism observed.)
Darwin CoreSubmitterdwc:institutionCodeThe name (or acronym) in use by the institution having custody of the object(s) or information referred to in the record. (Note: In Dandjoo, this is the organisation that had custody of the information in the record, and submitted it to Dandjoo.)
Dublin CoreDatasetdcterms:titleA name given to the resource. (Note: In Dandjoo, this is the dataset name provided by the submitter.)
Darwin CoreDataset ID*dwc:datasetIDAn identifier for the set of data. May be a global unique identifier or an identifier specific to a collection or institution. 
ABISProject nameabis:projectAn Activity that requires concerted effort following a Plan in pursuit of an objective
Darwin CoreProject IDdwc:parentEventIDAn identifier for the broader dwc:event that groups this and potentially other dwc:events.
ABISProject Purposeabis:purposeThe intent of the Activity
ABISSurvey nametern:surveyA Survey is an 'Activity' during which 'Sampling' or 'Observation' Activities occur.
Darwin CoreSurvey IDdwc:eventIDAn identifier for the set of information associated with a dwc:event (something that occurs at a place and time). May be a global unique identifier or an identifier specific to the data set.
Dublin CoreSurvey participantsdcterms:contributorAn entity responsible for making contributions to the resource.
ABISSurvey date range starttern:survey; prov:startedAtTimeSupports the association of a temporal entity (instant or interval) to any thing
ABISSurvey date range endtern:survey; prov:endedAtTimeSupports the association of a temporal entity (instant or interval) to any thing
Darwin CoreSurvey Summarydwc:eventRemarksComments or notes about the dwc:event.
Darwin CoreBounding boxdwc:footprintWKTThe ellipsoid, geodetic datum, or spatial reference system (SRS) upon which the geometry given in dwc:footprintWKT is based.
Dublin CoreHabitat, Type and Highlights Tags*dcterms:subjectA topic of the resource. Recommended practice is to refer to the subject with a URI. If this is not possible or feasible, a literal value that identifies the subject may be provided. Both should preferably refer to a subject in a controlled vocabulary.
Darwin CoreDocumentsdwc:associatedReferencesA list (concatenated and separated) of identifiers (publication, bibliographic reference, global unique identifier, URI) of literature associated with the dwc:occurrence.
Darwin CoreCountdwc:individualCountThe number of individuals present at the time of the Occurrence.
Dublin CoreRights Holderdcterms:rightsHolderThe person or organization owning or managing rights over the record. (Note: In Dandjoo, this is the submitter in most cases.)
Darwin CoreMethod/Protocoldwc:samplingProtocolThe names of, references to, or descriptions of the methods or protocols used during an Event.
GBIF Darwin Core Extension: Species DistributionConservation Status (authorized users only)threatStatusConservation status of a species.  (Note: This is populated and updated in Dandjoo based on the most recent threatened and priority species lists maintained by the Western Australian Government.)
Darwin CoreIdentification basisdwc:basisOfRecordThe specific nature of the data record. e.g. Fossil, live specimen etc.
Darwin CoreCollector*dwc:recordedByA list (concatenated and separated) of the globally unique identifier for the person, people, groups, or organizations responsible for recording the original occurrence. Recommended best practice is to separate the values in a list with “space vertical bar space” ( | ).
Darwin CoreField identification (original field name)dwc:verbatimIdentificationA string representing the taxonomic identification as it appeared in the original record.
Darwin CoreHuman observation ID*dwc:occurrenceIDAn identifier for the Occurrence (as opposed to a particular digital record of the occurrence). In the absence of a persistent global unique identifier, construct one from a combination of identifiers in the record that will most closely make the occurrenceID globally unique.
Darwin CoreDate identifieddwc:dateIdentifiedThe date on which the subject was determined as representing the Taxon.
Darwin CoreIdentification Ambiguitydwc:identificationQualifierA brief phrase or a standard term ("cf.", "aff.") to express the determiner's doubts about the Identification.
Darwin CoreIdentified by*dwc:identifiedByA list (concatenated and separated) of names of people, groups, or organizations who assigned the Taxon to the subject. Recommended best practice is to separate the values in a list with “space vertical bar space” ( | ).
Darwin CoreSpecimen ID*dwc:materialSampleIDA physical result of a sampling (or subsampling) event. In biological collections, the material sample is typically collected, and either preserved or destructively processed.
Darwin CoreIdentification notesdwc:identificationRemarksComments or notes about the Identification.
Darwin CoreScientific name publisherdwc:scientificNameAuthorshipThe authorship information for the scientific name formatted according to the conventions of the applicable nomenclatural code.
Darwin CoreTaxonomic Rankdwc:taxonRankThe taxonomic rank of the most specific name in the dwc:scientificName. (e.g. species, subspecies, variety)
Darwin CoreOrganism Remarksdwc:organismRemarksComments or notes about the Organism instance.
Darwin CorePresence/Absencedwc:occurrenceStatusA statement about the presence or absence of a Taxon at a Location.
Darwin CorePreparationsdwc:preparationsA list (concatenated and separated) of preparations and preservation methods for a specimen (e.g. ethanol, dried).
Darwin CoreGenomic sequence informationdwc:associatedSequencesA list (concatenated and separated) of identifiers (publication, global unique identifier, URI) of genetic sequence information associated with the Occurrence.
Darwin CoreLife Stagedwc:lifeStageThe age class or life stage of the Organism(s) at the time the Occurrence was observed (e.g. juvenile, nymph).
Darwin CoreReproductive Statedwc:reproductiveConditionThe reproductive condition of the biological individual(s) represented in the Occurrence (e.g. pregnant, flowering).
Darwin CoreNative/introduced/feraldwc:establishmentMeansStatement about whether an organism or organisms have been introduced to a given place and time through the direct or indirect activity of modern humans.
Darwin CoreGeographic uncertainty (m)dwc:coordinateUncertaintyInMetersThe horizontal distance (in meters) from the given decimal-Latitude and decimal-Longitude describing the smallest circle containing the whole of the Location. Leave the value empty if the uncertainty is unknown, cannot be estimated, or is not applicable (because there are no coordinates). Zero is not a valid value for this term.
Darwin CoreArea/locality of occurrencedwc:localityThe specific description of the place. (e.g. 200km north of Perth). (Note: This is a free text field that does not correspond to a specific geography or list of regions.)
Darwin CoreHabitatdwc:habitatA category or description of the habitat in which the Event occurred.
Darwin CoreVernacular namedwc:vernacularNameA common or vernacular name.
N/AInformal groups  
Darwin CoreKingdomdwc:kingdomThe full scientific name of the kingdom in which the dwc:taxon is classified.
Darwin CorePhylumdwc:phylumThe full scientific name of the phylum or division in which the dwc:taxon is classified.
Darwin CoreClassdwc:classThe full scientific name of the class in which the dwc:taxon is classified.
Darwin CoreOrderdwc:orderThe full scientific name of the order in which the dwc:taxon is classified.
Darwin CoreFamilydwc:familyThe full scientific name of the family in which the dwc:taxon is classified.

*Not available for export in the current version of Dandjoo

BIO Blog

Image
Mining Tenements, DBCA managed conservation areas, and Local Government Areas can now be searched by code, license number, or name in the Location search box.
Image
The PDF Species List report, based on user feedback, shows species by Conservation status and Kingdom from user-defined searches, with definitions included. Each species entry lists Class, Family, names, Establishment status, and Conservation code.
Image

We have added functions to be able to search, view and download (where available) Systematic Survey Data in the Dandjoo platform.

Image

To enhance value of data for users the following additional data attributes have been added to the data exports to better assist in data filtering.

Image

We have been working hard and now bring you two new ways to search in Dandjoo. These are Kingdom search and Latitude & Longitude search.

Image

From March 2024, Dandjoo will produce a species list for an area of interest inclusive of all known species that has been evident within the area of interest through observation and survey.

Image

Dandjoo is committed to providing biodiversity data to the Western Australian public that is both usable and compliant with legislation regarding sensitive species.

Join the BIO newsletter and get updated first

Sign up for access to the latest developments at the Biodiversity Information Office, upcoming Dandjoo features, and our newest datasets.

 

Get the BIO newsletter

Image
Map of Western Australia with location points plotted