I acknowledge the Traditional Owners of the lands on which we meet, the Kaurna people, as well as the Traditional Owners of the lands on which I live and work, the Wurundjeri people of the Kulin Nation. I pay my respects to their Elders past and present. I recognise the spiritual and cultural significance of land, water, and all that is in the environment to Traditional Owners, and their continuing connection to Country.
How can targeted efforts fill gaps in the biodiversity record?
Visualising these spatial, temporal, and taxonomic gaps, it’s clear that our picture of biodiversity is shaped by patterns of human effort, but it also gives us some idea of where we can direct our efforts to make the biggest difference in filling the gaps.
Queensland’s Christmas Beetle Plague (1935, December 25). Sydney Mail (NSW : 1912 - 1938), p. 47
This newspaper article from 1935 describes the Qld canefields being overrun with beetles, and describes how “beetling” could be very lucrative - good worked could average 10 pounds night, and it took 200 beetles to make a pound. But those annual swarms have disappeared, suggesting that there may have been a decline. But like many invertebrates, Christmas beetles haven’t been the focus of long-term monitoring, so there aren’t consistent records to tell us if these are natural population fluctuations.
Michelle McFarlane, Museums Victoria
The flip side of going out and collecting new records, is making currently available records more accessible. There’s a wealth of data in collections institutions like museums and herbaria, which are in undigitised forms.
To address this, the ALA runs a program called the Australian Biodiversity Data Mobilisation Program, which provides funding to projects that improve access to biodiversity data through the ALA. So it’s about making data that we already have, more accessible and discoverable.
Australian Biodiversity Data Mobilisation Program: Click Beetles, Museums Victoria (2024)
Lucy McMillan, Museums Victoria
One project that was funded through this program is the Museums Victoria Click Beetle project in 2024, where staff digitised over 6,000 click beetle records and uploaded them to the ALA. Prior to the project, the ALA had just over 11,000 records from 63 genera and 322 species of click beetles based on preserved specimens. This project added over 6000 new occurrence records and 1 genus and 60 species that had not been represented in the ALA before.
Australian Biodiversity Data Mobilisation Program: Click Beetles, Museums Victoria (2024)
It’s estimated that there are about 800 species in this group, so we’re still lacking information on more than half the species in this family, but this one project alone was able to increase our species-level data by 18%.
Australian Biodiversity Data Mobilisation Program: Click Beetles, Museums Victoria (2024)
This dataset is particularly valuable because differentiation to species level is very difficult for this group, it often requires close examination of small body structures so records based on preserved specimens are likely to be more reliable or better able to be identified to a lower taxonomic rank. This becomes very clear if you look at the breakdown of records based on record type - a lot more of the preserved specimens are identified down to species level, whereas this proportion is lower for human observations.
Sharing data is hard…
Munroe, R (2019) How To: Absurd Scientific Advice for Common Real-World Problems
Research data is the other type of biodiversity data that doesn’t always see the light of day. It sits on a hard drive or in the supplementary table of a paper, and is generally inaccessible. And that’s because sharing data is hard - if you want to submit it to infrastructures like GBIF or the ALA, it needs to align with a data standard called Darwin Core. But this isn’t a format that most people are familiar with, so it can be difficult to know where to start.
galaxias makes it easier
{galaxias} streamlines the process of converting biodiversity data into Darwin Core Archives, allowing users to more easily submit data to infrastructures such as the ALA or GBIF
We built galaxias to help with this - it’s a package in R and Python that steps you through the process of converting data into the right format, putting everything together in the right place, and bundling it all up.
Darwin Core Archive
An archive is a .zip file containing three things:
Living Atlases require data to be submitted as a Darwin Core Archive, which is essentially a zip file containing three things: the dataset, metadata, and a schema.
Darwin Core Archive
An archive is a .zip file containing three things:
galaxias has functions that will scan your dataset and give you a suggested workflow to meet the minimum requirements. This might involve renaming columns to match standard names, or adding new columns to include particular types of information. These are easily done in existing R and Python workflows.
Darwin Core Archive
An archive is a .zip file containing three things:
You can write a metadata statement in plain text, and galaxias will convert a markdown file to EML format.
Darwin Core Archive
An archive is a .zip file containing three things:
Once those are done, galaxias automatically builds a schema file and zips everything up into a Darwin Core Archive. galaxias handles all the file conversions and file management through this process, so everything’s always in the right place. Our hope is that this will encourage more researchers to make their data accessible. If you’d like to know more about galaxias, please come and talk to us, Dax and I are at the ALA booth where we have a poster that goes into more detail. And if you’d like to test out galaxias with your own dataset, we’d love to chat about it so please stop by.
Shandiya Balasubramaniam Decision Support Program Lead Atlas of Living Australia shandiya.balasubramaniam@csiro.au shandiya.bsky.social shandiya