Rogue Scholar

Published November 21, 2013

Quick notes on yet another attempt to marry the task of editing a taxonomic classification with versioning it in GitHub.The idea of dumping the whole GBIF classification into GitHub as a series of nested folders looks untenable.

GBIFGithubZooKeysComputer and Information Sciences

ZooKeys, GBIF, and GitHub: fixing Darwin Core Archives part 2

https://doi.org/10.59350/npms6-2bh86

Published November 6, 2013

Author Roderic Page

Here's another example of a Darwin Core Archive that is "broken" such that GBIF is missing some information. GBIF data set A checklist to the wasps of Peru (Hymenoptera, Aculeata) comes from Pensoft, and corresponds to the paper:As with the previous example GBIF says there are 0 georeferenced records in this dataset. This is odd, because the ZooKeys page for this article lists three supplementary files, including KML files for Google Earth.

Darwin Core ArchiveGBIFGithubComputer and Information Sciences

GBIF and Github: fixing broken Darwin Core Archives

https://doi.org/10.59350/gt9sg-pj406

Published November 6, 2013

Author Roderic Page

Following on from Annotating and cleaning GBIF data: Darwin Core Archive, GitHub, ORCID, and DataCite here's a quick and dirty example of using GitHub to help clean up a Darwin Core Archive. The dataset 3i - Cicadellinae Database has 2,152 species and 4,749 taxa, but GBIF says it has no georeferenced data.

DataCiteGBIFGithubORCIDComputer and Information Sciences

Annotating and cleaning GBIF data: Darwin Core Archive, GitHub, ORCID, and DataCite

https://doi.org/10.59350/4b0m8-ysp35

Published November 1, 2013

Author Roderic Page

This is a quick sketch of a way to combine existing tools to help clean and annotate data in GBIF, particularly (but not exclusively) occurrence data. GitHub The data provider puts a Darwin Core Archive (expanded, not zipped) into a GitHub repository.

GBIFComputer and Information Sciences

What can Global Biodiversity Information Facility (GBIF) do for you?

https://doi.org/10.59350/y15c0-qzs40

Published October 15, 2013

Author Roderic Page

I've recently been appointed Chair of the Science Committee of the Global Biodiversity Information Facility (GBIF) http://www.gbif.org [1]. The committee is a small group of people with a range of backgrounds, and one of our roles is to advise GBIF on matters scientific (e.g., what kinds of data GBIF should collect?, what kinds of scientific questions should GBIF help answer?, etc.).There have been formal surveys (see the papers in the journal

GBICGBIFGBIOComputer and Information Sciences

Global Biodiversity Informatics Outlook (GBIO) launched

https://doi.org/10.59350/wr8jh-8vh05

Published October 4, 2013

Author Roderic Page

Wednesday saw the launch of the Global Biodiversity Informatics Outlook (GBIO), based in large part on the Global Biodiversity Informatics Conference (GBIC). The aim is to provide a framework for biodiversity informatics and its applications in the hope that the field will unite around a shared vision of where we are and what needs to be done next:There is a web site http://www.biodiversityinformatics.org/ with more details and links to related

Data QualityGBIFMap Of LifeComputer and Information Sciences

The quality of GBIF's taxonomic classification

https://doi.org/10.59350/apxaw-0c835

Published September 20, 2013

Author Roderic Page

In some recent posts I've been exploring the quality of GBIF's taxonomic data. I've done some further analyses and decided to write this up in something more than a blog post. I'm writing a draft which you can see on GitHub. It tackles just one issue, namely what happens when you combine taxonomic names from multiple sources and don't know that some of those names are synonyms.

BatsClassificationCluster MapsData CleaningGBIFComputer and Information Sciences

Cluster maps, papaya plots, and the trouble with GBIF taxonomy

https://doi.org/10.59350/dq1cv-szd96

Published August 14, 2013

Author Roderic Page

Continuing the theme of the failings of the GBIF classification I've been playing further with cluster maps to visualise the problem (see this earlier post for an introduction).Browsing through bats in GBIF I keep finding the same species appearing more than once, albeit in different genera.

Creative CommonsGBIFLicenseOpen DataComputer and Information Sciences

GBIF and open biodiversity data: what license should GBIF use?

https://doi.org/10.59350/rg6dy-3rs94

Published August 5, 2013

Author Roderic Page

GBIF is asking for views on how it should license of data in the GBIF network. The full consultation document is available from Google Drive and DropBox.

Catalogue Of LifeGBIFIUCNPhilautusRaorchestesComputer and Information Sciences

More GBIF taxonomy fail

https://doi.org/10.59350/a6qcn-hhm33

Published June 19, 2013

Author Roderic Page

In browsing the GBIF classification in BioNames I keep coming across cases of wholesale duplication of taxa.

iPhylo

GBIF, GitHub, and taxonomy (again)

ZooKeys, GBIF, and GitHub: fixing Darwin Core Archives part 2

GBIF and Github: fixing broken Darwin Core Archives

Annotating and cleaning GBIF data: Darwin Core Archive, GitHub, ORCID, and DataCite

What can Global Biodiversity Information Facility (GBIF) do for you?

Global Biodiversity Informatics Outlook (GBIO) launched

The quality of GBIF's taxonomic classification

Cluster maps, papaya plots, and the trouble with GBIF taxonomy

GBIF and open biodiversity data: what license should GBIF use?

More GBIF taxonomy fail