Rogue Scholar

Published January 24, 2014

VertNet has announced that they have implemented issue tracking using GitHub. This is a really interesting development, as figuring out how to capture and make use of annotations in biodiversity databases is a problem that's attracting a lot of attention.

GBIFGithubGlassellaGMLPinnixaComputer and Information Sciences

GBIF, GitHub, and taxonomy (again)

https://doi.org/10.59350/m556x-qdr03

Published November 21, 2013

Author Roderic Page

Quick notes on yet another attempt to marry the task of editing a taxonomic classification with versioning it in GitHub.The idea of dumping the whole GBIF classification into GitHub as a series of nested folders looks untenable.

GBIFGithubZooKeysComputer and Information Sciences

ZooKeys, GBIF, and GitHub: fixing Darwin Core Archives part 2

https://doi.org/10.59350/npms6-2bh86

Published November 6, 2013

Author Roderic Page

Here's another example of a Darwin Core Archive that is "broken" such that GBIF is missing some information. GBIF data set A checklist to the wasps of Peru (Hymenoptera, Aculeata) comes from Pensoft, and corresponds to the paper:As with the previous example GBIF says there are 0 georeferenced records in this dataset. This is odd, because the ZooKeys page for this article lists three supplementary files, including KML files for Google Earth.

Darwin Core ArchiveGBIFGithubComputer and Information Sciences

GBIF and Github: fixing broken Darwin Core Archives

https://doi.org/10.59350/gt9sg-pj406

Published November 6, 2013

Author Roderic Page

Following on from Annotating and cleaning GBIF data: Darwin Core Archive, GitHub, ORCID, and DataCite here's a quick and dirty example of using GitHub to help clean up a Darwin Core Archive. The dataset 3i - Cicadellinae Database has 2,152 species and 4,749 taxa, but GBIF says it has no georeferenced data.

DataCiteGBIFGithubORCIDComputer and Information Sciences

Annotating and cleaning GBIF data: Darwin Core Archive, GitHub, ORCID, and DataCite

https://doi.org/10.59350/4b0m8-ysp35

Published November 1, 2013

Author Roderic Page

This is a quick sketch of a way to combine existing tools to help clean and annotate data in GBIF, particularly (but not exclusively) occurrence data. GitHub The data provider puts a Darwin Core Archive (expanded, not zipped) into a GitHub repository.

CrossrefDOIGithubJATSNLM DTDComputer and Information Sciences

Augmenting ZooKeys bibliographic data to flesh out the citation graph

https://doi.org/10.59350/mkf16-30048

Published July 17, 2013

Author Roderic Page

In a previous post (Learning from eLife: GitHub as an article repository) I discussed the advantages of an Open Access journal putting its article XML in a version-controlled repository like GitHub. In response to that post Pensoft (the publisher of ZooKeys ) did exactly that, and the XML is available at https://github.com/pensoft/ZooKeys-xml.OK, "now what?" I hear you ask.

ELifeGithubZooKeysComputer and Information Sciences

Learning from eLife: GitHub as an article repository

https://doi.org/10.59350/rjbjw-pa494

Published July 12, 2013

Author Roderic Page

Playing with my eLife Lens-inspired article viewer and some recent articles from ZooKeys I regularly come across articles that are incorrectly marked up. As a quick reminder, my viewer takes the DOI for a ZooKeys article (just append it to http://bionames.org/labs/zookeys-viewer/?doi=, e.g. http://bionames.org/labs/zookeys-viewer/?doi=10.3897/zookeys.316.5132), fetches the corresponding XML and displays the article.Taking the article

Equirectangular ProjectionGithubMapsOrthographic ProjectionPolarComputer and Information Sciences

Using orthographic projections to map organism distributions

https://doi.org/10.59350/6zzw6-fx070

Published July 1, 2012

Author Roderic Page

For a current project I'm currently working I show organism distributions using data from GBIF, and I display that data on a map that uses the equirectangular projection.

CrowdsourcingEOLFigShareFlickrGithubComputer and Information Sciences

Where is the "crowd" in crowdsourcing? Mapping EOL Flickr photos

https://doi.org/10.59350/w706f-6sr12

Published June 28, 2012

Author Roderic Page

In any discussion of data gathering or data cleaning the term "crowdsourcing" inevitably comes up. A example where this approach has been successful is the Encyclopedia of Life's Flickr pool, where Flickr users upload images that are harvested by EOL.Given that many Flickr photos are taken with cameras that have built-in GPS (such as the iPhone, the most common camera on Flickr) we could potentially use the Flickr photos not only as a source of

AjaxBLASTGithubPhylogenyPhyloinformaticsComputer and Information Sciences

BLAST a sequence and get a tree

https://doi.org/10.59350/9reva-mze47

Published January 30, 2012

Author Roderic Page

For this weeks sessions of my phyloinformatics course I'm developing some phylogeny tools. The first is a simple AJAX-based BLAST tool.

iPhylo

VertNet starts issue tracking using GitHub

GBIF, GitHub, and taxonomy (again)

ZooKeys, GBIF, and GitHub: fixing Darwin Core Archives part 2

GBIF and Github: fixing broken Darwin Core Archives

Annotating and cleaning GBIF data: Darwin Core Archive, GitHub, ORCID, and DataCite

Augmenting ZooKeys bibliographic data to flesh out the citation graph

Learning from eLife: GitHub as an article repository

Using orthographic projections to map organism distributions

Where is the "crowd" in crowdsourcing? Mapping EOL Flickr photos

BLAST a sequence and get a tree