Rogue Scholar

Published June 24, 2015

I spent last Friday and Saturday at ( Research in the 21st Century: Data, Analytics and Impact , hashtag #ReCon_15) in Edinburgh. Friday 19th was conference day, followed by a hackday at CodeBase. There's a Storify archive of the tweets so you can get a sense of the meeting. Sitting in the audience a few things struck me. No identifier wars, DOIs have won and are everywhere.

Creative CommonsGeoJSONGeophylogenyGithubPLoSComputer and Information Sciences

Visualising Geophylogenies in Web Maps Using GeoJSON

https://doi.org/10.59350/q7mg0-yq203

Published June 24, 2015

Author Roderic Page

I've published a short note on my work on geophylogenies and GeoJSON in PLoS Currents Tree of Life : At the time of writing the DOI hasn't registered, so the direct link is here. There is a GitHub repository for the manuscript and code. I chose PLoS Currents Tree of Life because it is (supposedly) quick and cheap.

Ross MounceSpecimen CodesText MiningComputer and Information Sciences

Text mining for museum specimen identifiers

https://doi.org/10.59350/xvdw8-nc818

Published May 19, 2015

Author Roderic Page

This post is a response to Ross Mounce's post Text mining for museum specimen identifiers. As Ross notes in that post, mining literature for specimen codes is something I've been interested in for a while (search for specimen codes on iPhylo), and @Aime Rankin (formerly an undergraduate student at Glasgow) did some work on this as well. It's great to see progress in this area.

BioNamesBirdsGBIFHolotypesIONComputer and Information Sciences

The value of ION to GBIF

https://doi.org/10.59350/2bpr6-11k45

Published May 14, 2015

Author Roderic Page

This a quick writeup of an analysis I did to make the case that the list of names held by the Index of Organism Names (ION) (part of Thomson Reuters) would be very useful for GBIF. I must declare a bias, in that I've spent a good chunk of the last 3-4 years exploring the ION database and investigating ways to link the taxonomic names it contains to the primary taxonomic literature, culminating in building BioNames.

Bouchout DeclarationOpen AccessOpen DataComputer and Information Sciences

Putting some bite into the Bouchout Declaration

https://doi.org/10.59350/z8g5k-peb09

Published May 8, 2015

Author Roderic Page

I've put off writing this post about the Bouchout Declaration for a number of reasons. I attended the meeting that launched the declaration last year, and from my perspective that was a frustrating meeting.

GBIFGoogle DocsMaterial ExaminedSpecimen CodesWeb ServicesComputer and Information Sciences

Looking up specimen codes in GBIF using Google Spreadsheet

https://doi.org/10.59350/c236y-8bn90

Published April 21, 2015

Author Roderic Page

Playing with the my "material examined" tool I've been working on, I wondered whether I could make use of it in, say, a spreadsheet. Imagine that I have a spreadsheet of museum codes and want to look those up in GBIF. I could create a service for Open Refine but Open Refine is a bit big and clunky, you have to fire up a Java application and point your browser at it, and Open Refine isn't as intuitive or as flexible as a spreadsheet.

ChallengeGBIFComputer and Information Sciences

GBIF Ebbe Nielsen Challenge finalists announced

https://doi.org/10.59350/w3afa-cyh60

Published April 15, 2015

Author Roderic Page

The six finalists for the GBIF Ebbe Nielsen Challenge have been announced by GBIF: The finalists all receive a €1,000 prize, and now have the possibility to refine their work and compete for the grand prize of €20,000 (€5000 for second place). As the rather cheesy quote above suggests, I think the challenge has been a success in terms of the interest generated, and the quality of the entrants.

GBIFGenbankKnowledge GraphSpecimen CodesComputer and Information Sciences

Linking specimen codes to GBIF

https://doi.org/10.59350/g6gq1-crg31

Published April 15, 2015

Author Roderic Page

I've put together a working demo of some code I've been working on to discover GBIF records that correspond to museum specimen codes. The live demo is at http://bionames.org/~rpage/material-examined/ and code is on GitHub. To use the demo, simply paste in a specimen code (e.g., "MCZ 24351") and click Find and it will do it's best to parse the code, then go off to GBIF and see what it can find.

ChallengeGBIFComputer and Information Sciences

GBIF Ebbe Nielsen Challenge submissions: judging begins

https://doi.org/10.59350/bqe4h-2ek08

Published March 10, 2015

Author Roderic Page

The GBIF Ebbe Nielsen Challenge has closed and we have 23 submissions for the jury to evaluate. There's quite a range of project types (and media, including sound and physical objects), and it's going to be fascinating to evaluate all the entries (some of which are shown below). This is the first time GBIF has run this challenge, so it's gratifying to see so much creativity in response to the challenge.

GBIFOZCAMComputer and Information Sciences

More examples of data duplication and loss in GBIF: Australian bats in bits

https://doi.org/10.59350/rnwpd-3nj27

Published February 20, 2015

Author Roderic Page

Quick notes on another example of data duplication in GBIF. I'm in the process of building a tool to map specimen codes to GBIF records, and came across the following example.

iPhylo

Thoughts on ReCon 15: DOIs, GitHub, ORCID, altmetric, and transitive credit

Visualising Geophylogenies in Web Maps Using GeoJSON

Text mining for museum specimen identifiers

The value of ION to GBIF

Putting some bite into the Bouchout Declaration

Looking up specimen codes in GBIF using Google Spreadsheet

GBIF Ebbe Nielsen Challenge finalists announced

Linking specimen codes to GBIF

GBIF Ebbe Nielsen Challenge submissions: judging begins

More examples of data duplication and loss in GBIF: Australian bats in bits