Rogue Scholar

Published February 23, 2012

Author Roderic Page

Duplicate records are the bane of any project that aggregates data from multiple sources.

Citation, Collections, Identifiers, Specimens, TAXACOM, Computer and Information Sciences

Yet another reason why we need specimen identifiers, now!

https://doi.org/10.59350/56hff-14t75

Published January 18, 2012

Author Roderic Page

This message appeared on the TAXACOM mailing list: Given that most specimens lack resolvable digital identifiers (a theme I've harped on about before, most recently in the context of DNA barcoding), answering this kind of query ends up being a case of searching publications for text strings that contain the acronym of the collection.

Darwin Core Triplet, DNA Barcoding, DOI, GBIF, Identifiers, Computer and Information Sciences

DNA Barcoding, the Darwin Core Triplet, and failing to learn from past mistakes

https://doi.org/10.59350/aq4wb-dt356

Published December 11, 2011

Author Roderic Page

Given various discussions about identifiers, dark taxa, and DNA barcoding that have been swirling around the last few weeks, there's one notion that is starting to bug me more and more.

BHL, DOI, Handles, Identifiers, ION, Computer and Information Sciences

Mapping names to literature: closing in on 250,000 names

https://doi.org/10.59350/ak0kf-7d998

Published November 29, 2011

Author Roderic Page

Following on from my earlier post "Linking taxonomic names to literature: beyond digitised 5×3 index cards", I've been slowly updating my latest toy: http://iphylo.org/~rpage/itaxon. This site displays a database mapping over 200,000 animal names to the primary literature, using a mix of identifiers (DOIs, Handles, PubMed, URLs) as well as links to freely available PDFs where they exist.

Cool URIs, Crossref, DOI, Domain Names, Identifiers, Computer and Information Sciences

The demise of phthiraptera.org and the perils of using Internet domain names as identifiers

https://doi.org/10.59350/v5jjp-2mm35

Published January 14, 2011

Author Roderic Page

Geoffrey Bilder's comments about the unsuitability of URLs as long-term identifiers (as opposed, say, to DOIs) came to mind when I discovered that the domain phthiraptera.org is up for sale. This domain used to be home to a wealth of resources on lice (order Phthiraptera). I discovered that ownership of the domain had expired when a bunch of links to PDFs returned by an iSpecies search for Collodennyus all bounced to the holding page.

BHL, Data Quality, Google Books, Identifiers, Metadata, Computer and Information Sciences

Biodiversity Heritage Library, Google books, and metadata quality

https://doi.org/10.59350/7jqt3-75k06

Published September 19, 2009

Author Roderic Page

I've been playing recently with the Biodiversity Heritage Library (BHL), and am starting to get a sense for the complexities (and limitations) of the metadata BHL stores about publications.

Begonia, DOI, Edinburgh, Identifiers, IPNI, Computer and Information Sciences

Nomenclators + digitised literature = fail

https://doi.org/10.59350/xmdgk-h1824

Published May 7, 2009

Author Roderic Page

Continuing with RSS feeds, I've now added wrappers around IPNI that will return, for each plant family, a list of names added to the IPNI database in the last 30 days. You can see the list here. One thing that is a constant source of frustration for me is the disconnect between nomenclators (lists of published names for species) and scientific publishing.

Digital Library, DOI, Identifiers, Library, Computer and Information Sciences

Defrosting the Digital Library

https://doi.org/10.59350/b5f86-nmg07

Published November 5, 2008

Author Roderic Page

Duncan Hull alerted me to his paper "Defrosting the Digital Library: Bibliographic Tools for the Next Generation Web" (PLoS Computational Biology, doi:10.1371/journal.pcbi.1000204). Here's the abstract: It's an interesting read, and it also cites my bioGUID project.

Crossref, DOI, Identifiers, ISSN, WorldCat, Computer and Information Sciences

When ISSN's disappear, taking DOIs with them

https://doi.org/10.59350/tgzjs-n7m39

Published September 4, 2008

Author Roderic Page

I've been using ISSNs (International Standard Serial Numbers) to uniquely identify journals, both to generate article identifiers and as a parameter to send to CrossRef's OpenURL resolver. Recently I've come across journals that change their ISSN, which has fairly catastrophic effects on my lookup tools.
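Since the ISSN ends up as a lookup key here, a basic sanity check on the key itself may be worth having. The following is a minimal sketch, not taken from the post: the function names `issn_check_digit` and `is_valid_issn` are hypothetical, and the check uses the standard ISSN check-character rule (weights 8 down to 2, modulo 11, with 10 written as X). It only confirms that a string is a well-formed ISSN; it cannot tell you that a journal has since switched to a different ISSN, which is the failure described above.

```python
def _is_seven_ascii_digits(text: str) -> bool:
    """True if text is exactly seven ASCII digits."""
    return len(text) == 7 and text.isascii() and text.isdigit()


def issn_check_digit(first_seven: str) -> str:
    """Return the check character for the first seven digits of an ISSN."""
    if not _is_seven_ascii_digits(first_seven):
        raise ValueError("expected exactly seven digits")
    # Weights run from 8 down to 2 across the seven digits.
    total = sum(int(d) * w for d, w in zip(first_seven, range(8, 1, -1)))
    value = (11 - total % 11) % 11
    # A check value of 10 is written as the letter X.
    return "X" if value == 10 else str(value)


def is_valid_issn(issn: str) -> bool:
    """Check that a string of the form NNNN-NNNC carries the right check character."""
    compact = issn.strip().upper().replace("-", "")
    if len(compact) != 8 or not _is_seven_ascii_digits(compact[:7]):
        return False
    return issn_check_digit(compact[:7]) == compact[7]
```

A lookup tool could call `is_valid_issn` before building an identifier or query from an ISSN, so that typos are rejected early and only the harder problem (a valid ISSN that is no longer the journal's current one) remains.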

Crossref, DOI, Identifiers, JSTOR, SICI, Computer and Information Sciences

When DOIs collide and then disappear: when is a unique, resolvable identifier a bad idea?

https://doi.org/10.59350/22px6-n3j36

Published May 29, 2008

Author Roderic Page

As much as I like the idea of a globally unique, resolvable identifier, my recent experience with JSTOR is making me wonder. JSTOR has three identifiers for the articles it archives: DOIs, SICIs, and stable URLs (the latter being introduced with the new platform released April 4, 2008). Previously JSTOR would publish DOIs for many of its articles.

iPhylo

How many specimens does GBIF really have?
