Rogue Scholar

Published June 1, 2023

Authors Maëlle Salmon, Christophe Dervieux

The Pandoc CLI by John MacFarlane is a really useful tool: for instance, rOpenSci community manager Yanina Bellini Saibene recently asked Maëlle whether she could convert a Google Document into a Quarto book.Maëlle solved the request with a combination of Pandoc (conversion from docx to HTML then to Markdown through pandoc::pandoc_convert()) and XPath.You can find the resulting experimental package quartificate on GitHub.Pandoc is not only

HugoBlogdownR MarkdownMarkdownKnitrComputer and Information Sciences

Miscellaneous Wisdom about R Markdown & Hugo Gained from Work on our Website

https://doi.org/10.59350/m5pk1-gxy73

Published April 23, 2020

Author Maëlle Salmon

Whilst working on the blog guide, Stefanie Butland and I consolidated knowledge we had already gained, but it was also the opportunity to up our Rmd/Hugo technical game.Our website uses Hugo but not blogdown ¹ to render posts: every post is based on an .md file that is either written directly or knit from an .Rmd file.We wanted to provide clear guidance for both options, and to stick to the well-documented Hugo way of e.g. inserting

MarkdownRMarkdownXml2CommonmarkCrulComputer and Information Sciences

We cleaned our website URLs with R!

https://doi.org/10.59350/e1hxf-36d41

Published December 19, 2019

Authors Maëlle Salmon, Scott Chamberlain, Stefanie Butland

Last year we reported on the joy of using commonmark and xml2 to parseMarkdown content, like the source of this website built withHugo, in particular to extractlinks,at the time merely to count them. How about we go a bit further and usethe same approach to find links to be fixed? In this tech note we shallreport our experience using R to find broken/suboptimal links and fixthem.What is a bad URL?

MarkdownR MarkdownXml2CommonmarkTinkrComputer and Information Sciences

tinkr: editing Markdown documents using XML tools

https://doi.org/10.59350/j2km8-ckt35

Published October 1, 2018

Author Maëlle Salmon

Remember our recent post showing that one can wrangle Markdown filesprogrammatically without regex? Thattech note showed how to convert Markdown bodies to XML in order toextract information from them.

CodemetaCodemetarMarkdownXml2CommonmarkComputer and Information Sciences

All the Badges One Can Earn: Parsing Badges of CRAN Packages READMEs

https://doi.org/10.59350/88kqe-q5379

Published September 10, 2018

Author Maëlle Salmon

A while ago weonboarded anexciting package, codemetarby Carl Boettiger. codemetar is an R specificinformation collector and parser for the CodeMetaproject. In particular, codemetar candigest metadata about an R package in order to fill the termsrecognized by CodeMeta. This meansextracting information from DESCRIPTION but also from e.g. continuousintegration ¹ badges in the README!

MarkdownR MarkdownXml2CommonmarkTech NotesComputer and Information Sciences

In praise of Commonmark: wrangle (R)Markdown files without regex

https://doi.org/10.59350/qsf4h-aqv65

Published September 5, 2018

Author Maëlle Salmon

You might have read my blog post analyzing the social weather ofrOpenScionboarding,based on a text analysis of GitHub issues. I extracted text out ofMarkdown-formatted threads with regular expressions. I basicallyhammered away at the issues using tools I was familiar with until itworked! Now I know there’s a much better and cleaner way, that I’llpresent in this note. Read on if you want to extract insights abouttext, code, links, etc.

PackagesCommonmarkMarkdownComputer and Information Sciences

High Performance CommonMark and Github Markdown Rendering in R

https://doi.org/10.59350/h5crj-yst34

Published December 2, 2016

Author Jeroen Ooms

This week the folks at Github have open sourced their fork of libcmark (based on the extensive PR by Mathieu Duponchelle), which they use to render markdown text within documents, issues, comments and anything else on the Github website.

rOpenSci - open tools for open science

Troubleshooting Pandoc Problems as an R User

Miscellaneous Wisdom about R Markdown & Hugo Gained from Work on our Website

We cleaned our website URLs with R!

tinkr: editing Markdown documents using XML tools

All the Badges One Can Earn: Parsing Badges of CRAN Packages READMEs

In praise of Commonmark: wrangle (R)Markdown files without regex

High Performance CommonMark and Github Markdown Rendering in R