New research article type embeds live code and data

The non-profit scientific research publication platform eLife recently announced the Reproducible Document Stack (RDS).

Image by:

Opensource.com

While science is supposed to be about building on each other's findings to improve our understanding of the world around us, reproducing and reusing previously published results remains challenging, even in the age of the internet. The basic format of the scientific paper—the primary means through which scientists communicate their findings—has more or less remained the same since the first papers were published in the 18th century.

This is particularly problematic because, thanks to the technological advancements in research over the last two decades, the richness and sophistication of the methods used by researchers have far outstripped the publishing industry's ability to publish them in full. Indeed, the Methods section in research articles remains primarily a huge section of text that does not reflect the complexity or facilitate the reuse of the methods used to obtain the published results.

Working together on a solution

To counter these challenges, eLife teamed up with Substance and Stencila in 2017 to develop a stack of open source tools for authoring, compiling, and publishing computationally reproducible manuscripts online. Our vision for the project is to create a new type of research article that embeds live code, data, and interactive figures within the flow of the traditional manuscript and to provide authors and publishers with the tools to support this new format throughout the publishing lifecycle.

As a result of our collaboration, we published eLife's first computationally reproducible article in February 2019. It was based on a paper in the Reproducibility Project: Cancer Biology collection. The reproducible version of the article showcases some of the possibilities with the new RDS tools: scientists can share the richness of their research more fully, telling the complete story of their work, while others can directly interact with the authors, interrogate them, and build on their code and data with minimal effort.

The response from the research community to the release of our first reproducible manuscript was overwhelmingly positive. Thousands of scientists explored the paper's inline code re-execution abilities by manipulating its plots, and several authors approached us directly to ask how they might publish a reproducible version of their own manuscripts.

Encouraged by this interest and feedback, in May we announced our roadmap towards an open, scalable infrastructure for the publication of computationally reproducible articles. The goal of this next phase in the RDS project is to ship researcher-centered, publisher-friendly open source solutions that will allow for the hosting and publication of reproducible documents, at scale, by anyone. This includes:

Developing conversion, rendering, and authoring tools to allow researchers to compose articles from multiple starting points, including GSuite tools, Microsoft Word, and Jupyter notebooks
Optimizing containerization tools to provide reliant and performant reproducible computing environments
Building the backend infrastructure needed to enable the options for live-code re-execution in the browser and PDF export at the same time
Formalizing an open, portable format (DAR) for reproducible document archives

What's next, and how can you get involved?

Our first step is to publish reproducible articles as companions of already accepted papers. We will endeavor to accept submissions of reproducible manuscripts in the form of DAR files by the end of 2019. You can learn more about the key areas of innovation in this next phase of development in our article "Reproducible Document Stack: Towards a scalable solution for reproducible articles."

The RDS project is being built with three core principles in mind:

Openness: We prioritize building on top of existing open technologies as well as engaging and involving a community of open source technologists and researchers to create an inclusive tool stack that evolves continuously based on user needs.
Interoperability: We want to make it easy for scientists to create and for publishers to publish reproducible documents from multiple starting points.
Modularity: We're developing tools within the stack in such a way that they can be taken out and integrated into other publisher workflows.

And you can help. We welcome all developers and researchers who wish to contribute to this exciting project. Since the release of eLife's first reproducible article, we have been actively collecting feedback from both the research and open source communities, and this has been (and will continue to be) crucial to shaping the development of the RDS.

If you'd like to stay up to date on our progress, please sign up for the RDS community newsletter. For any questions or comments, please contact us. We look forward to having you with us on the journey.

This article is based in part on "Reproducible Document Stack: Towards a scalable solution for reproducible articles" by Giuliano Maciocci, Emmy Tsang, Nokome Bentley, and Michael Aufreiter.