Data and the FAIR Principles

Module Overview: Data and the FAIR Principles


Teaching: A FAIR bit of time. min
Exercises: Specific exercises will be available via specific lessons. min
  • Why is FAIR important?

  • Who is this module for?

  • How can I get some help if I get stuck solving an exercise or answering a question?

  • When and where are the future ReproNim training workshops?

  • This module should provide you with the ability to work with your data in a FAIR manner

  • Provide researchers with the proper information on FAIR data so that they can be submitted to the specified workflows and executions environments in a reproducible fashion

A key foundation needed to support reproducible research is the proper handling of research outputs. There are a number of best practices and tools available to researchers to ensure that their data is sufficiently FAIR (Findable, Accessible, Interoperable, and Reusable).

What is a ReproNim module?

A ReproNim module is a set of steps or “lessons”, in which we have gathered material on the web to guide you through the acquisition of a concept or tool or technique. You will take this journey and for each lesson, we try to then ask you a few questions or do some exercise so that you will have an idea of how comfortable you feel with the material to be acquired.

Who is this module for?

The module is for you if you are a biomedical researcher, an informatics researcher, or student, and you are working with neuroimaging (or not) and you want to know about reproducibility and data. To ensure data supports reproducible research, the FAIR principles were issued through FORCE11: the Future of Research Communications and e-Scholarship. The FAIR principles put forth characteristics that contemporary data resources, tools, vocabularies and infrastructures should exhibit to ensure understandability, assist discovery and reuse by third-parties. Wilkinson et al.,2016. FAIR stands for: Findable, Accessible, Interoperable and Re-usable.

What will I learn?

You will learn how to properly work with data to ensure that they are FAIR. This module will also provide information on technologies and platforms that can be utilized within your research.

How much time should this take me?

That really depends on your familiarity with concepts covered in the episodes and your capacity to write some code. If you have no familiarity at all, this may take you a longer time, for instance 2 full weeks. If you have good familiarity, some of the information will be already partially known and it may take you a few days to go in detail through this material.

What are the current lessons for this module?

  1. Lesson 1. Overview of strategies for making research outputs available, with an emphasis on data
  2. Lesson 2. Introduction to ethics surrounding privacy, data sharing, and algorithmic decision-making
  3. Lesson 3. Overview of best practices in data publishing
  4. Lesson 4. Provides information on a number of research data management platforms
  5. Lesson 5. Semantic Data Representations

Do I need to code? If yes, in what language?

You can learn a lot without coding, however, some of the lessons and exercises will require some coding. So, yes, you should code. We have (mostly) adopted Python for the language. It may not be your first choice but we think some knowledge of Python coding will help you anyways. We will try to help as much as possible by providing tutorials, examples, and links to installation instructions.

ReproNim is truly open

The ReproNim training events can only accommodate a limited number of participants. Nevertheless, we are committed to openness and we are committed to providing our materials in an open format, with liberal licenses, and through a publicly accessible website. You can also contribute to this training module - just fork us on GitHub!

Key Points