Research data

Here there is a list of research data related to COVID-19, stored on the data repositories. Research data are those records used by researchers as primary sources.

This is a special version of the BIP! Finder to facilitate exploration of the literature related to COVID-19, based on the subset of the CORD-19 Dataset launched by Semantic Scholar.

CORD-19-on-FHIR is a data superset from the COVID-19 Open Research Dataset (CORD-19, above), provided by the Allen Institute to support research on COVID-19/SARS-CoV-2/Novel Coronavirus. It is represented in the FHIR RDF and was produced by mining data from the CORD-19 dataset and adding semantic annotations. The objective is to facilitate the link with other biomedical datasets, allowing answers to research questions.



COVID-19 Data Sharing/BR it is an initiative of FAPESP in cooperation with the University of São Paulo, Instituto Fleury, Hospital Sírio-Libanês and Hospital Israelita Albert Einstein, with the objective of quickly providing research data related to COVID-19.

COCI presents the Coronavirus open citations dataset, curated by OpenCitations, currently contains (as of May 16, 2020) information on 189,697 citations and the 49,719 articles cited or involved in those citations. The complete data set is stored in JSON format on Zenodo under a Creative Commons CC license, to allow anyone to use this data for any purpose. Coronavirus Open Citations Dataset. Version 2.0. Zenodo.

Dryad is an international open access repository to research data, especially data underlying scientific and medical publications.

Figshare is an open access online repository, where researchers can preserve and share their research results, including figures, datasets, images and videos.

GISAID initiative promotes the international sharing of virus sequences, clinical and epidemiological data related to human viruses, geographical and species-specific data associated with avian and other animal viruses.


 Neural Covidex applies state-of-the-art neural network models and artificial intelligence (AI) techniques to answer questions using the COVID-19 (CORD-19) Open Research Dataset provided by the Allen Institute for AI. The interface is powered by Blacklight, Solr and the Anserini IR toolkit. The efforts include back-end components (code and pre-built indexes) for directly searching and manipulating the collection via Pyserini (Python bindings for Anserini), along with sample integration of search with the HuggingFace’s neural models. Examples of Python notebooks and other resources are available at Neural Covidex. The entire software stack behind Neural Covidex is available open-source on Github which means that in addition to using the search application, individual components such as the searcher, neural reranker and passage highlighter can also be reused elsewhere. 

Through a partnership between large research groups, CORD-19 was launched. It is a free resource that gathers almost 30,000 scientific articles and another hundred research data that can be used by other researchers in studies to help combat the novel Coronavirus.



The Allen Institute for AI has partnered with leading research groups to prepare and distribute the COVID-19 (CORD-19) Open Research Dataset, a free resource for more than 47,000 academic articles, including more than 36,000 with full text on COVID-19 and the virus family Coronaviridae for use by the global research community. This dataset aims to mobilize researchers to apply the recent advances in natural language processing to generate new insights in support for the fight against this infectious disease.



The GO FAIR initiative has structured an implementation network aiming at making SARS CoV-2 FAIR virus data FAIR, meaning that they are Findable, Accessible, Interoperable and thus Reusable by humans and machines, during this COVID-19 epidemic. The technical components that make this possible can remain in place, waiting in ready state for potential future infectious disease outbreaks.

Zenodo is a general purpose open access repository developed under the European OpenAIRE program and operated by CERN. It allows researchers to deposit datasets, research software, reports and other digital research-related artifacts.

More On the Directory

The Directory of open access scientific information sources on Coronavirus aims to gather the sources of national and international scientific information, which provides content on Coronavirus and COVID-19 freely available on the Internet.

  • Instituto Brasileiro de Informação em Ciência e Tecnologia.
    SAUS Quadra 05 Lote 06 Bloco H – Asa Sul, Brasília-DF, 70070-912 - Brasil

Follow us