Arboviral disease record data - Dengue and Chikungunya, Brazil, 2013-2020

Sci Data. 2022 May 10;9(1):198. doi: 10.1038/s41597-022-01312-7.


One of the main categories of Neglected Tropical Diseases (NTDs) are arboviruses, of which Dengue and Chikungunya are the most common. Arboviruses mainly affect tropical countries. Brazil has the largest absolute number of cases in Latin America. This work presents a unified data set with clinical, sociodemographic, and laboratorial data on confirmed patients of Dengue and Chikungunya, as well as patients ruled out of infection from these diseases. The data is based on case notification data submitted to the Brazilian Information System for Notifiable Diseases, from Portuguese Sistema de Informação de Agravo de Notificação (SINAN), from 2013 to 2020. The original data set comprised 13,421,230 records and 118 attributes. Following a pre-processing process, a final data set of 7,632,542 records and 56 attributes was generated. The data presented in this work will assist researchers in investigating antecedents of arbovirus emergence and transmission more generally, and Dengue and Chikungunya in particular. Furthermore, it can be used to train and test machine learning models for differential diagnosis and multi-class classification.

Publication types

  • Dataset

MeSH terms

  • Arboviruses*
  • Brazil / epidemiology
  • Chikungunya Fever* / epidemiology
  • Dengue* / epidemiology
  • Humans
  • Neglected Diseases
  • Zika Virus Infection*