See also: Sharing data on COVID-19 | Services for research on COVID-19
Available data on COVID-19
Reference resources
This section lists the main international reference resources for protein data. Among them are the ELIXIR Core Data Resources, a selection of European resources of fundamental importance to the life science community and the long-term preservation of biological data.
UniProt is the reference resource for protein sequences of all organisms. A dedicated section collects the most recent SARS-CoV-2 data.
PRoteomics IDEntifications database (PRIDE) collects data from mass spectrometry experiments. SARS-CoV-2 and COVID-19 data are available in a dedicated section.
Human Protein Atlas (HPA) collects human protein expression data. The database is divided into several sections focusing on: tissue level, subcellular expression, protein expression in the brain and blood. Sections dedicated to specific pathologies and metabolic pathways are also present. Among these, a section is devoted to the expression of human proteins involved in SARS-CoV-2 infection.
Protein Data Bank in Europe (PDBE) is a database of 3D structures for proteins and nucleic acids mainly obtained by X-ray and NMR crystallography. A section is dedicated to structural data of SARS-CoV-2.
Electron Microscopy Public Image Archive (EMPIAR) is a database for electron microscopy and tomography data.
Resources at the European COVID-19 Data Portal
- The European COVID-19 Data Portal is a resource developed at the European Bioinformatics Institute (EMBL-EBI) and proposes a collection of data like sequences, function and classification of SARS-CoV-2 and human proteins involved in the infection (the resource is based on UniProt data).
Resources developed in Italy
The Molecular INTeraction database (MINT) allows to annotate protein-protein interactions published in literature and manually curated by a team of experts. MINT is part of the ELIXIR Core Data Resources, developed at the Bioinformatics and Computational Biology Unit of the University of Rome Tor Vergata. MINT, together with other members of the IMEx consortium, recently annotated protein-protein interactions data between SARS-CoV2 and related strains of Coronaviridae and their host. Over 6900 molecular interactions from more than 220 scientific papers have been annotated and made publicly available.
The SIGnaling Network Open Resource database (SIGNOR 2.0) stores the signaling data published in literature in a structured format through binary causal relationships between biological entities, allowing their graphical representation as flows. The resource is developed and maintained at the Bioinformatics and Computational Biology Unit of the University of Rome Tor Vergata. The cellular pathways modulated during SARS-CoV2 infection are available at the COVID-19 Causal Networks page. SIGNOR 2.0 is part of the international project COVID-19 Disease Map.
MobiDB is a database for annotations on protein disorder and structural mobility/flexibility. The presence of disorder/flexibility in protein structures is known to facilitate interactions with partners such as enzymes and receptors, as well as to affect the protein function. The resource is developed at the BioComputing UP lab. of the University of Padua. A specific section has been developed for SARS-CoV-2 proteins such as Spike glycoprotein, predicted to be disordered at the C-terminal tail.
DisProt is a database of intrinsically disordered proteins where annotation is manually curated. The resource is developed at the BioComputing UP lab. of the University of Padua. A specific section is devoted to SARS-CoV-2 proteins.