Ricerca di contatti, progetti,
corsi e pubblicazioni

ModelArchive - Open Research Data (ORD) best practices for computational macromolecular models

Persone

 

Cavalli A.

(Partner di progetto)

Persone esterne

Schwede Torsten

(Responsabile)

Abstract

Proteins, DNA, and RNA are essential for all biological processes, and their functions are intertwined with their 3D structure. Traditionally, structures are determined experimentally, mainly with X-ray crystallography, NMR, and cryo-EM techniques, but recently computational methods have made impressive progress in accurate 3D protein structure prediction. In fact, the journal Nature has nominated protein structure prediction as “Method of the Year 2021”. The structural biology community has pioneered open research data principles, as exemplified by the Protein Data Bank (PDB), the global de facto standard archive of experimentally-determined macromolecular structures. However, the PDB does not archive structures determined through computational modelling, resulting in computational models stored in undefined locations, in incompatible formats, and lacking essential metadata. Following recommendations from an international community workshop, we have developed an archive for computed macromolecular structures, ModelArchive (https://modelarchive.org), and an extension of the mmCIF data format to store model metadata. However, data standards and best practices are not yet established for complex computational models involving proteins, DNA, RNA and/or small molecules, different conformational states of the same macromolecule, and synthetic proteins constructed through design methodologies. With the technical infrastructure of ModelArchive now established, we are in a good position to further develop ORD practices in our community. This includes defining and promoting best practices for data and metadata standards, establishing deposition policies with publishers and funding agencies, improving usefulness of protein models through linking accuracy estimates and accompanying metadata, and connecting to other ORD resources to make models easily findable, accessible and reusable.

Informazioni aggiuntive

Acronimo
ModelArchive
Data d'inizio
01.01.2023
Data di fine
31.12.2024
Durata
24 Mesi
Enti finanziatori
Partner esterni
UniBS, UniL, EPFL, SIB
Stato
In corso
Categoria
swissuniversities / Open Research Data calls / Measure A1, Track B: Establish projects