Advances in annotation and validation of nucleic acids

Bohdan Schneider, Lada Biedermannová, Paulína Božíková, Michal Malý,
Terezie Prchalová,
Jakub Svoboda & Jiří Černý

Institute of Biotechnology of the Czech Academy of Sciences, 252 50 Vestec, Czech Republic

bohdan.schneider@ibt.cas.cz)

 

Structural biology operates robust tools and data for proteins; the field of nucleic acid (NA) structure analysis and prediction lags behind. This is serious for RNA structures in the context of the ability to predict their variable 3D structures and for DNA for understanding of their mutual recognition with proteins. More reliable prediction of RNA 3D structures is primarily hindered by a scarcity of sequence alignments of functionally wide spectrum of RNAs. A better understanding of both RNA and DNA could be much more profound if more high-quality structures were available, and the quality of the existing structures was gauged more reliably [1]. Our work addresses the latter, improving the quality of the existing NA structures and methods of their analysis. Current challenges in NA structure quality include: (A) Inconsistent application of valence geometry restraints. (B) Poorly refined backbone conformations. (C) Incomplete or incorrect base pairing assignments. This presentation will introduce key features of fully redesigned DNATCO version 5.0 at dnatco.datmos.org [2], a suite of web-based tools designed to enhance NA structure annotation and validation. DNATCO includes:

An Annotation TAB for clear structural overviews. Quite significantly, the new DNATCO provides full annotation of base pairs in the Leontis-Westhof classification schema [3] both in tabular form and - to the best of our knowledge - for the first time as an integrated part of molecular graphics.

A Validation TAB offering detailed analysis, including dinucleotide conformation analysis (based on our NtC classification [4]), novel RMSD/RSCC scatterplots, and detailed valence geometry evaluation.

A Browse TAB for multi-view structure visualization.

Our ongoing development aims to provide tools for direct structure modification during the refinement process, further contributing to higher-quality NA structural data.

1.         Schneider, B., Sweeney, B.A., Bateman, A., Cerny, J., Zok, T. and Szachniuk, M. (2023). When will RNA get its AlphaFold moment? Nucleic Acids Res, 51, 9522-9532.

2.         Jiří Černý, Michal Malý, Paulína Božíková, Terezie Prchalová, Jakub Svoboda, Lada Biedermannová & Bohdan Schneider: DNATCO v5.0: Integrated Web Platform for 3D Nucleic Acid Structure Analysis. submitted to Nucleic Acids Research (2025).

3.         Leontis, N.B. and Westhof, E. (2001) Geometric nomenclature and classification of RNA base pairs. RNA, 7, 499-512.

4.         Jiří Černý, Paulína Božíková, Jakub Svoboda & Bohdan Schneider: A unified dinucleotide alphabet describing both RNA and DNA structures. Nucleic Acids Research. 48: 6367-6381 (2020). doi: 10.1093/nar/gkaa383.

This research was funded by Czech Academy of Sciences, grant RVO 86652036 and by grant LM2023055 to ELIXIR CZ from MEYS Czech Republic.