RNAstructure

Version 5.1

Copyright 1996 - 2010

by David H. Mathews


Description of the Secondary Structure Prediction Algorithm

The prediction of a secondary structure is based on Zuker's algorithm for free energy minimization, using the nearest neighbor parameters of Doug Turner and co-workers. A recursive algorithm is used that generates an optimal structure and a series of alternate structures.


These alternate structures are called suboptimal structures, defined as structures with free energy similar to the lowest free energy (optimal) structure. The number of suboptimal structures generated is controlled by two parameters: the maximum number of structures to be generated and % sort.


A third parameter, window size, may also be used. This controls how different the suboptimal structures must be from each other. A small window size allows generation of very similar structures, while a larger window size generates relatively more diverse structures.


For more details, see the description of the module "Fold."


Accuracy

A recent report (Mathews et al, see References) examined the accuracy of this method by comparing predictions to structures determined by comparative sequence analysis. When domains of less than 700 nucleotides were used, the algorithm was able to correctly predict 73% of base pairs from a database of 151,503 nucleotides containing small subunit rRNAs, large subunit rRNAs, group I introns, group II introns, RNase P RNA, SRP RNA, and tRNAs.


Precision

Within a small increment of free energy of the lowest structure, there can be many alternative structures for a helix appearing in the suboptimal structures. A dot plot can show how well defined a region of structure is. (See "Dot Plot.")