A sequence must be saved as a text file with a SEQ extension. The file must have the following format:
An example of the format is given here:
; (first line of file) Comments must start with a semicolon.
;
; There can be any number of comments.
A title must immediately follow on the next line and be on one line.
AAA GCGG UUTGTT UTCUTaaTCTXXXXUCAGG1
A CT (Connectivity Table) file contains secondary structure information for a sequence. These files are saved with a CT extension. When entering a structure to calculate the free energy, the following format must be followed.
The CT file may hold multiple structures for a single sequence. This is done by repeating the format for each structure without any blank lines between structures.
The CT file format is such that any files generated by RNAstructure are compatible with mfold/unafold and many other software packages.
Folding constraints are saved in plain text with a CON extension. These can be hand edited. For multiple entries of a specific type of constraint, entries are each listed on a separate line. When there is no constraint of a type, there are no lines required. Note that all specifiers, followed by "-1" or "-1 -1", are expected by RNAstructure. The file format is as follows:
DS:
XA
-1
SS:
XB
-1
Mod:
XC
-1
Pairs:
XD1 XD2
-1 -1
FMN:
XE
-1
Forbids:
XF1 XF2
-1 -1
A sample file with constraints is:
DS:
15
25
76
-1
SS:
17
18
20
35
-1
Mod:
2
15
-1
Pairs:
16 26
-1 -1
FMN:
-1
Forbids:
15 27
-1 -1
The file format for SHAPE reactivity comprises two columns. The first column is the nucleotide number, and the second is the reactivity.
Nucleotides for which there is no SHAPE data can either be left out of the file, or the reactivity can be entered as less than -500. Columns are separated by any white space.
Note that there is no header information. Nucleotides 1 through 10 have no reactivity information. Nucleotide 11 has a normalized SHAPE reactivity of 0.042816. Nucleotide 12 has a normalized SHAPE reactivity of 0, which is NOT the same as having no reactivity when using the pseudo-energy constraints.
By default, RNAstructure looks for SHAPE data files to have the file extension SHAPE, but any plain text file can be read. An example of such a file is given below.
9 -999
10 -999
11 0.042816
12 0
13 0.15027
14 0.16201
List files have a LIS extension. This file contains any number of sequences of any length or nucleic acid, each on its own line. For example:
CUGAGCCAAG
GGGCUCAACG
GGCGUGAGAAAC