[ Team LiB ] Previous Section Next Section

getorf

getorf finds and outputs the sequences of open reading frames (ORFs).

Here is a sample session with getorf:

% getorf -minsize 300
Input sequence: embl:eclaci
Output sequence [eclaci.orf]:

Mandatory qualifiers:

[-sequence] (seqall)

Sequence database USA.

[-outseq] (seqoutall)

Output sequence(s) USA.

Optional qualifiers:

-table (menu)

Code to use. See the fuzztran description for codes.

-minsize (integer)

Minimum nucleotide size of ORF to report.

-find (menu)

This is a small menu of possible output options. The first four options are to select either the protein translation or the original nucleic acid sequence of the open reading frame. There are two possible definitions of an open reading frame: it may be a region that is free of STOP codons or it may be a region that begins with a START codon and ends with a STOP codon. The last three options are probably only of interest to those who want to investigate the statistical properties of the regions around potential START or STOP codons. The last option assumes that ORF lengths are calculated between two STOP codons.

Advanced qualifiers:

-[no]methionine (boolean)

START codons at the beginning of protein products will usually code for Methionine, despite what the codon will code for when it is internal to a protein. This qualifier sets all such START codons to code for Methionine by default.

-circular (boolean)

Is the sequence circular?

-[no]reverse (boolean)

Set this to be false if you do not want to find ORFs in the reverse complement of the sequence.

-flanking (integer)

If you chose an option in the type of sequence to find that provides the flanking sequence around a STOP or START codon, this allows you to set the number of nucleotides on either side of that codon to output. If the region of flanking nucleotides crosses the start or end of the sequence, no output is given for this codon.

    [ Team LiB ] Previous Section Next Section