6.2 Command-Line Options

Table 6-2 through Table 6-6 summarize Readseq's command-line options.

Table 6-2. Primary pptions

Option

Definition

-a[ll]

Select all sequences. "all" causes processing of all sequences (default now for Version 2, for compatibility with version 1). Use" items=1,2,3" to select a subset.

-c[aselower]

Change to lower case. "caselower" and "CASEUPPER" will convert sequence case.

-C[ASEUPPER]

Change to UPPERCASE.

-degap[=-]

Remove gap symbols. "degap=symbol" will remove this symbol from output sequence (- normally).

-f[ormat=]#

Format number for output.

-f[ormat=]Name

Format name for output. See formats list (Table 6-1) for names and numbers. "format=genbank", "format=gb", "format=xml", etc., selects an output format. You can also use format number, but these numbers may change with revisions. Alternate names of formats are listed in Table 6-1. "Pearson|FastA|fa" allows "pearson", "fasta", or "fa" as a name). This is case-insensitive.

-inform[at]=#

Input format number.

-inform[at]=Name

Input format name. Assume input data is this format. "inform=genbank" lets you specify data input format. Normally Readseq guesses the input format (usually correctly). Use this option if you wish to bypass this input format guessing.

-i[tem=2,3,4]

Select Item number(s) from several. "items=2,3,4" will select these sequence records from a multisequence input file.

-l[ist]

List sequences only. "list" will list titles of sequence records.

-o[utput=]out.seq

Redirect Output. "output=file", sends output to named file.

-p[ipe]

Pipe (command line, < stdin, > stdout). "pipe" will cause input data to come from STDIN and output go to STDOUT Unix standard files (unless -out is given and input file given), and no prompting or progress reports will occurr.

-r[everse]

Reverse-complement of input sequence. "reverse" will write the sequence from end to start, and DNA bases are complemented. Amino residues are not complemented.

-t[ranslate=]io

Translate input symbol [i] to output symbol [o]. Use several -tio to translate several symbols translates given sequence bases, e.g., -tAN to change "A" to "N".

-v[erbose]

Verbose progress. "verbose" will print some progress reports.

-ch[ecksum]

Calculate & print checksum of sequences.

Table 6-3. Documentation and feature table extraction options

Option

Definition

-feat[ures]=exon,CDS...

Extract sequence of selected features.

-nofeat[ures]=repeat_region,intron...

Remove sequence of selected features. "feature=CDS,intron" lets you specify those features to extract, or remove, in the output. Currently this causes each feature to produce a new sequence record.

-field=AC,ID...

Include selected document fields in output.

-nofield=COMMENT,...

Remove selected document fields from output.

Table 6-4. Subrange options

Option

Definition

-subrange=-1000..10

Extract subrange of sequence for feature locations:

-subrange=1..end

-subrange=end-10..end+99

-extract=10000..99999

Extract all features and sequence from given base range.

Table 6-5. Pair, unpair options

Option

Definition

-pair=1

Combine features (fff,gff) and sequence files to one output.

-unpair=1

Split features, sequence from one input to two files.

Table 6-6. Pretty format options

Option

Definition

-wid[th]=#

Sequence line width.

-tab=#

Left indent.

-col[space]=#

Column space within sequence line on output.

-gap[count]

Count gap chars in sequence numbers.

-nameleft, -nameright[=#]

Name on left/right side [=max width].

-nametop

Name at top/bottom.

-numleft, -numright

Seq index on left/right side.

-numtop, -numbot

Index on top/bottom.

-match[=.]

Use match base for 2..n species.

-inter[line=#]

Blank line(s) between sequence blocks .

[ Team LiB ]