6.2 Command-Line Options
Table 6-2
through Table 6-6 summarize Readseq's command-line
options.
Table 6-2. Primary pptions
-a[ll]
|
Select all sequences. "all" causes
processing of all sequences (default now for Version 2, for
compatibility with version 1). Use"
items=1,2,3" to select a subset.
|
-c[aselower]
|
Change to lower case. "caselower"
and "CASEUPPER" will convert
sequence case.
|
-C[ASEUPPER]
|
Change to UPPERCASE.
|
-degap[=-]
|
Remove gap symbols. "degap=symbol"
will remove this symbol from output sequence (- normally).
|
-f[ormat=]#
|
Format number for output.
|
-f[ormat=]Name
|
Format name for output. See formats list (Table 6-1) for names and numbers.
"format=genbank",
"format=gb",
"format=xml", etc., selects an
output format. You can also use format number, but these numbers may
change with revisions. Alternate names of formats are listed in Table 6-1.
"Pearson|FastA|fa" allows
"pearson",
"fasta", or
"fa" as a name). This is
case-insensitive.
|
-inform[at]=#
|
Input format number.
|
-inform[at]=Name
|
Input format name. Assume input data is this format.
"inform=genbank" lets you specify
data input format. Normally Readseq guesses the input format (usually
correctly). Use this option if you wish to bypass this input format
guessing.
|
-i[tem=2,3,4]
|
Select Item number(s) from several.
"items=2,3,4" will select these
sequence records from a multisequence input file.
|
-l[ist]
|
List sequences only. "list" will
list titles of sequence records.
|
-o[utput=]out.seq
|
Redirect Output. "output=file",
sends output to named file.
|
-p[ipe]
|
Pipe (command line, < stdin, > stdout).
"pipe" will cause input data to
come from STDIN and output go to STDOUT Unix standard files (unless
-out is given and input file given), and no prompting or progress
reports will occurr.
|
-r[everse]
|
Reverse-complement of input sequence.
"reverse" will write the sequence
from end to start, and DNA bases are complemented. Amino residues are
not complemented.
|
-t[ranslate=]io
|
Translate input symbol [i] to output symbol [o]. Use several -tio to
translate several symbols translates given sequence bases, e.g., -tAN
to change "A" to
"N".
|
-v[erbose]
|
Verbose progress. "verbose" will
print some progress reports.
|
-ch[ecksum]
|
Calculate & print checksum of sequences.
|
Table 6-3. Documentation and feature table extraction options
-feat[ures]=exon,CDS...
|
Extract sequence of selected features.
|
-nofeat[ures]=repeat_region,intron...
|
Remove sequence of selected features.
"feature=CDS,intron" lets you
specify those features to extract, or remove, in the output.
Currently this causes each feature to produce a new sequence record.
|
-field=AC,ID...
|
Include selected document fields in output.
|
-nofield=COMMENT,...
|
Remove selected document fields from output.
|
Table 6-4. Subrange options
-subrange=-1000..10
|
Extract subrange of sequence for feature locations:
- -subrange=1..end
- -subrange=end-10..end+99
|
-extract=10000..99999
|
Extract all features and sequence from given base range.
|
Table 6-5. Pair, unpair options
-pair=1
|
Combine features (fff,gff) and sequence files to one output.
|
-unpair=1
|
Split features, sequence from one input to two files.
|
Table 6-6. Pretty format options
-wid[th]=#
|
Sequence line width.
|
-tab=#
|
Left indent.
|
-col[space]=#
|
Column space within sequence line on output.
|
-gap[count]
|
Count gap chars in sequence numbers.
|
-nameleft, -nameright[=#]
|
Name on left/right side [=max width].
|
-nametop
|
Name at top/bottom.
|
-numleft, -numright
|
Seq index on left/right side.
|
-numtop, -numbot
|
Index on top/bottom.
|
-match[=.]
|
Use match base for 2..n species.
|
-inter[line=#]
|
Blank line(s) between sequence blocks .
|
|