Glimmer : glimmer_extract - extracts subsequences corresponding to long orfs (S. Salzberg)
Some explanations about the options
Main parameters
- enter either the name of a file or the actual data
- if you are using Netscape 2.x or later, you can select a file by typing its name, or better, by selecting it with the Netscape file browser (Browse button)
- OR you can type your data in the next area, or cut and paste it from another application.
- (but not both)
- list of start/stop pairs
- The second command-line argument is the name of thecoordinate file. It must contain a list of pairs of positions in thefirst file, one per line. The format of each entry is:
- 'IDstring' 'start position' 'stop position'
- This file should contain no other information, so if you're using the output of glimmer or long-orfs , you'll have to cut off header lines.
- skip the start codon (-skip)
- makes the output omit the first 3 characters of each sequence, i.e., it skips over the start codon. This was the default behaviour of the previous version of the program.
- minimum length of sequences (-l)
- makes the output omit any sequences shorter than n characters. n includes the 3 skipped characters if the -skip switch is on.
-
- Sequence format
- The sequence will be automatically converted in the format needed for the program
- providing you enter a sequence either:
- in plain (raw) sequence format or in one of the following known formats:
- IG,GenBank,NBRF,EMBL,GCG,DNAStrider,Fitch,fasta,Phylip,PIR,MSF,ASN,PAUP,CLUSTALW
- You may enter in the text area a database entry code, or an accession number, in this form:
database:entry_name
or:
database:accession.
References:
A.L. Delcher, D. Harmon, S. Kasif, O. White, and S.L. Salzberg. Improved microbial gene identification with GLIMMER. Nucleic Acids Research, 27:23, 4636-4641.
S. Salzberg, A. Delcher, S. Kasif, and O. White. Microbial gene identification using interpolated Markov models. Nucleic Acids Research 26:2 (1998), 544-548.
Pise form generator version: 5.a (19 Oct 2006 12:41)