Software Tools :: Software by Function

The Center for Genomics and Bioinformatics provides access to a variety of software and database services, some of which are available at no charge to the research community (e.g., EMBOSS, PHYLIP) and some of which are provided free of charge to the IU community (e.g., GCG, SeqLab, TRANSFAC).

The available tools are grouped into several categories. To find the right tool, pick the category that matches the analysis you're interested in. The page that follows provides documentation, notes, and links to specific sequence analysis programs.

Function Description
Sequence Creation and Manipulation Create and edit sequence files
  • edit sequences
  • shuffle, corrupt, mask sequences
  • complement, reverse-complement nucleic acid sequences
Mapping and Digests Find cut sites and digestion fragments
  • restriction enzyme cut sites and fragments
  • proteolytic enzyme cut sites and fragments
  • protein isoelectric points
Format conversions -- sequences and other files Convert files from one format to another
  • sequence files
  • scoring matrix files
  • restriction enzyme files
  • profile files
Motifs, patterns, and profiles Find motifs, patterns, and domains in sequences
  • simple sequence patterns
  • functional sites or domains
  • profiles/HMMs representing a sequence family
  • MEME (Multiple Expectation maximization for Motif Elicitation)
Translations Convert sequences from one type to another
  • translate DNA/RNA to protein
  • reverse translate protein to DNA/RNA
Gene finding and codon analysis Locate potential genes and exons in nucleic acid sequences; find regions of codon bias
  • search by content -- ORF analysis, codon bias, ...
  • search by signal -- promoters, intron/exon boundaries, ...
  • search by homology -- locate ESTs on genomic sequences, ...
  • combination approaches -- TESTCODE, Markov models, ...
Database reference search Find sequences of interest in a database by searching the annotation section of database entries with keywords
Database similarity search
DNA sequence assembly Assemble nucleic acid sequences from overlapping fragments
  • trace file viewers
  • sequence fragment preprocessing
  • sequence assembly project management
Sequence alignments and comparisons Align two or more sequences
  • pair-wise global and local alignments
  • multiple sequence alignments
  • information about sequence alignments
Phylogenetic analysis Create phylogenetic trees from sequence data
  • distance matrix methods, neighbor joining
  • parsimony methods
  • maximum likelihood methods
  • synonymous vs nonsynonymous substitutions
Primers and probes Find PCR or sequencing primers; find hybridization probes
  • primer and probe candidates
  • primer pair compatibility for PCR
  • oligonucleotide melting temperatures
Information engineering in biology Programming, web services, and databases
  • Perl, Python, Java programming; Bio projects
  • XML, DAS, GEML, MAML data annotation formats
  • Gene Ontology Consortium
  • web services (CGI, servlets, CORBA)
  • databases, information systems, data mining
RNA secondary structure analysis Predict RNA folding; find inverted and tandem repeats in nucleic acid sequences
Protein sequence analysis Examine protein features and properties; determine secondary and tertiary structure
