Software Tools ::

Introduction

FAQ (Frequently Asked Questions) at Phylip Home Page

Programs

DNA and RNA Sequences

  • dnapars: Estimates phylogenies by the parsimony method using nucleic acid sequences.
    • Allows use of all IUB ambiguity codes, and estimates ancestral nucleotide states. Gaps treated as a fifth nucleotide state.
  • dnadist: Computes four different distances between species from nucleic acid sequences. The distances can be used in the distance matrix programs. The distances are:
    • Jukes-Cantor distance
    • Kimura's 2-parameter distance
    • Jin and Nei's distance (which allows for rate variation from site to site)
    • a maximum likelihood method using the model employed in dnaml
    Note that the maximum likelihood method of computing distances can be very slow.

Protein sequences

  • protpars: Estimates phylogenies from protein sequences using the parsimony method.
    • It uses a variant which counts only those nucleotide changes that change the amino acid, on the assumption that silent changes are more easily accomplished.
  • protdist: Computes a distance measure for protein sequences, using maximum likelihood estimates.
    • The measure is based on the Dayhoff PAM matrix, Kimura's 1983 approximation to it, or a model based on the genetic code plus a constraint on changing to a different category of amino acid.
    • The distances can be used in the distance matrix programs.

Distance matrix data

  • fitch: Estimates phylogenies from distance matrix data under the "additive tree model" according to which the distances are expected to equal the sums of branch lengths between the species.
    • Uses the Fitch-Margoliash criterion and some related least-squares criteria.
    • Does not assume an evolutionary clock.
    • This program will be useful with distances computed from DNA sequences, with DNA hybridization measurements, and with genetic distances computed from gene frequencies.
  • kitsch: Estimates phylogenies from distance matrix data under the "ultrametric" model (the same as the additive tree model except it assumes an evolutionary clock).
    • The Fitch-Margoliash criterion and other least-squares criteria are assumed.
    • This program will be useful with distances computes from DNA sequences, with DNA hybridization measurements, and with genetic distances computed from gene frequencies.
  • neighbor: Saitou and Nei's neighbor-joining method and the UPGMA method (Unweighted Pair Group Method with Arithmetic Mean).
    • Neighbor joining is a distance matrix method producing an unrooted tree without the assumption of a clock.
    • UPGMA does assume a clock.
    • The branch lengths are not optimized by the least-squares criterion but the methods are very fast and thus can handle much larger data sets.
    • Implemented by Mary Kuhner and John Yamato.

Discrete two-state (0,1) characters

  • mix: Estimates phylogenies using parsimony for discrete character data with two states (0 and 1).
    • Allows use of the Wagner parsimony method, the Camin-Sokal parsimony method, or arbitrary mixtures of these.
    • Reconstructs ancestral states and allows weighting of characters.
  • clique: Finds the largest clique of mutually compatible characters, and the phylogeny which they recommend, for discrete character data with two states.
    • The largest clique (or all cliques within a given size range of the largest one) are found by a very fast branch-and-bound search method.
    • The method does not allow for missing data. For such cases, the T (Threshold) option of mix may be a useful alternative.
    • Compatibility methods are particular useful when some characters are of poor quality and the rest of good quality, but when it is not known in advance which ones are which.

Bootstrapping

  • seqboot: Reads in a data set, and produces multiple data sets from it by bootstrap resampling.
    • Since most programs in the current version of the package allow processing of multiple data sets, this can be used together with the consensus tree program consense to do bootstrap (or delete-half-jackknife) analyses with most of the methods in this package.
    • This program also allows the Archie/Faith technique of permutation of species within characters.

Make consensus trees

  • consense: Computes consensus trees by the majority-rule consensus tree method, which also allows one to easily find the strict consensus tree.
    • Does NOT compute the Adams consensus tree.
    • Trees are input in a tree file in standard nested-parenthesis notation, which is produced by many of the tree estimation programs in the package.
    • This program can be used as the final step in doing bootstrap analyses for many of the methods in the package.

Back to Phylogenetic Analysis

This website will look much better in a browser that supports web standards, but it has been designed so that it is still usable and accessible to any browser or web-enabled device.