Fast Converging Phylogenetic Methods



Our research in this area combines algorithm design with simulation studies and probabilistic analyses of the performance of phylogenetic methods under various models of biomolecular sequence evolution. The performance of phylogenetic methods is evaluated according to two criteria: statistical performance, which addresses the accuracy of the method under a specified stochastic model of evolution, and computational performance, which addresses the computational requirements of the method. One of the more demanding statistical criterion is the convergence rate, which can be interpreted as the sequence length needed for a method to recover the true tree with probability 1-epsilon. Some of the recent developments in computational phylogenetics include the first analytical bounds on the performance of existing methods under various models of biomolecular sequence evolution and the first methods guaranteed to recover the true tree from sequences of polynomial length. Such "fast converging" methods may provide key techniques for the reconstruction of large, evolutionary divergent phylogenetic trees.
       This research will continue the effort to design new polynomial-time algorithms with polynomial convergence rates. This research will extend the basic theory of convergence rates of phylogenetic methods as well as develop new "fast converging" methods. All methods developed will be studied extensively using simulations of biomolecular sequence evolution as well as real datasets.
       The goal of the research is to develop algorithms that have provable performance guarantees, that provide measurable performance improvements on both real and simulated datasets, and that outperform, with respect to both running time and accuracy, all currently used methods, so that the algorithms will come into use within the biology community. The research will be based on results that were recently obtained in the development of "fast converging" methods, in the experimental assessment of phylogenetic methods, and in the implementation and performance engineering of algorithms for hard optimization problems in phylogeny.



People


Publications

Presentations

The following presentation is on fast converging methods.

  • (PDF) Fast converging phylogenetic methods (Stochastic Processes and Applications, June 27, 2005).