IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies

doi:10.1093/molbev/msu300

Mol Biol Evol. 2015 Jan;32(1):268-74.

doi: 10.1093/molbev/msu300. Epub 2014 Nov 3.


Lam-Tung Nguyen¹, Heiko A Schmidt², Arndt von Haeseler¹, Bui Quang Minh³

Affiliations

¹ Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Vienna, Austria; Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Vienna, Austria.
² Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Vienna, Austria.
³ Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University of Vienna, Vienna, Austria. Email: minh.bui@univie.ac.at

PMID: 25371430
PMCID: PMC4271533
DOI: 10.1093/molbev/msu300

Free PMC article


Abstract

Large phylogenomic data sets require fast tree inference methods, especially for maximum-likelihood (ML) phylogenies. Fast programs exist, but because they rely on heuristics to search for optimal trees, it is not clear whether the best tree is found. Thus, there is a need for additional approaches that employ different search strategies to find ML trees and that are, at the same time, as fast as currently available ML programs. We show that a combination of hill-climbing approaches and a stochastic perturbation method can be implemented time-efficiently. When allowed the same CPU time as RAxML and PhyML, our software IQ-TREE found higher likelihoods in 62.2% to 87.1% of the studied alignments, indicating that it explores tree space efficiently. When the IQ-TREE stopping rule is used instead, RAxML and PhyML are faster in 75.7% and 47.1% of the DNA alignments and in 42.2% and 100% of the protein alignments, respectively. However, the proportion of alignments for which IQ-TREE obtains higher likelihoods then rises to 73.3-97.1%. IQ-TREE is freely available at http://www.cibiv.at/software/iqtree.

Keywords: maximum likelihood; phylogenetic inference; phylogeny; stochastic algorithm.

Figures

Fig. 1.

Performance of IQ-TREE for fixed CPU times. Panels (a) and (b) display the frequencies of log-likelihood differences (IQ-TREE minus RAxML) for the 70 DNA (a) and 45 AA (b) alignments. Panels (c) and (d) show the same for IQ-TREE compared with PhyML. IQ-TREE’s CPU times were limited to those required by RAxML and PhyML, respectively. The percentages on the dashed line in (b) and (d) represent the fraction of alignments where the log-likelihood differences are smaller than 0.01.
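The kind of summary reported in this caption can be sketched in a few lines. The snippet below is only an illustration, not the authors' analysis code: the function name `tally_differences`, the parameter `tol`, and the example numbers are all made up for demonstration. It assumes that a difference counts as a tie when its absolute value is below 0.01, which is one reading of "smaller than 0.01".

```python
def tally_differences(diffs, tol=0.01):
    """Summarise per-alignment log-likelihood differences as percentages.

    diffs: differences (program A minus program B), one value per alignment.
    tol:   differences with absolute value below tol are counted as ties
           (an assumed reading of the 0.01 threshold in the caption).
    """
    n = len(diffs)
    if n == 0:
        raise ValueError("need at least one alignment")
    higher = sum(1 for d in diffs if d >= tol)   # program A found a higher likelihood
    lower = sum(1 for d in diffs if d <= -tol)   # program B found a higher likelihood
    tied = n - higher - lower                    # |difference| below tol
    return {
        "higher": 100.0 * higher / n,
        "tied": 100.0 * tied / n,
        "lower": 100.0 * lower / n,
    }


# Made-up differences for four hypothetical alignments (not data from the paper).
example_diffs = [2.5, -0.3, 0.004, 10.0]
summary = tally_differences(example_diffs)
print(summary)
```

With real results, `diffs` would hold one log-likelihood difference per alignment, and the three percentages would correspond to the regions right of, on, and left of the dashed line.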

Fig. 2.

Performance of IQ-TREE for variable CPU times. The upper plots (a, b) show the performance of IQ-TREE against RAxML using the 70 DNA (a) and 45 AA (b) alignments. The lower plots (c, d) show the same against PhyML. Each dot in the main diagrams represents one alignment: its position gives the mean difference in CPU time (y axis) and the mean difference in log-likelihood (x axis) between the trees reconstructed by the two programs being compared. The whiskers at each point show the standard errors of the differences. The histograms at the top and the side present the marginal frequencies. Dots to the right of the vertical dashed line represent alignments where IQ-TREE found a higher likelihood. If a dot is below the horizontal dashed line, the reconstruction by IQ-TREE was faster. Percentages in the quadrants of the histograms denote the fraction of alignments in that region. Percentages on the dashed line give the fraction of alignments where the log-likelihood differences are smaller than 0.01 (see [b] and [d]).

Fig. 3.

Flowchart of the stochastic search algorithm. The variable count tracks the number of random perturbations (box b and box c) performed since a new best tree was last found.
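The general idea of combining hill climbing with random perturbation, together with a counter of perturbations since the last improvement, can be sketched on a toy problem. This is a minimal illustration under stated assumptions and not the IQ-TREE implementation: it searches over integers rather than trees, and every name here (`hill_climb`, `stochastic_search`, `toy_score`, `toy_neighbors`, `toy_perturb`, `max_unsuccessful`) is hypothetical. Resetting `count` to zero on each improvement and stopping once it reaches a threshold is an assumed reading of how the counter is used.

```python
import random


def hill_climb(x, score, neighbors):
    """Greedy local search: move to the best neighbor until none improves."""
    while True:
        best = max(neighbors(x), key=score)
        if score(best) <= score(x):
            return x  # local optimum reached
        x = best


def stochastic_search(start, score, neighbors, perturb, max_unsuccessful, rng):
    """Alternate random perturbation and hill climbing.

    count holds the number of perturbations since the best solution last
    improved; the search stops when count reaches max_unsuccessful.
    """
    best = hill_climb(start, score, neighbors)
    count = 0
    while count < max_unsuccessful:
        candidate = hill_climb(perturb(best, rng), score, neighbors)
        count += 1
        if score(candidate) > score(best):
            best = candidate
            count = 0  # a new best solution was found, so restart the counter
    return best


# Toy landscape (an assumption for illustration): a broad peak near 50 plus
# a bonus at multiples of 7, which creates many local optima.
def toy_score(x):
    return -abs(x - 50) + (10 if x % 7 == 0 else 0)


def toy_neighbors(x):
    return [x - 1, x + 1]


def toy_perturb(x, rng):
    return x + rng.randint(-10, 10)


rng = random.Random(0)
local_only = hill_climb(0, toy_score, toy_neighbors)
result = stochastic_search(0, toy_score, toy_neighbors, toy_perturb,
                           max_unsuccessful=100, rng=rng)
print(local_only, result)
```

On this toy landscape, plain hill climbing started at 0 cannot leave its starting point, because both neighbors score lower. The perturbation step is what lets the search escape such local optima, at the cost of extra iterations controlled by the stopping threshold.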


Cited by 1,663 articles

Identification of viruses infecting six plum cultivars in Korea by RNA-sequencing.
Jo Y, Choi H, Lian S, Cho JK, Chu H, Cho WK. PeerJ. 2020 Jul 29;8:e9588. doi: 10.7717/peerj.9588. eCollection 2020. PMID: 32821540. Free PMC article.
Genomic Diversity of SARS-CoV-2 During Early Introduction into the United States National Capital Region.
Thielen PM, Wohl S, Mehoke T, Ramakrishnan S, Kirsche M, Falade-Nwulia O, Trovao NS, Erlund A, Howser C, Sadowski N, Morris P, Hopkins M, Schwartz M, Fan Y, Gniazdowski V, Lessler J, Sauer L, Schatz MC, Evans JD, Ray SC, Timp W, Mostafa HH. medRxiv. 2020 Aug 15:2020.08.13.20174136. doi: 10.1101/2020.08.13.20174136. Preprint. PMID: 32817965. Free PMC article.
Next generation sequencing-aided comprehensive geographic coverage sheds light on the status of rare and extinct populations of Aporia butterflies (Lepidoptera: Pieridae).
Todisco V, Vodă R, Prosser SWJ, Nazari V. Sci Rep. 2020 Aug 18;10(1):13970. doi: 10.1038/s41598-020-70957-4. PMID: 32811885. Free PMC article.
Molecular characterization and DNA methylation profile of Libyodrilus violaceous from oil polluted soil.
Ogunlaja A, Sharma V, Ghai M, Lin J. Mol Biol Res Commun. 2020 Jun;9(2):45-53. doi: 10.22099/mbrc.2019.35242.1449. PMID: 32802898. Free PMC article.
Genomic Characteristics and Potential Metabolic Adaptations of Hadal Trench Roseobacter and Alteromonas Bacteria Based on Single-Cell Genomics Analyses.
Chen M, Song Y, Feng X, Tang K, Jiao N, Tian J, Zhang Y. Front Microbiol. 2020 Jul 24;11:1739. doi: 10.3389/fmicb.2020.01739. eCollection 2020. PMID: 32793171. Free PMC article.




Grant support

I 760/Austrian Science Fund FWF/Austria


