Perfect Taxon Sampling and Fixing Taxon Traceability: Introducing a Class of Phylogenetically Decisive Collections of Taxon Sets
Published version
Peer-reviewed
Repository URI
Repository DOI
Change log
Authors
Abstract
Phylogenetically decisive collections of taxon sets have the property that if trees are chosen for each of their elements, as long as these trees are compatible, the resulting supertree is unique. This means that as long as the trees describing the phylogenetic relationships of the (input) species sets are compatible, they can only be combined into a common supertree in precisely one way. This setting is sometimes also referred to as “perfect taxon sampling”. While for rooted trees, the decision if a given set of input taxon sets is phylogenetically decisive can be made in polynomial time, the decision problem to determine whether a collection of taxon sets is phylogenetically decisive concerning unrooted trees is unfortunately coNP-complete and therefore in practice hard to solve for large instances. This shows that recognizing such sets is often difficult. In this paper, we explain phylogenetic decisiveness and introduce a class of input taxon sets, namely so-called fixing taxon traceable sets, which are guaranteed to be phylogenetically decisive and which can be recognized in polynomial time. Using both combinatorial approaches as well as simulations, we compare properties of fixing taxon traceability and phylogenetic decisiveness, e.g., by deriving lower and upper bounds for the number of quadruple sets (i.e., sets of 4-tuples) needed in the input set for each of these properties. In particular, we correct an erroneous lower bound concerning phylogenetic decisiveness from the literature. We have implemented the algorithm to determine if a given collection of taxon sets is fixing taxon traceable in R and made our software package FixingTaxonTraceR publicly available.
Description
Acknowledgements: We wish to thank Mike Steel and Leo van Iersel for helpful discussions on the topic, including but not limited to insights concerning the relationship between Problems 1 and 2. Moreover, we thank Steven Kelk for patiently answering some complexity questions.
Funder: Universität Greifswald (1032)
Journal Title
Conference Name
Journal ISSN
1522-9602

