Nearest_neighbor_interchange

Tree rearrangement

Method in computational phylogenetics

Tree rearrangements are deterministic algorithms that search for an optimal phylogenetic tree structure. They can be applied to any data set that is naturally arranged into a tree, but are most often used in computational phylogenetics, especially in maximum parsimony and maximum likelihood searches, which seek to identify the one tree among many possible trees that best explains the evolutionary history of a particular gene or species.

Basic tree rearrangements

Nearest neighbor interchange (NNI)
Subtree pruning and regrafting (SPR)
Tree bisection and reconnection (TBR)

The simplest tree rearrangement, known as nearest-neighbor interchange (NNI), exchanges the connectivity of four subtrees within the main tree. Because there are three possible ways of connecting four subtrees,^[1] and one is the original connectivity, each interchange creates two new trees. Exhaustively evaluating the nearest neighbors for every possible set of four subtrees is the slowest but most thorough way of performing this search. An alternative, more wide-ranging search, subtree pruning and regrafting (SPR), selects and removes a subtree from the main tree and reinserts it elsewhere on the main tree, creating a new node. Finally, tree bisection and reconnection (TBR) detaches a subtree from the main tree at an interior node and then attempts all possible connections between the edges of the two resulting trees. The more complex the rearrangement technique, the more computational time the search requires, although greater complexity does not necessarily yield better performance.^[2]
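The interchange around an internal edge can be illustrated with a minimal Python sketch. It assumes an unrooted binary tree stored as an adjacency map with string node labels; the representation and the helper names `build_tree`, `swap_subtrees` and `nni_neighbors` are illustrative choices, not part of any particular phylogenetics package. For each internal edge, the sketch keeps one subtree fixed and swaps a second one with each of the two subtrees on the other side of the edge, which yields the two alternative connectivities.

```python
def build_tree(edges):
    """Build an adjacency map {node: set(neighbors)} from (u, v) pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def swap_subtrees(adj, u, x, v, y):
    """Return a copy of adj in which x (hanging off u) and y (hanging off v) trade places."""
    new = {node: set(nbrs) for node, nbrs in adj.items()}
    new[u].remove(x)
    new[u].add(y)
    new[v].remove(y)
    new[v].add(x)
    new[x].remove(u)
    new[x].add(v)
    new[y].remove(v)
    new[y].add(u)
    return new


def nni_neighbors(adj):
    """List every tree that is one nearest-neighbor interchange away from adj."""
    neighbors = []
    for u in adj:
        for v in adj[u]:
            # Internal edges join two degree-3 nodes; u < v visits each edge once.
            if not (u < v and len(adj[u]) == 3 and len(adj[v]) == 3):
                continue
            a, b = sorted(adj[u] - {v})  # subtrees on the u side
            c, d = sorted(adj[v] - {u})  # subtrees on the v side
            # The original grouping is ab|cd; the alternatives are ac|bd and ad|bc.
            neighbors.append(swap_subtrees(adj, u, b, v, c))
            neighbors.append(swap_subtrees(adj, u, b, v, d))
    return neighbors


# Usage: a four-leaf tree has one internal edge and therefore two NNI neighbors.
quartet = build_tree([("A", "x"), ("B", "x"), ("x", "y"), ("C", "y"), ("D", "y")])
for tree in nni_neighbors(quartet):
    print(sorted(tree["x"]), sorted(tree["y"]))
```

Under these assumptions an exhaustive NNI step inspects two candidates per internal edge; a search procedure would score each candidate (for example by parsimony or likelihood) and keep the best one.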

SPR can be further divided into unrooted SPR (uSPR) and rooted SPR (rSPR). uSPR is applied to unrooted trees: break any edge, then join one end of the broken edge (selected arbitrarily) to any other edge in the tree. rSPR is applied to rooted trees*: break any edge except the edge leading to the root node, then join the end of the broken edge that is furthest from the root to any other edge of the tree.^[3]

* In this example the root of the tree is marked by a node of degree one, meaning that all nodes in the tree have either degree 1 or degree 3. An alternative approach, used in Bordewich and Semple, is to consider the root node to have degree 2, and to have a special rule for rSPR.
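The rSPR move can be sketched in Python under some simplifying assumptions. Here a rooted binary tree is a nested tuple (a leaf is a string, an internal node is a pair of children), and an edge is identified by the path of child indices leading to the node below it, so the empty path `()` stands for the edge leading to the root. The function names `get_subtree`, `replace_subtree` and `rspr` are hypothetical, paths are assumed to be valid, and the regraft path is interpreted in the tree that remains after pruning.

```python
def get_subtree(tree, path):
    """Follow a path of child indices (0 or 1) down from the top of the tree."""
    for index in path:
        tree = tree[index]
    return tree


def replace_subtree(tree, path, new):
    """Return a copy of tree with the node at path replaced by new."""
    if not path:
        return new
    children = list(tree)
    children[path[0]] = replace_subtree(tree[path[0]], path[1:], new)
    return tuple(children)


def rspr(tree, prune_path, graft_path):
    """Apply one rooted SPR move and return the rearranged tree.

    prune_path selects the edge to break (it must not be the root edge);
    graft_path selects the edge of the remaining tree to attach to.
    """
    if not prune_path:
        raise ValueError("the edge leading to the root cannot be broken")
    # Detach the subtree at the end of the broken edge furthest from the root.
    pruned = get_subtree(tree, prune_path)
    parent_path = prune_path[:-1]
    sibling = get_subtree(tree, parent_path)[1 - prune_path[-1]]
    # Suppress the now-unary parent by putting the sibling in its place.
    remainder = replace_subtree(tree, parent_path, sibling)
    # Subdivide the target edge with a new node that carries the pruned subtree.
    target = get_subtree(remainder, graft_path)
    return replace_subtree(remainder, graft_path, (target, pruned))


# Usage: move leaf B from beside A to beside D.
tree = ((("A", "B"), "C"), ("D", "E"))
print(rspr(tree, (0, 0, 1), (1, 0)))
```

Regrafting onto the edge above the pruned subtree's former sibling simply restores the original topology, so a search would normally skip that target.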

The number of SPR^[4] or TBR^[5] moves needed to transform one tree into another can be calculated by constructing a maximum agreement forest comprising rooted or unrooted trees, respectively. This problem is NP-hard but fixed-parameter tractable.

This article uses material from the Wikipedia article Nearest_neighbor_interchange, and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

[1] Felsenstein, Joseph (2004). Inferring Phylogenies. Sinauer Associates: Sunderland, MA. ISBN 9780878931774.

[2] Takahashi, Kei; Nei, Masatoshi (August 2000). "Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used". Molecular Biology and Evolution. 17 (8): 1251–1258. doi:10.1093/oxfordjournals.molbev.a026408. PMID 10908645.

[3] Bordewich, Magnus; Semple, Charles (2005). "On the computational complexity of the rooted subtree prune and regraft distance". Annals of Combinatorics. 8 (4): 409–423. doi:10.1007/s00026-004-0229-z. S2CID 13002129.

[4] Whidden, Chris; Beiko, Robert G.; Zeh, Norbert (2016). "Fixed-parameter and approximation algorithms for maximum agreement forests of multifurcating trees". Algorithmica. 74 (3): 1019–1054. arXiv:1305.0512. doi:10.1007/s00453-015-9983-z. S2CID 14297537.

[5] Chen, Jianer; Fan, Jia-Hao; Sze, Sing-Hoi (2015). "Parameterized and approximation algorithms for maximum agreement forest in multifurcating trees". Theoretical Computer Science. 562: 496–512. doi:10.1016/j.tcs.2014.10.031.

[6] Matsuda, H. (1996). "Protein phylogenetic inference using maximum likelihood with a genetic algorithm" (PDF). Pacific Symposium on Biocomputing 1996. pp. 512–523.

[7] Goloboff, Pablo A. (1999). "Analyzing Large Data Sets in Reasonable Times: Solutions for Composite Optima". Cladistics. 15 (4): 415–428. doi:10.1006/clad.1999.0122. PMID 34902941.
