TY - GEN
T1 - On the Tractability of Maximal Strip Recovery
AU - Wang, Lusheng
AU - Zhu, Binhai
PY - 2009/5
Y1 - 2009/5
N2 - Given two genomic maps G and H represented by a sequence of n gene markers, a strip (syntenic block) is a sequence of distinct markers of length at least two which appear as subsequences in the input maps, either directly or in reversed and negated form. The problem Maximal Strip Recovery (MSR) is to find two subsequences G' and H' of G and H, respectively, such that the total length of disjoint strips in G' and H' is maximized (or, conversely, the number of markers hence deleted, is minimized). Previously, besides some heuristic solutions, a factor-4 polynomial-time approximation is known for the MSR problem; moreover, several close variants of MSR, MSR-d (with d > 2 input maps), MSR-DU (with marker duplications) and MSR-WT (with markers weighted) are all shown to be NP-complete. Before this work, the complexity of the original MSR problem was left open. In this paper, we solve the open problem by showing that MSR is NP-complete, using a polynomial time reduction from One-in-Three 3SAT. We also solve the MSR problem and its variants exactly with FPT algorithms, i.e., showing that MSR is fixed-parameter tractable. Let k be the minimum number of markers deleted in various versions of MSR, the running time of our algorithms are O(22.73kn +n2) for MSR, O(22.73kdn + dn2) for MSR-d, and O(25.46kn + n2) for MSR-DU.
AB - Given two genomic maps G and H represented by a sequence of n gene markers, a strip (syntenic block) is a sequence of distinct markers of length at least two which appear as subsequences in the input maps, either directly or in reversed and negated form. The problem Maximal Strip Recovery (MSR) is to find two subsequences G' and H' of G and H, respectively, such that the total length of disjoint strips in G' and H' is maximized (or, conversely, the number of markers hence deleted, is minimized). Previously, besides some heuristic solutions, a factor-4 polynomial-time approximation is known for the MSR problem; moreover, several close variants of MSR, MSR-d (with d > 2 input maps), MSR-DU (with marker duplications) and MSR-WT (with markers weighted) are all shown to be NP-complete. Before this work, the complexity of the original MSR problem was left open. In this paper, we solve the open problem by showing that MSR is NP-complete, using a polynomial time reduction from One-in-Three 3SAT. We also solve the MSR problem and its variants exactly with FPT algorithms, i.e., showing that MSR is fixed-parameter tractable. Let k be the minimum number of markers deleted in various versions of MSR, the running time of our algorithms are O(22.73kn +n2) for MSR, O(22.73kdn + dn2) for MSR-d, and O(25.46kn + n2) for MSR-DU.
UR - http://www.scopus.com/inward/record.url?scp=67650257442&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-67650257442&origin=recordpage
U2 - 10.1007/978-3-642-02017-9_42
DO - 10.1007/978-3-642-02017-9_42
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 9783642020162
SN - 364202016X
T3 - Lecture Notes in Computer Science
SP - 400
EP - 409
BT - Theory and Applications of Models of Computation
A2 - Chen, Jianer
A2 - Cooper, S. Barry
PB - Springer Verlag
T2 - 6th Annual Conference on Theory and Applications of Models of Computation (TAMC 2009)
Y2 - 18 May 2009 through 22 May 2009
ER -