TY - GEN
T1 - Efficient algorithms for the closest string and distinguishing string selection problems
AU - Wang, Lusheng
AU - Zhu, Binhai
PY - 2009
Y1 - 2009
N2 - In the paper, we study three related problems, the closest string problem, the farthest string problem and the distinguishing string selection problem. These problems have applications in motif detection, binding sites locating, genetic drug target identification, genetic probes design, universal PCR primer design, etc. They have been extensively studied in recent years. The problems are defined as follows: The closest string problem: given a group of strings s n }, each of length L, and an integer d, the problem is to compute a center string s of length L such that the Hamming distance d(s, s i )≤d for all . The farthest string problem: given a group of strings , with all strings of the same length L, and an integer d b , the farthest string problem is to compute a center string s of length L such that the Hamming distance d(s,g j) L-d b for all . The distinguishing string selection problem: given two groups of strings (bad genes) and (good genes), and , with all strings of the same length L, and two integers d b and d g with d g L-d b , the Distinguishing String Selection problem is to compute a center string s of length L such that the Hamming distance and the Hamming distance d(s,g j)d g for all . Our results: We design an O(Ln+nd(|∑-1|) d 23.25d) time fixed parameter algorithm for the closest string problem which improves upon the best known O(Ln+nd24d ×(|∑|-1) d ) algorithm in [14], where |∑| is the size of the alphabet. We also design fixed parameter algorithms for both the farthest string problem and the distinguishing string selection problem. Both algorithms run in time when the input strings are binary strings over ∑={0, 1}. © 2009 Springer Berlin Heidelberg.
AB - In the paper, we study three related problems, the closest string problem, the farthest string problem and the distinguishing string selection problem. These problems have applications in motif detection, binding sites locating, genetic drug target identification, genetic probes design, universal PCR primer design, etc. They have been extensively studied in recent years. The problems are defined as follows: The closest string problem: given a group of strings s n }, each of length L, and an integer d, the problem is to compute a center string s of length L such that the Hamming distance d(s, s i )≤d for all . The farthest string problem: given a group of strings , with all strings of the same length L, and an integer d b , the farthest string problem is to compute a center string s of length L such that the Hamming distance d(s,g j) L-d b for all . The distinguishing string selection problem: given two groups of strings (bad genes) and (good genes), and , with all strings of the same length L, and two integers d b and d g with d g L-d b , the Distinguishing String Selection problem is to compute a center string s of length L such that the Hamming distance and the Hamming distance d(s,g j)d g for all . Our results: We design an O(Ln+nd(|∑-1|) d 23.25d) time fixed parameter algorithm for the closest string problem which improves upon the best known O(Ln+nd24d ×(|∑|-1) d ) algorithm in [14], where |∑| is the size of the alphabet. We also design fixed parameter algorithms for both the farthest string problem and the distinguishing string selection problem. Both algorithms run in time when the input strings are binary strings over ∑={0, 1}. © 2009 Springer Berlin Heidelberg.
KW - Fixed parameter algorithms
KW - The closest string
KW - The distinguishing string selection
KW - The farthest string
UR - http://www.scopus.com/inward/record.url?scp=70350645572&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-70350645572&origin=recordpage
U2 - 10.1007/978-3-642-02270-8_27
DO - 10.1007/978-3-642-02270-8_27
M3 - RGC 32 - Refereed conference paper (with host publication)
SN - 3642022693
SN - 9783642022692
VL - 5598 LNCS
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 261
EP - 270
BT - Frontiers in Algorithmics
PB - Springer Verlag
T2 - 3rd International Frontiers of Algorithmics Workshop, FAW 2009
Y2 - 20 June 2009 through 23 June 2009
ER -