TY - JOUR
T1 - Finding similar questions in collaborative question answering archives
T2 - Toward bootstrapping-based equivalent pattern learning
AU - Hao, Tianyong
AU - Agichtein, Eugene
PY - 2012/6
Y1 - 2012/6
N2 - Many questions submitted to Collaborative Question Answering (CQA) sites have similar questions answered before. We propose a precise approach to automatically finding an answer to such questions by identifying "equivalent" questions that were submitted and answered in the past. Our method is based on automatically generating equivalent question patterns by grouping together questions that have previously obtained the same answers. The generated patterns are used as seed patterns to match more questions and extract a large number of equivalent patterns with a new bootstrapping-based learning method. The resulting patterns can be applied to match a new question to an equivalent one that has already been answered, and thus suggest potential answers automatically. We experimented with this approach over a large collection of more than 200,000 real questions drawn from the Yahoo! Answers archive, automatically acquiring over 16,991 groups of equivalent question patterns. These patterns allow our method to obtain over 57% recall and over 54% precision in automatically suggesting an answer to new questions, significantly improving over baseline methods. © 2012 Springer Science+Business Media, LLC.
AB - Many questions submitted to Collaborative Question Answering (CQA) sites have similar questions answered before. We propose a precise approach to automatically finding an answer to such questions by identifying "equivalent" questions that were submitted and answered in the past. Our method is based on automatically generating equivalent question patterns by grouping together questions that have previously obtained the same answers. The generated patterns are used as seed patterns to match more questions and extract a large number of equivalent patterns with a new bootstrapping-based learning method. The resulting patterns can be applied to match a new question to an equivalent one that has already been answered, and thus suggest potential answers automatically. We experimented with this approach over a large collection of more than 200,000 real questions drawn from the Yahoo! Answers archive, automatically acquiring over 16,991 groups of equivalent question patterns. These patterns allow our method to obtain over 57% recall and over 54% precision in automatically suggesting an answer to new questions, significantly improving over baseline methods. © 2012 Springer Science+Business Media, LLC.
KW - Bootstrapping
KW - Collaborative question answering
KW - Equivalent pattern
KW - Pattern extension
UR - http://www.scopus.com/inward/record.url?scp=84861456868&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-84861456868&origin=recordpage
U2 - 10.1007/s10791-012-9188-x
DO - 10.1007/s10791-012-9188-x
M3 - RGC 21 - Publication in refereed journal
SN - 1386-4564
VL - 15
SP - 332
EP - 353
JO - Information Retrieval
JF - Information Retrieval
IS - 3-4
ER -