TY - JOUR
T1 - Reverse engineering from an XML document into an extended DTD graph
AU - Shiu, Herbert
AU - Fong, Joseph
PY - 2009/4
Y1 - 2009/4
N2 - Extensible Markup Language (XML) has become a standard for persistent storage and data interchange via the Internet due to its openness, self-descriptiveness, and flexibility. This article proposes a systematic approach to reverse engineering arbitrary XML documents into their conceptual schema, an extended DTD graph, which is a DTD graph with data semantics. The proposed approach not only determines the structure of the XML document but also derives candidate data semantics from the XML element instances by treating each XML element instance as a record in a table of a relational database. One application of the determined data semantics is to verify the linkages among elements. Implicit and explicit referential linkages among XML elements are modeled by the parent-children structure and ID/IDREF(S), respectively. As a result, an arbitrary XML document can be reverse engineered into its conceptual schema in an extended DTD graph format. © 2009, IGI Global.
AB - Extensible Markup Language (XML) has become a standard for persistent storage and data interchange via the Internet due to its openness, self-descriptiveness, and flexibility. This article proposes a systematic approach to reverse engineering arbitrary XML documents into their conceptual schema, an extended DTD graph, which is a DTD graph with data semantics. The proposed approach not only determines the structure of the XML document but also derives candidate data semantics from the XML element instances by treating each XML element instance as a record in a table of a relational database. One application of the determined data semantics is to verify the linkages among elements. Implicit and explicit referential linkages among XML elements are modeled by the parent-children structure and ID/IDREF(S), respectively. As a result, an arbitrary XML document can be reverse engineered into its conceptual schema in an extended DTD graph format. © 2009, IGI Global.
KW - Data semantics
KW - Extended DTD graph
KW - Reverse engineering
KW - XML document
UR - http://www.scopus.com/inward/record.url?scp=73649134367&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-73649134367&origin=recordpage
U2 - 10.4018/jdm.2009040103
DO - 10.4018/jdm.2009040103
M3 - RGC 21 - Publication in refereed journal
SN - 1063-8016
VL - 20
SP - 38
EP - 57
JO - Journal of Database Management
JF - Journal of Database Management
IS - 2
ER -