DSpace Repository

Learning document type definition from XML documents

Show simple item record

dc.contributor.author Phanom Slisatkorn en_US
dc.date.accessioned 2015-01-12T10:39:14Z
dc.date.available 2015-01-12T10:39:14Z
dc.identifier.other AIT Thesis no.CS-00-19 en_US
dc.identifier.uri http://www.cs.ait.ac.th/xmlui/handle/123456789/175
dc.description 46 leaves en_US
dc.description.abstract Extensible Markup Language (XML) is a new Web specification especially designed for delivering structure content over the Web and currently plays an increasingly significant role in the Web application and the data interchange format. XML documents can optionally include rules to restrict the structure of elements and attributes in Document Type Definition (DTD) or XML schema, which provide a way to validate the structure and content of documents. However, DTD is not compulsory and its creation from scratch presents some complications. Therefore, this research aims to provide a learning mechanism to obtain quality DTD from a set of XML instances. We present an innovative concept by introducing the star height of the variables into our process for precisely inferring ?, +, * meta characters and enabling regular expression pattern detection between input sequences. Along with the factoring, reduction and generalization step, a concise meaningful DTD can be inferred by the learning mechanism. Experiments are carried out to demonstrate the effectiveness of the mechanism and compare it efficiency with that of the existing approaches.
dc.relation.ispartof Asian Institute of Technology. Thesis no. CS-00-19 en_US
dc.relation.ispartof Thesis no. CS-00-19 en_US
dc.subject XML (Document markup language) en_US
dc.title Learning document type definition from XML documents en_US
dc.type Thesis en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace

Advanced Search


My Account