|
|
| Home >> Working Papers Series >> Jornadas de Tratamiento y Recuperación de la Información >> A Bayesian Approach to WSD for the Retrieval of XML Documents |
|
A Bayesian Approach to WSD for the Retrieval of XML Documents
Jornadas de Tratamiento y Recuperación de la Información / Departamento de Biblioteconomía y Documentación y Departamento de Informática de la Universidad Carlos III de Madrid Abstract: Sources of XML documents are today proliferating on the World Wide Web. An important feature of XML is that information on documents structures is available on the Web together with the documents contents. This information can be exploited to improve document handling and to improve query processing. In such an heterogeneous environment as the Web, it is not reasonable to assume that there are XMLdocuments which always satisfy a certain query. A metric for quantifying the structural similarity between an XML document and a query is necessary. The aim is to develop a technique which could allow for aproximate quering, that is, based on structural similarity and synonymy between tags of XML documents. In this paper, we present analgorithm for the retrieval of XML documents which is based on the structural and semantic similarity of a document with a given query. For the semantic indexing of the tags of XML documents and queries, the naive Bayesian approach and the WordNet ontology were used.
(go top) |
Last
updated: 2008-05-15 04:02:24 DoIS team
Italian DoIS