Relaxed global term weights for XML element search

A Keyaki, K Hatano, J Miyazaki - … Workshop of the Initiative for the …, 2011 - Springer
Comparative Evaluation of Focused Retrieval: 9th International Workshop of the …, 2011 - Springer
Abstract
XML element search engines return XML elements, which are parts of XML documents, as search results. Existing approaches to XML element search are adapted from information retrieval techniques for document search. Global term weights can be calculated from the statistics of XML elements that share 1) the same path expression or 2) the same tag. In the first approach, the more complex a path expression is, the fewer XML elements match it. As a result, global term weights may be calculated from the statistics of only a few XML elements, and such weights are not truly global. The second approach has its own problem: it does not consider the document structure of XML elements. To resolve these problems, we propose a method for calculating accurate global weights. In our method, we regard a path expression as an array of tags. We relax the restrictions on the order and frequency of tags in a path expression so that similar path expressions are gathered into the same class. In this way, we try to decrease the number of classes that contain hardly any elements. Our experimental results on a certain test collection show that our method can integrate path expressions without decreasing search accuracy.
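The grouping idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact relaxation rules, so the code below assumes the simplest reading: a path expression is split into its tags, and both tag order and tag frequency are ignored by using the set of tags as the class key. The function names (split_path, exact_key, relaxed_key, group_paths) and the sample paths are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def split_path(path):
    """Turn '/article/sec/sec/p' into ['article', 'sec', 'sec', 'p']."""
    return [tag for tag in path.split("/") if tag]


def exact_key(path):
    # Baseline (approach 1): every distinct path expression is its own class.
    return tuple(split_path(path))


def relaxed_key(path):
    # Assumed relaxation: ignore both the order and the frequency of tags,
    # so the class key is simply the set of tags in the path.
    return frozenset(split_path(path))


def group_paths(paths, key_fn):
    """Group path expressions into classes according to key_fn."""
    classes = defaultdict(list)
    for path in paths:
        classes[key_fn(path)].append(path)
    return dict(classes)


# Hypothetical path expressions, made up for illustration.
paths = [
    "/article/sec/p",
    "/article/sec/sec/p",
    "/article/sec/sec/sec/p",
    "/article/bdy/sec/p",
]

exact_classes = group_paths(paths, exact_key)
relaxed_classes = group_paths(paths, relaxed_key)

print(len(exact_classes), "classes with exact path matching")
print(len(relaxed_classes), "classes with relaxed matching")
```

Under this assumption the three nested-section paths fall into one class, so term statistics for that class would be gathered from more elements than any single exact path provides. How the paper actually defines the relaxation and computes the weights may differ from this sketch.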