because the amount of information contained on the Deep Web is much larger than the surface web, how to use it well has become a popular problem to research. When a query is sent to a deep web resource and the data sources return few results or even no result, a proper query relaxation solution should be adopted to get more satisfactory results to users. In this paper, such a query relaxation solution is presented. First, it solves the problem of relaxing attributes which contain multiple key words by value. That is, such attributes are not simply removed in the relaxation, but the query values of the attributes are modified. Second, when a data source returns many result pages, instead of getting all the pages, it evaluates the quality of the results in the current page to decide whether to send another query to fetch the next page. Thus, the number of query times is reduced. Finally, the experimental results demonstrate that both the result quality and the query efficiency are improved.
I. INTRODUCTIONWith the rapid expansion of the World Wide Web, there are a large number of web databases which contain a large amount of information hidden behind the web sites. Sending queries to these data sources has become a widely used way for people to get information. However, sometimes users may not be able to send queries which could get results that satisfy them, or even could not get any result at all. Such queries are called failed queries, which means, queries get few results or no result.When the case mentioned above occurs, query relaxation should be adopted in order to get more results. Query relaxation aims to modify the original query and change the constraints to avoid failed queries. Although these results could not satisfy users needs best, users may still accept them by compromise and we could filter the results in the results integration phase.Because results are returned as pages, when we want to get a result page, actually another query is sent to the data source. It s a good idea to evaluate the quality of the result pages which have been got, and if the results are good enough, it is not necessary to get the rest pages. Thus, the query times would be greatly reduced.In this paper, we focus on the query relaxation on the deep web and propose a solution which is both effective and of high quality. When a failed query occurs, a query relaxation is executed automatically to ensure to get some results to users.The contributions of this paper are as follows: