1 shows the proposed design for the RDF crawler. The next section describes how this can be used in . Figure clustering documents according to similarity.
Fig. 1. RDF Crawler Design
Semantic Web Document clustering is an Open Research Topic and has not been experimented with until now, the advantage of this technique is that precision and recall rates of web searches can be significantly enhanced, thus reducing the problem of information overload. An enhanced version of the Suffix Tree Algorithm (Zamir and Etzioni) can be used to categorize documents according to their type. The high level structure hence produced when stored in the form of a tree it will have the advantages of faster fetch rates and a hierarchically ordered information structure. The information contained in such a high-level structure will be very precise and easy to retrieve.
2. Related Work
Staab (Staab et al, 2004)fragments of fragments of RDF from the Internet and builds a knowledge base from its data. A host of presents work on the Ontotext RDF crawler which downloads interconnected URIs to be retrieved as well as URI filtering conditions are maintained at every phase of RDF crawling. 1 /
搜索“diyifanwen.net”或“第一范文网”即可找到本站免费阅读全部范文。收藏本站方便下次阅读,第一范文网,提供最新人文社科Southampton and The Open University. Preface(8)全文阅读和word下载服务。
相关推荐: