Abstract: The exponential growth of documents available in the World Wide Web makes it ever more difficult to discover relevant information on a specific topic. In this context, growing interest is emerging in focused crawling, a technique that dynamically browses the Internet by choosing directions that maximize the probability of discovering relevant pages, given a specific topic. Predicting the relevance of a document before seeing its contents (i.e., relying on the parent pages only) is one of the central problems in focused crawling because it can save significant bandwidth resources. This paper gives an overview of the various evaluating methods and technique that can be used for focused crawlers. The ultimate aim of these techniques is to predict the values of a discrete class attribute.
Keywords: DP, Focused Crawler, Classifier, Deep Web, WEB 2.0
Title: Evaluation Methods for Focused Crawler: An Over view
Author: Dr. Vivek Chandra, Ms. Nidhi Saxena
International Journal of Computer Science and Information Technology Research
ISSN 2348-120X (online), ISSN 2348-1196 (print)
Research Publish Journals