AN EFFICIENT METHOD FOR WEB DATA EXTRACTION USING PARTIAL TREE ALIGNMENT ALGORITHM

Authors

  • M. Karthikeyan*, P. Aruna

Abstract

With the explosion of the World Wide Web, a wealth of data on many different subjects has become available online. Users usually retrieve Web data by browsing and keyword searching, but these traditional methods have limitations. A search engine retrieves relevant web sites based on the keywords specified by the user, performing operations such as crawling and indexing. It typically displays thousands of links as the result of a web search, and many roadblocks can make this process difficult or even impossible. The proposed system therefore aims to overcome these disadvantages of search engines by exploring the contents of a web page as fully as possible and finding the exact keywords that match the page. When the search engine then searches for web pages related to an exact keyword, it can return only a few pages that are highly focused, specific, and relevant to the topic, so the end user gets the required information related to the search. Experiments show that the new approach is feasible and effective.

Published

2013-10-13