Internet searching or web searching is done by what are called web search engines.A web search engine is a tool designed to search for information on the World Wide Web. The search results are usually presented in a list of results and are commonly called hits.Google for example is a search engine. It achieved better results for many searches with an innovation called PageRank. A search engine does the following operations:
1. Web crawling
a Web crawler (sometimes also known as a spider) â€ an automated Web browser which follows every link on the site.The contents of each page are then analyzed to determine how it should be indexed.The Data so obtained through crawling about web pages are stored in an index database for use in later queries.Some store all or part of the source page (referred to as a cache) as well as information about the web pages.
When a user enters a query into a search engine it examines its index and provides a listing of best-matching web pages according to its criteria, usually with a short summary containing the document's title and sometimes parts of the text. The index is built from the information stored with the data during the indexing stage. Relevance of the results is determined by several algorithms such as Page Rank(Google).Here page Rank reagents the probability of a random user in visiting a page. Direct Hit uses popularity data (number of visitors of a specific link) more and more search engines are providing rankings based on comprehensive analysis.
The following PDF has an abstract about searching ang representation of search results.