How Search Engines Work
- How Search Engines Work
- Search engines are the names given to computer programs that allow us to access the information we want to reach quickly and easily without being lost in the infinite of the internet. In order to be able to answer inquiries sent to him, a search engine must “see” the information on the internet in advance. In other words, a search engine can only give you pages that “see” and “remember” before. If he has not seen it before, or has seen it, he will not be able to offer the pages he can not remember. But how does a computer see and remember pages? That’s where you come in from our door.
- The pages on the Internet are connected via links. It is possible to switch from one page to another on this page. What’s more, pages normally link to pages that are relevant to them. For example, it is very unlikely that you will be able to link to a French page about a heart surgery, a Spanish page or even a French page about cat food. Here search engines take advantage of this feature and start on a site and start circulating on the internet. When they see a page they look at the same page and try to understand its content, just as a user would look at it with a browser such as Internet Explorer or Firefox. Then they write the contents one at a time in their memory (hard disks) and follow the links on the page to reach the other pages and they do the same. So when visiting the internet, they visit as many sites as possible and try to remember them.
- Remembering the pages is basically due to the words on the page. In lists called “Index” they hold the words in the page, such as the contents of a book or the index at the end of a book. So they try to keep in memory which words pass through which pages they see. The more sophisticated search engines include the number of times a word is passed on the page, the location of the words on the page, the position relative to each other, the words used for external links within the page, the page heading, the headings on the page, the words in uppercase and lowercase, Such as the content of the other pages linking to the page, and the content of the external links the page links to, and stores them in indices.
- It is easy to access the information on the page when it is necessary to index it instead of hiding it as it is. To make an analogy, we still need the chapter of contents, although the whole book is there for us. When looking for a place in our book, we first look at chapter titles, page titles, etc. Or, if an important book for us is found in another book atif, we will try to keep it relevant and alive in the first place. Here is a similar idea that search engines are applying for internet pages and sites.
- When a query comes in from the user, the search engines immediately look at the indexes and try to find the pages of the words passed by the query. It then sorts the pages according to various criteria and displays the results to the user.
- Once again, search engines are basically three parts. The first part is called crawler or web spider in English and it is a part that navigates pages and collects contents. The second module is a module that inspects the pages collected from the internet and stores them in indexes. The last section, the query module, finds the queries from the user in the indexes created in the second part, and displays the order and the user.