Preview

03 - Indexes and Web crawlers

 1. How do search engines know where to look? How can search engines recommend a few pages out of the trillions that exist? The answer lies with _________________.

  web bugs

  web browsers

  web insects

  web crawlers

 2. Web crawlers are computer programs that scan the web, "reading" everything they find.

  False

  True

 3. Crawlers are also known as spiders, _____ and automatic indexers.

  mots

  bugs

  speedys

  bots

 4. These crawlers scan web pages to see what words they contain, and where those words are used. The crawler turns its findings into a giant _____.

  web

  book

  index

  spider

 5. The index is basically a__________________________.
For example, when you ask a search engine for pages about mooses, the search engine checks its index and gives you a list of pages that mention mooses.

  big list of bugs and errors on the internet.

  big list of letters that can be sorted.

   big list of words and the web pages that feature them.

  big list of web browsers that exist.

 6. Crawlers ______ the web regularly so they always have an up-to-date index of the web.

  run

  destroy

  scan

  eat

 7. Once the crawler has found information by crawling over the web, the program builds the index. The index contains the words as well as their ____________.

  chinese spelling

  meaning

  language

  location

 8. The Google Search index contains hundreds of billions of web pages and is well over 100,000,000 gigabytes in size.

  FALSE

  TRUE

 9. Google does not want to recommend disreputable websites, so if you engage in spammy practices you may be penalised by having your website _____________.

  put to the top of the search results

  indexed without your permission

  de-indexed

  re-indexed

 10. ___________ is the web crawler software used by Google, which collects documents from the web to build a searchable index for the Google Search engine.

  MoBot

  BotGo

  MooBoo

  GoogleBot