A
search engine is a software programme resident on a computer
that searches through a (usually massive) database. In the context
of the World Wide Web, the word "search engine" is most
often used for search forms that search through databases of HTML
documents gathered by a robot. Our Crawler (Spider Monkey) visits and checks URLs during server off-peak load times and feeds the result to the index. All realms of the main database are refreshed no less than every 30 days. This temp. database is minimally crawled twice monthly and while a URL is fetched from the actual site, each entry here remains for a period of roughly 60 days to verify when and how it was submitted. Note: URLs submitted to our own Site Submit Service or submitted remotely by other authorized servers do not appear in the temp. database but can be found using the Mouse House Search Engine. Spider
Monkey
abides by the Robot
Exclusion Standard. Specifically, Spider
Monkey
adheres to the 1994
Robots Exclusion Standard (RES). Where the 1996
proposed standard supercedes the 1994 standard, the proposed standard is
followed. Before you submit your site for inclusion in our database (index), are there pages you don't want indexed? If so, put the following in the head of any web page you want excluded. Our crawler (Spider Monkey) will obey this instruction and skip the document. <META NAME="robots" CONTENT="noindex"> Do you use meta content tags? You should at least set out the content of the page as succinctly as possible. If present, this will become the introduction to your page in the search results our visitors see. An example follows:
|
Searched for dns | 1-10 of 73 | 901155 pages searched |
Our search engine finds documents at Mouse House and throughout the World Wide Web. Here's how it works: you tell our search engine what you're looking for by typing in keywords, phrases, or questions in the search box. Our search engine responds by giving you a list of all the Web pages in our crawler's (we call it SpiderMonkey and you can read its technical details from the WWW robot registry by clicking here) index relating to those topics. The most relevant content will appear at the top of your results. Most foul language is ignored by our Search Engine. Conclude it is not a tool for seeking porn sites. |
Spider Monkey's index is a large, growing, organized collection of data comprised of Web pages, their content and location and discussion group pages from around the world. The 'index' becomes larger every day as people send us the addresses for new Web pages and as our systems administrators search for new material. We own sophisticated technology that crawls the Web daily during lower server load periods looking for links to new pages. When you use the Mouse House search engine, you search the entire collection using keywords or phrases, just like other search engines such as Yahoo or Alta Vista |