Publicity, Discussion and Events

About

This is an independent DIY search engine that focuses on non-commercial content, and attempts to
show you sites you perhaps weren’t aware of in favor of the sort of sites you probably already knew
existed.

The software for this search engine is all custom-built, and all crawling and indexing is
done in-house. The project is open source. Feel free to poke about in the source code or contribute
to the development!

The entire search engine is hosted off a single PC in Sweden, albeit with pretty solid specs.

The search engine is currently serving about 47 queries/minute.

Consider supporting the
project
!

Tips

This search engine isn’t particularly well equipped to answering queries
posed like questions, instead try to imagine some text that might appear
in the website you are looking for, and search for that.

Where this search engine really shines is finding small, old and obscure websites about some
given topic, perhaps
old video games,
a mystery,
theology,
the occult,
knitting,
computer science,
or art.

Updates

☛ A recipe filter has been added to the algorithm selector.

☛ The Random Mode has been overhauled, and is
quite entertaining. I encourage you to give it a spin.

☛ A simple public API is now available.

This is a keyword-based search engine. When entering multiple search terms, the search engine will
attempt to match them against documents where the terms occur in close proximity.

Search terms can be excluded with a hyphen.

While the search engine at present does not allow full text search, quotes can be used to
specifically search for names or terms in the title. Using quotes will also cause the search engine
to be as literal as possible in interpreting the query.

Parentheses can be used to add terms to the query without giving weight to the terms when ranking
the search results.

Samples

soup -chicken
Look for keywords that contain soup, but not
chicken.
“keyboard”
Look for pages containing the exact word
keyboard, not keyboards or the like.
“steve mcqueen”
Look for pages containing the exact words steve mcqueen
in that order, with no words in between.
apology (plato)
Look for pages containing apology and plato, but only rank them
based on their relevance to apology

Several special keywords are supported by the search engine.

Keyword Meaning
site:example.com Display site information about example.com
site:example.com keyword Search example.com for keyword
explore:example.com Show similar websites to example.com
ip:127.0.0.1 Search documents hosted at 127.0.0.1
links:example.com Search documents linking to example.com
tld:edu keyword Search documents with the top level domain edu.
?tld:edu keyword Prefer but do not require results with the top level domain edu.
This syntax is also possible for links:…, ip:… and site:…
q>5 The amount of javascript and modern features is at least 5 (on a scale 0 to 25)
q<5 The amount of javascript and modern features is at most 5 (on a scale 0 to 25)
year>2005 (beta) The document was ostensibly published in or after 2005 (temp. unavailable due to a bug)
year=2005 (beta) The document was ostensibly published in 2005 (temp. unavailable due to a bug)
year<2005 (beta) The document was ostensibly published in or before 2005 (temp. unavailable due to a bug)
rank>50 (beta) The ranking of the website is at least 50 in a span of 1 – 255
rank<50 (beta) The ranking of the website is at most 50 in a span of 1 – 255
format:html5 Filter documents using the HTML5 standard. This is typically modern websites.
format:xhtml Filter documents using the XHTML standard
format:html123 Filter documents using the HTML standards 1, 2, and 3. This is typically very old websites.
-special:media Filter out documents with audio or video tags
-special:scripts Filter out documents with javascript
-special:affiliate Filter out documents with likely Amazon affiliate links
-special:tracking Filter out documents with analytics or tracking code
-special:cookies Filter out documents with cookies

The estimated relevance of the search result is indicated using the color saturation
of the color of the search result, as well as the order the results are presented.

Information about the position of the match is indicated using a dot matrix
in the bottom bar of each search result. Each dot represents four sentences,
and are presented in an order of top-to-bottom, left-to-right.

⣿⠃⠀⠀   — The terms occur heavily toward the beginning of the document.

⠠⠀⡄⠁   — The terms occur sparsely throughout the document.

⠀⠁⠀⠀   — The terms occur only in a single sentence.

Potentially problems with the document are presented with a warning triangle, e.g. ⚠ 3.
Desktop users can mouse-over this to get a detailed breakdown.

Read More