Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add "Orion and Ad-blocking Tests" documentation page #659

Open
wants to merge 7 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions docs/.vitepress/config.ts
Original file line number Diff line number Diff line change
Expand Up @@ -444,6 +444,7 @@ function sidebarOrion() {
link: '/orion/privacy-and-security/ad-tracking-blocking',
items: [
{ text: 'Configure Ad & Tracking Blocking', link: '/orion/privacy-and-security/ad-tracking-blocking' },
{ text: 'Orion and Ad-blocking Tests', link: '/orion/privacy-and-security/adblock-tests' },
{ text: 'Respecting Privacy', link: '/orion/privacy-and-security/respecting-privacy' },
{ text: 'Protecting Privacy', link: '/orion/privacy-and-security/protecting-privacy' },
{ text: 'Preventing Fingerprinting', link: '/orion/privacy-and-security/preventing-fingerprinting' },
Expand Down
4 changes: 2 additions & 2 deletions docs/kagi/privacy/privacy-protection.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,9 +20,9 @@ We only collect the bare necessities to run the service. Please see our [privac

## Email Address Collection

We tie Kagi Search accounts to email addresses so we can help users with account recovery should they ever need it. Rest assured that all Kagi Searches are anonymized and are never tied to your specific account.
We tie Kagi Search accounts to email addresses so we can assist users with account recovery should they ever need it. Rest assured that all Kagi Searches are anonymized and are never linked to your specific account.

Note that you can register for Kagi Search with any email address you control. You do not have to use an email address that can easily identify you.
Please note that you can register for Kagi Search with any email address you control. You do not need to use an email address that can easily identify you.

## Building Trust

Expand Down
13 changes: 7 additions & 6 deletions docs/kagi/search-details/search-sources.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ Kagi is known for delivering a unique flavor of high-quality search results, sou

We don't stop there; we are always trying new things to surface relevant, high-quality results. For example, we recently launched the [Kagi Small Web](https://blog.kagi.com/small-web) initiative, which showcases content from personal blogs and discussions around the web. Discovering high-quality content written without the motive of financial gain gives Kagi's search results a unique flavor and makes it feel more humane to use.

Our search results also include anonymized API calls to all major search result providers worldwide, specialized search engines like [Marginalia](https://search.marginalia.nu), and sources of vertical information such as Wolfram Alpha, Apple, Wikipedia, Open Meteo, Yelp, TripAdvisor, and other APIs. Typically, every search query on Kagi will call a dozen or so different sources simultaneously, all with the purpose of bringing the best possible search results to the user in a split-second.
Our search results also include anonymized API calls to all major search result providers worldwide, specialized search engines like [Marginalia](https://search.marginalia.nu), and sources of vertical information such as Wolfram Alpha, Apple, Wikipedia, Open Meteo, Yelp, TripAdvisor, and other APIs. Typically, every search query on Kagi will call a dozen or so different sources simultaneously, all with the purpose of bringing the best possible search results to the user in a split second.

Our unique algorithms down-rank pages with a lot of ads and trackers (which we have found correlate with a decrease in content quality) and promote content from independent, ad-free sources and personal websites. This ensures that Kagi shows results that delight users and are [worth paying for](https://help.kagi.com/kagi/why-kagi/why-pay-for-search.html). Subscriptions from our members pay for search results, allowing Kagi to remain [ad-free](https://blog.kagi.com/age-pagerank-over) and [100% privacy-respecting](https://kagi.com/privacy).

Expand Down Expand Up @@ -38,17 +38,18 @@ A modern search engine is much more than just 'links'. Other sources in Kagi inc

Why do Kagi's search results stand out even though other search engines have access to the same sources? The main reasons are:

**Kagi focuses on users, not advertisers**: This user-centric approach significantly enhances our ability to highlight high-quality search results. For instance, we effectively mitigate SEO spam by down-ranking websites reliant on ads or trackers. Since these spam websites predominantly monetize through ads, they become easily detectable. Our consistent strategy of combating ads in all forms prioritizes high-quality web results. Kagi is demonstrating to the world that solid web search is possible and that [bad search results are a choice](https://pluralistic.net/2024/04/04/teach-me-how-to-shruggie/) made by legacy, ad-supported, search engines.
**Kagi focuses on users, not advertisers**: This user-centric approach significantly enhances our ability to highlight high-quality search results. For instance, we effectively mitigate SEO spam by down-ranking websites reliant on ads or trackers. Since these spam websites predominantly monetize through ads, they become easily detectable. Our consistent strategy of combating ads in all forms prioritizes high-quality web results. Kagi is demonstrating to the world that solid web search is possible and that [bad search results are a choice](https://pluralistic.net/2024/04/04/teach-me-how-to-shruggie/) made by legacy, ad-supported search engines.

**Innovative search experience**: Product features such as [promoting and blocking domains](../features/website-info-personalized-results.md) allow the users to be in control of their search feed. The most promoted and blocked domains among Kagi users can be seen [in Kagi Stats](https://kagi.com/stats?stat=leaderboard).
**Innovative search experience**: Product features such as [promoting and blocking domains](../features/website-info-personalized-results.md) allow users to be in control of their search feed. The most promoted and blocked domains among Kagi users can be seen [in Kagi Stats](https://kagi.com/stats?stat=leaderboard).

With [Search Lenses](../features/lenses.md) users can explore search results from a subset of the web, for example their favorite hobby or work related websites. With [Quick Answers](../ai/quick-answer.md) users can get a brief summary of the search results and [Summarize Page](../ai/summarize-page.md) allows Kagi users to summarize any result (even a YouTube video!).
With [Search Lenses](../features/lenses.md), users can explore search results from a subset of the web, for example, their favorite hobby or work-related websites.
With [Quick Answers](../ai/quick-answer.md), users can get a brief summary of the search results, and [Summarize Page](../ai/summarize-page.md) allows Kagi users to summarize any result (even a YouTube video!).



**Superior default algorithmic results**: Many users find Kagi's default search results to be unparalleled in quality. This stems from our unrelenting pursuit of the finest results, from all available sources, regardless of the cost. We call upon multiple, diverse, information sources for each search. We know that our members care about always getting the best search results possible, and they are ready to pay for that.
**Superior default algorithmic results**: Many users find Kagi's default search results to be unparalleled in quality. This stems from our unrelenting pursuit of the finest results from all available sources, regardless of the cost. We call upon multiple, diverse information sources for each search. We know that our members care about always getting the best search results possible, and they are ready to pay for that.

If you can not find something on Kagi, you likely can not find it anywhere.
If you cannot find something on Kagi, you likely cannot find it anywhere.

**Unique results from our own indexes (Teclis and TinyGem)**: Our in-house indexes help us uncover high-quality content from blogs and "small web" discussions, typically without ads and tracking. This specific approach lends Kagi results a more human, relatable flavor, distinguishing us from other search engines.

Expand Down
14 changes: 7 additions & 7 deletions docs/kagi/why-kagi/ai-philosophy.md
Original file line number Diff line number Diff line change
@@ -1,18 +1,18 @@
# Kagi's AI Integration Philosophy

Generative AI is a hot topic, but the technology still has flaws. Critics of AI go as far to [say](https://www.nytimes.com/2023/03/08/opinion/noam-chomsky-chatgpt-ai.html) that "*[AI] will degrade our science and debase our ethics by incorporating into our technology a fundamentally flawed conception of language and knowledge*".
Generative AI is a hot topic, but the technology still has flaws. Critics of AI go as far as to [say](https://www.nytimes.com/2023/03/08/opinion/noam-chomsky-chatgpt-ai.html) that "*[AI] will degrade our science and debase our ethics by incorporating into our technology a fundamentally flawed conception of language and knowledge*."

From an information retrieval point of view, relevant to our context of a search engine, we should acknowledge the two main limitations of the current generation of AI.

1. Large language models (LLMs) should not be blindly trusted to provide factual information accurately. They have a significant risk of generating incorrect information or fabricating details (confabulating). This can easily mislead people who are not approaching LLMs pragmatically. (*This is a product of auto-regressive nature of these models where the output is predicted one token at a time, and once it strays away from the "correct" path, for which the probablity grows exponentially with the length of the output, it is "doomed" to the end of output, without the ability to plan ahead or correct itself*).
1. Large language models (LLMs) should not be blindly trusted to provide factual information accurately. They have a significant risk of generating incorrect information or fabricating details (confabulating). This can easily mislead people who are not approaching LLMs pragmatically. (*This is a product of the auto-regressive nature of these models where the output is predicted one token at a time, and once it strays away from the "correct" path, for which the probability grows exponentially with the length of the output, it is "doomed" to the end of output, without the ability to plan ahead or correct itself*).

2. LLMs are not intelligent in the human sense. They have no understanding of the actual physical world. They do not have their own genuine opinions, emotions, or sense of self. We must avoid attributing human-like qualities to these systems or thinking of them as having human-level abilities. They are limited AI technologies. (*In a way, they are similar to how a wheel can get us from point A to point B, sometimes much more efficiently than human body can, but it lacks the ability to plan and the agility of human body to get us everywhere a human body can*)
2. LLMs are not intelligent in the human sense. They have no understanding of the actual physical world. They do not have their own genuine opinions, emotions, or sense of self. We must avoid attributing human-like qualities to these systems or thinking of them as having human-level abilities. They are limited AI technologies. (*In a way, they are similar to how a wheel can get us from point A to point B, sometimes much more efficiently than the human body can, but it lacks the ability to plan and the agility of the human body to get us everywhere a human body can*).

These limitations required us to pause and reflect on the impact on search experience, before incorporating this new technology for our customers. As a result, we came up with an AI integration philosophy that is guided by these principles:
These limitations require us to pause and reflect on the impact on the search experience before incorporating this new technology for our customers. As a result, we came up with an AI integration philosophy that is guided by these principles:

1. **AI should be used in closed, defined context relevant to search** (don't make a therapist inside the search engine, for example)
2. **AI should be used to enhance the search experience, not to create it or replace it** (meaning AI is opt-in and on-demand, similar to how we use JavaScript in Kagi, where search still works perfectly fine when JS is disabled in the browser)
3. **AI should be used to the extent that it enhances our humanity, not diminish it** (AI should be used to support users, not replace them)
1. **AI should be used in a closed, defined context relevant to search** (don't make a therapist inside the search engine, for example).
2. **AI should be used to enhance the search experience, not to create it or replace it** (meaning AI is opt-in and on-demand, similar to how we use JavaScript in Kagi, where search still works perfectly fine when JS is disabled in the browser).
3. **AI should be used to the extent that it enhances our humanity, not diminish it** (AI should be used to support users, not replace them).

While it's important to use AI tools responsibly and not overly rely on them, the design of these tools can sometimes make it difficult.

Expand Down
12 changes: 6 additions & 6 deletions docs/kagi/why-kagi/kagi-vs-competition.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,7 +30,7 @@ We’re obsessed with increasing speed and lowering latency, and we currently us

First, we optimized our technology stack to increase code execution speed and decrease connection latency.

Second, we reduced data transfer between Kagi and the browser, in some cases as much as 20x less compared to some of our competitors! This reduction has the neat side-effect of reducing CO<sub>2</sub> emissions. Using Kagi Search will benefit the environment as well as you!
Second, we reduced data transfer between Kagi and the browser, in some cases by as much as 20x less compared to some of our competitors! This reduction has the neat side effect of reducing CO<sub>2</sub> emissions. Using Kagi Search will benefit the environment as well as you!

| Product | SERP Size | CO<sub>2</sub> | Load Time |
| --- | --- | --- | --- |
Expand All @@ -51,22 +51,22 @@ Instead of trying to create a search product for billions of people, we want to

Kagi will allow you to discover well-written articles from lesser-known blogs and use features like [Lenses](../features/lenses.md) and [Personalized Results](../features/website-info-personalized-results.md#personalized_results). Ad-supported search must avoid this kind of depth and flexibility to stay profitable. Kagi has unique features, many of which can never be replicated in an ad-supported search engine.

We do not see Kagi as the Google killer. Google's scale and reach are enormous. Google also serves a purpose in the world —it did help enable our modern society to exist, with all its marvels and flaws. Heck, it even enables Kagi to exist!
We do not see Kagi as the Google killer. Google's scale and reach are enormous. Google also serves a purpose in the world — it did help enable our modern society to exist, with all its marvels and flaws. Heck, it even enables Kagi to exist!

Think of Kagi as a small, premium brand, providing a very different, tailor-made search experience for people who need and appreciate that.

## Kagi vs. DuckDuckGo

DuckDuckGo has shown the world that a privacy-first search engine is possible, and we respect this contribution. But its innovation has slowed in the past decade. And, ad-supported business models will always force a company to make compromises and balance between serving users and advertisers. In the end, DuckDuckGo's search product is just good enough (by our standard, sorry, DuckDuckGo!) and has been stagnant for years without any ground-breaking feature development.
DuckDuckGo has shown the world that a privacy-first search engine is possible, and we respect this contribution. However, its innovation has slowed in the past decade. Additionally, ad-supported business models will always force a company to make compromises and balance between serving users and advertisers. In the end, DuckDuckGo's search product is just "good enough" (by our standard, sorry, DuckDuckGo!) and has been stagnant for years without any groundbreaking feature development.

In contrast, Kagi search does not need to compromise on user experience. Everything we do is user-centric. Kagi Search already has many unique features, like [Lenses](../features/lenses.md) and [Personalized Results](../features/website-info-personalized-results.md#personalized_results). And because we depend only on our users for revenue, Kagi can and will always offer a much richer search experience for the user.
In contrast, Kagi search does not need to compromise on user experience. Everything we do is user-centric. Kagi Search already has many unique features, like [Lenses](../features/lenses.md) and [Personalized Results](../features/website-info-personalized-results.md#personalized_results). Because we depend only on our users for revenue, Kagi can and will always offer a much richer search experience for the user.

## Kagi vs. Brave Search

We appreciate that Brave is making a free search product and that it cares about user privacy. That said, we believe that Kagi Search is a better solution.

Brave Search is pursuing an ad-based model where users can pay to opt-out of ads. This means that the product direction of Brave Search will be greatly influenced by the needs of advertisers. Kagi Search does not accept advertising and our product direction is guided only by the needs of users.
Brave Search is pursuing an ad-based model where users can pay to opt out of ads. This means that the product direction of Brave Search will be greatly influenced by the needs of advertisers. Kagi Search does not accept advertising, and our product direction is guided only by the needs of users.

Brave Search also largely uses its own search index for results. Having a single-index source may be limiting.

Kagi Search includes anonymized requests to traditional search indexes including Brave, as well our own non-commercial index (Teclis), news index (TinyGem), and an AI for instant answers. Teclis and TinyGem are a result of our crawl through millions of domains, focusing primarily on non-commercial, high-quality content. More about Kagi's [search sources](../search-details/search-sources.md).
Kagi Search includes anonymized requests to traditional search indexes, including Brave, as well as our own non-commercial index (Teclis), news index (TinyGem), and an AI for instant answers. Teclis and TinyGem are a result of our crawl through millions of domains, focusing primarily on non-commercial, high-quality content. More about Kagi's [search sources](../search-details/search-sources.md).
2 changes: 1 addition & 1 deletion docs/kagi/why-kagi/kagi-vs-google.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,6 @@ Instead of trying to create a search product for billions of people, we want to

Kagi will allow you to discover well-written articles from lesser-known blogs and use features like [Lenses](../features/lenses.md) and [Personalized Results](../features/website-info-personalized-results.md#personalized_results). Ad-supported search must avoid this kind of depth and flexibility to stay profitable. Kagi has unique features, many of which can never be replicated in an ad-supported search engine.

We do not see Kagi as the Google killer. Google's scale and reach are enormous. Google also serves a purpose in the world —it did help enable our modern society to exist, with all its marvels and flaws. Heck, it even enables Kagi to exist!
We do not see Kagi as the Google killer. Google's scale and reach are enormous. Google also serves a purpose in the world — it did help enable our modern society to exist, with all its marvels and flaws. Heck, it even enables Kagi to exist!

Think of Kagi as a small, premium brand, providing a very different, tailor-made search experience for people who need and appreciate that.
Loading
Loading