![](https://lemmy.ml/pictrs/image/Xw8AxnAwFm.jpg)
![](https://lemmy.ml/pictrs/image/2QNz7bkA1V.png)
spend a lot of time lurking; hard not to have a look for your own name 😀
The alternative search engine that does what’s right, we believe that privacy is a human right and that a true alternative must have its own index 👉 https://www.mojeek.com/
spend a lot of time lurking; hard not to have a look for your own name 😀
yep, since 2004
Bin-go: https://www.searchenginemap.com/ same with many others
if you look at the repo they give thanks to:
“The commoncrawl organization for crawling the web and making the dataset readily available. Even though we have our own crawler now, commoncrawl has been a huge help in the early stages of development.”
There is nothing I can find which says how much of the index is CC and how much is their own; if there’s a decent amount of CC, this is originally for researchers etc. it’s not the best resource in the world for a search index: https://commoncrawl.org/
That being said, as an independent search engine, it’s always good to see people take on the massive task of actually building an index, not becoming a proxy.
we’re gettin’ censored here 🫠
thanks a lot for the mention, if you wanna use that fallback less and less, feeding back on algo updates via https://www.mojeek.com/eval always helps us
if you’re willing to help at all we’re always looking for feedback on specific results, and also have this page for testing staging algorithms, there’s a big change on there currently. No bother if not.
do you remember any of the queries for which you were not satisfied? We’re always looking to improve