Why the $5.2 billion sale of Russia's Yandex is significant

Mojeek@lemmy.ml · 1 month ago

do you remember any of the queries for which you were not satisfied? We’re always looking to improve

Mojeek@lemmy.ml · 2 months ago

spend a lot of time lurking; hard not to have a look for your own name 😀

Mojeek@lemmy.ml · 2 months ago

yep, since 2004

Mojeek@lemmy.ml · 5 months ago

Why the $5.2 billion sale of Russia's Yandex is significant

Mojeek@lemmy.ml · 5 months ago

Bin-go: https://www.searchenginemap.com/ same with many others

Mojeek@lemmy.ml · edit-2 5 months ago

if you look at the repo they give thanks to:

“The commoncrawl organization for crawling the web and making the dataset readily available. Even though we have our own crawler now, commoncrawl has been a huge help in the early stages of development.”

There is nothing I can find which says how much of the index is CC and how much is their own; if there’s a decent amount of CC, this is originally for researchers etc. it’s not the best resource in the world for a search index: https://commoncrawl.org/

That being said, as an independent search engine, it’s always good to see people take on the massive task of actually building an index, not becoming a proxy.

Mojeek@lemmy.ml · 5 months ago

we’re gettin’ censored here 🫠

Mojeek@lemmy.ml · 6 months ago

thanks a lot for the mention, if you wanna use that fallback less and less, feeding back on algo updates via https://www.mojeek.com/eval always helps us

Mojeek@lemmy.ml · 6 months ago

if you’re willing to help at all we’re always looking for feedback on specific results, and also have this page for testing staging algorithms, there’s a big change on there currently. No bother if not.