Even worse, Reddit itself has been getting infected with corporate AI-generated “recommendations”
Even worse, Reddit
Fair point
At least the older posts seem to be okay.
And its inaccessible to a lot of people now anyway.
That’s why I use “site:reddit.com” instead of just adding “reddit”
Don’t worry, I’m sure google will disable that soon in the same way they disable all the other search syntax that used to make searching a simple and easy task
“Search engine” is not equivalent to “Google”.
Cool, pedant. Addend “on google” to my comment then if you need, since that’s clearly the context we’re talking about here. I’m aware there are other search engines, but context should have made what I was talking about pretty fucking obvious.
(Not OP) Point taken, but in that case the solution should also be obvious. Just use a different one that does provide that. If the product sucks, hit the bricks. DDG and Kagi are looking for market share, they’d love to have you.
I do use alternatives, but I mention Google because it’s what’s relevant to the conversation at hand.
Advanced search techniques should be a class in 6th grade
Or better yet, try my filter… It’s “-site:reddit.com”!
Here’s a tip:
site:reddit.com
Makes me sad to think that this will soon be about as useful as “site:facebook.com” with the way Reddit is going.
Yeah maybe giving corpo trash exclusivity over the sum total of human knowledge wasnt the best idea?
Or do this:
-site:reddit.com
Do you think it will ever be possible to do that for all the Lemmy instances?
Kagi.com has a lens for the fediverse. A lens is basically a scope within which performing the search.
Nah. The best option we have imo is a service that indexes everything on one site so traditional search engines can find it. That requires someone to build it, and AFAIK that’s hasn’t happened.
It already works pretty well if you just add Lemmy to the search.
Lemmy’s built-in search barely works as it is, so unless some drastic changes happen it’s resounding no.
Web search engines don’t rely on sites’ built-in search features.
This is how we found anything on reddit for most of its useful life. Its search was always garbage so we relied on Google to come up with usable results.
It’s miles better than reddit’s search has ever been.
Okay but reddit is also becoming inaccessible; how to migrate this data?
I look forward to Google being forced to down rank any sites with “reddit” in the H1.
Google being forced to
What an odd phrase
I’ve spent a lot of time working in SEO.
Search results like this can drive people away from Google and toward other resources. Google likes money, and this is why they usually try to combat spammers that are gaming the system.
It’s a cat and mouse game that has been happening for years. Organic search spammers find a new thing, then Google tweaks the algorithm to downrank what they’re exploiting.
then Google tweaks the algorithm
Well you don’t have to read Cory’s newest column to understand that Google hasn’t been doing that, because they don’t have to. They do not care, at least not yet, because they have arguably become too big to care.
Well google does a horrible job at combating it
No doubt. That said, they do update the algo to combat this stuff. If you work in SEO you’re likely quite aware of what tricks currently work and no longer work.
As useful as Mozilla/5.0; AppleWebKit/537.36 (KHTML, like Gecko) Safari/537.3
What browser agent is that?
Trick is I took out the actually useful parts like Chrome, Firefox, Edge, etc. And the OS. All the agents these days have AppleWebKit and Mozilla just so old websites that look for it don’t downgrade the experience.
Yeah, make your user agent absolutely unique. Too much entropy will surely confuse the shit out server side HTTP Header tracking. 😬
Oh gee, I wasn’t aware there was more to it than the UA. Thanks for opening my eyes.
Edit: I checked your link, most of the parameters on the test require client side execution. That (client side tracking) is absolutely unrelated to what (server side tracking) I was talking about, and is something you can control (by not allowing JavaScript, for example). Please do not confuse the two. There is literally nothing you can do against server side tracking.
Yeah this isn’t my UA but I’m just saying these parts are what’s considered the supported featureset rather than information about what software the device is running.
Yes, I get that point, but I also think that it’s tempting for the privacy-minded novice to think “the less information I provide, the better!”, while in actuality, it is better to provide “more” information: the most common UA, even if it means lying about your featureset. In this case, truly, more is less.
Firefox doesn’t pretend to use AppleWebKit. It’s actually the only one which identifies itself correctly… mostly, at least:
Mozilla/5.0 (X11; Linux x86_64; rv:122.0) Gecko/20100101 Firefox/122.0
While about:support says “Window Protocol: wayland”. But that’s ok websites shouldn’t care anyway.
It’s other browsers who send things like “like Gecko” to sneak past old browser-detection code.
firefox on iphone i would guess.
Probably Netscape
There’s something very Darwinian, very artificial selection about this.
Google also sneak “reddit” into the “People also ask” section.
site:reddit.com
Am I the only one that wants to know more about this Japanese toaster you can fuck?
Surely I’m not alone here.