Rybbit - Open source Google Analytics replacement

Goldflag@lemmy.world · 3 months ago

Rybbit - Open source Google Analytics replacement

Goldflag@lemmy.world · 3 months ago

A few more screenshots in case you don’t want to leave the site

MakingWork@lemmy.ca · 3 months ago

I have no idea how to use this, but this is amazing!

Helix 🧬@feddit.org · 3 months ago

You’re awesome. Thanks!

Lung@lemmy.world · 3 months ago

Wow holy crap, great work - the world badly needs this. Im assuming the mechanism is the same, you inject a js script into your site. I’m also very interested in pure server side solutions for analytics, but they can’t hit all the features you did in a generic way afaik

Goldflag@lemmy.world · 3 months ago

Yea, we use a client-side script like almost everyone else. The major difference is that we don’t use cookies so you can avoid a lot of the cookie banner/GDPR nonsense.

Rybbit definitely isn’t the first open source cookieless web analytics platform (Plausible and Umami are the two other big ones), but it’s probably the most “all-in-one” of all these alternatives.

criss_cross@lemmy.world · 3 months ago

Do you use fingerprinting instead? Or what’s the mechanism you use?

x00z@lemmy.world · edit-2 3 months ago

GoAccess uses your server side access log.

mesa@piefed.social · 3 months ago

I would love for this to work on yunohost.

artyom@piefed.social · 3 months ago

https://apps.yunohost.org/wishlist

Otter@lemmy.ca · edit-2 3 months ago

You mentioned being frustrated at Plausible. What did you not like about it?

I haven’t tried Plausible, but it seemed popular

Goldflag@lemmy.world · 3 months ago

it didn’t have enough features, especially since the community version is heavily nerfed (it’s missing even funnels)

osprior@lemmy.world · 3 months ago

Question is the self-hosted version less featured than the paid hosted version?

This looks amazing btw.

Goldflag@lemmy.world · 3 months ago

Only very slightly so. One of the reasons I created Rybbit is because platforms like plausible and fathom have much inferior self-hosted versions (very limited featureset and basically never updated). We have a comparison here

spacelord@sh.itjust.works · edit-2 3 months ago

@Goldflag

I appreciate the intent behind Rybbit, but I have to respectfully disagree with the “only very slightly so” characterization. Looking at your official comparison table, the self-hosted version is missing:

Pages View
Web Vitals
Email reports
Google Search Console integration
VPN/Crawler/ASN tracking
Google/GitHub OAuth
Email support

That’s 7 significant features—which seems more than “very slightly” different.

More importantly, this raises AGPL compliance questions. Under AGPLv3 Section 13, if users interact with modified AGPL software over a network (your cloud version), you’re required to make the complete corresponding source code available to those users. If these cloud-only features are integrated into the same AGPL-licensed codebase, withholding them from the public repo while running them as a network service appears to conflict with the license terms.

There are really only two compliant scenarios here:

These features exist in the public repo but are just marketed as “cloud-only” (in which case the comparison table’s misleading)
These features are truly separate proprietary code that interfaces with Rybbit without being part of the AGPL-licensed work (which would require careful architectural separation)

If it’s neither—if these are AGPL-covered features running in your cloud service but withheld from the repo—that’s exactly the “loophole” the AGPL was designed to close. The irony is that you criticized Plausible and Fathom for having “much inferior self-hosted versions,” yet this appears to be a similar approach.

Could you clarify the licensing status of these cloud-only features? Are they in the public repo but disabled by default, or are they proprietary additions that don’t derive from the AGPL codebase?

ripcord@lemmy.world · 3 months ago

Thank you for your service.

ITGuyLevi@programming.dev · 3 months ago

OAuth is one thing I hate to see locked behind a paywall; it’s one thing for the pretty, management-geared stuff (dashboards and charts) to be a paid feature, but not security.

custard_swollower@lemmy.world · 3 months ago

AGPL means they are licensing it to you, they are not bound by the license because they are the copyright owners.

spacelord@sh.itjust.works · 3 months ago

That’s misleading. While copyright owners aren’t bound by their own license, AGPL Section 13 requires that when they run AGPL software as a network service, they must make the complete source available to users.

The AGPL was specifically designed to close the “SaaS loophole.” Being the copyright owner doesn’t exempt you from AGPL’s network service requirements if you’re distributing under that license.

GameGod@lemmy.ca · 3 months ago

This is flat out wrong. If you’re the copyright owner, you’re not licensing the code to yourself. The AGPL is the license under which they’re making the open source version available to YOU. The version they run themselves is proprietary.

spacelord@sh.itjust.works · 3 months ago

That’s inacurrate. Licensing representation matters. If the cloud service is genuinely presented as AGPL-licensed, Section 13 obligations apply regardless of copyright ownership. However, copyright owners remain free to maintain truly separate proprietary versions under dual licensing.

GameGod@lemmy.ca · edit-2 3 months ago

Licensing representation matters

It doesn’t, because they’re the copyright owners. Think of their software as dual licensed: They run it themselves under a proprietary license, under which they reserve all rights. That has nothing to do with the AGPL version that they license to you. The AGPL doesn’t take away the rights they have as copyright owners, nor does it preclude dual licensing.

(Are you a bot? Your reply is written like ChatGPT, and it has that self-defeating logic that ChatGPT has sometimes… eg. you wrote that you disagree with me, but then parroted the exact thing that I said.)

Goldflag@lemmy.world · 3 months ago

Everything is in the repo and cloud features are just toggled off in the self-hosted build.

spacelord@sh.itjust.works · 3 months ago

@Goldflag,

Thanks for clarifying! Good to hear everything’s in the repo and that it’s truly AGPL compliant.

Since as self-hosters we already carry the burden of maintenance, updates, security, and infrastructure costs that cloud users don’t, would you consider documenting how to enable the cloud features in self-hosted setups?

I see the docs cover basic environment variables, but not for Pages View, Web Vitals, or VPN/ASN tracking. Even if some features need extra config (SMTP, OAuth creds), having that documented would help those of us willing to do the work.

That would truly differentiate Rybbit from Plausible/Fathom—not just code parity, but empowering self-hosters with full feature access.

spacelord@sh.itjust.works · 3 months ago

Cloud features aren’t just a toggle away for self-hosters because setting CLOUD=true enables unwanted restrictions like Stripe billing endpoints and event limits—it was intended as a binary switch for enabling the entire cloud infrastructure.

Rybbit wasn’t architected so self-hosters could modularly enable advanced features. To solve this, I’ve forked the project and made all enterprise features modular and enabled by default, so people can test them.

Of course, it would be desirable that @Goldflag implements this himself because I’m sure he could do it more elegantly and maintain it properly as part of the official project going forward.

osprior@lemmy.world · 3 months ago

That’s excellent and very clear, thank you for the explanation.

EarMaster@lemmy.world · 3 months ago

The free self hosted version is heavily limited. I will stick with Plausible which may be simpler but also doesn’t want to push me into a subscription.

spacelord@sh.itjust.works · 3 months ago

Plausible is at least honest about it being an “open core” business model. They openly document what’s missing and why (funding sustainability).

The premium features are likely not in the AGPL repo at all - they’re genuinely separate proprietary code. They seem to be AGPL compliant because they’re transparent that community edition is a subset, not claiming “everything’s there, just toggled off”.

parmesancrabs@sh.itjust.works · 3 months ago

Aways a fan of alternate options, this looks quite tidy! I had a few thoughts / queries. Not at my system right now but I will test it out later.

I noticed in the screenshots you have a “users” page - but with a cookieless tracking system I would have assumed it wouldn’t be reliable to identify a long term user past individual sessions? Are you doing some hefty finger printing?

Looking at your features table has a few statements that might need adjusting. Such as GA4’s segmentation sequencing / filtering can be quite complex, I’d argue its not limited and potentially more advanced than Rybbit (not tested yet). It also has a user exploration feature.

Do you have any plans for a drag and drop style report creation, so that I could create reports with any dimensions / metrics and filter accordingly? I think that would bring a lot of flexibility to the platform for an individuals bespoke needs.

EarMaster@lemmy.world · 3 months ago

Just a word of warning for everyone: The free self hosted version is heavily limited. I will stick with Plausible which may be simpler but also doesn’t want to push me into a subscription.

Goldflag@lemmy.world · 3 months ago

as opposed to plausible community edition which is even more limited and is only updated a couple times a year?

EarMaster@lemmy.world · 3 months ago

The community edition allows me to have multiple sites, multiple users and is way easier to set up. If I ever need additional features like funnels I would need a subscription for both - Plausible is less expensive.

baatliwala@lemmy.world · 3 months ago

This looks… Great? Nice work

Vex_Detrause@lemmy.ca · 3 months ago

Is it “reebit”? OR “Raybit” ?

Goldflag@lemmy.world · 3 months ago

it’s “ribbit”

solrize@lemmy.ml · 3 months ago

Aren’t there already tons of these already? Piwik has been around for a quite a while, plus there are others mentioned in the comments.

quick_snail@feddit.nl · 3 months ago

Matomo*

StarkZarn@infosec.pub · 3 months ago

Glad to see you post this here. I’ve been experimenting with selfhosted analytics for a while now and have attempted your project here a couple times. The thing that kills me is the Clickhouse requirement. It makes it impossible to host on a lightweight VPS. Like why should my analytics platform require so much more compute than my simple static site? Am I missing something?

Goldflag@lemmy.world · 3 months ago

Clickhouse definitely takes a lot of resources! There’s unfortunately no way around that, though in my experience it runs fine on the cheapest Hetzner instances which are like $3-4 a month for 2GB of RAM. How lightweight is your VPS?

And yea, you don’t need clickhouse for a simple static site. I chose clickhouse because it Postgres or MySQL does not scale well since the main site I personally use Rybbit for sends around 20 million events a month.

It pains me to plug my competitors, but check out Umami or Goatcounter if you want a platform that uses postgres.

StarkZarn@infosec.pub · 3 months ago

Hey thanks so much for the engagement. I was trying to run it on a VPS that cost $35/year. 2GiB of RAM wasn’t quite enough to make it work for me, granted that was with the webserver and ancillary supporting services.

I’ll find an opportunity to test it out though, as rybbit looks great. I appreciate the mention on the other FOSS products, that’s a good look for you. I have plenty of experience with umami already. Cheers!

lIlIlIlIlIlIl@lemmy.world · 3 months ago

How would this compare to something like PostHog?

Goldflag@lemmy.world · 3 months ago

Posthog makes it almost impossible to actually self-host since they try to push you onto the cloud as much as possible. They say that the self-hosted version only works well up until 100k events … which is insane since their cloud free tier is 1 million events. It’s actually the reason why I built Rybbit. I tried to self-host posthog on my server but it ran it up to 100% CPU on 8 cores and didn’t even work.

Ok posthog rant done.

The other main difference is that Posthog has like 10+ different products all in one. Their web analytics is good, but it’s just kind of bland (imo) because it’s not their main focus.

Fmstrat@lemmy.world · 3 months ago

Is there any plans for a data migration feature from Plausible?

Goldflag@lemmy.world · 3 months ago

We have data migration plans in work but it doesn’t appear that a plausible migration is possible

rekabis@lemmy.ca · 3 months ago

OpenBSD does not have a docker engine. Can this be installed without docker?

quick_snail@feddit.nl · 3 months ago

What’s the advantages over awstats?

Goldflag@lemmy.world · 3 months ago

from what i know, awstats gets analytics from server-side logs while Rybbit uses a client side script. So not really and apples to apples comparison

danhab99@programming.dev · 3 months ago

The same advantages as all free and open source solution, it’s free and open source. That means how much it’s going to cost to your business is directly under your control. You can make a decision on how you acquire hardware based on your business’s needs. If you want to add or change features you can decide how to do that based on the deals you have with your programmers (like pick the developer you have with the best skills and the lowest cost), and then you get to control how much it costs you and how reliable the result is going to be.

If you feel like the support you get from customer service from Amazon or Google or Microsoft is reliable enough and you don’t need more reliability then go ahead and stick with paid products. But if you already have a team of really expensive and talented engineers you might as well let them solve problems with free and open source equipment.

Goldflag@lemmy.world · 3 months ago

I think he’s referring to https://en.wikipedia.org/wiki/AWStats

quick_snail@feddit.nl · 3 months ago

Yes. It predates aws lol