The very first stable version of lemmy-synapse ๐
What is lemmy-synapse
A humble bundle of observability and monitoring for your Lemmy cluster. Read more ...
What's included
Docker stats dashboard
...
lemmy-synapse is a light-weight observability and monitoring stack for Lemmy servers.
Using Prometheus and Grafana, it allows the admins to visualise and query the stats of their instance. v1.0.0 comes out of the box with 3 detailed dashboards:
It runs as Docker compose cluster alongside the Lemmy cluster and does not require any changes to it in most cases. Uninstalling lemmy-synapse is as easy as tearing down its cluster and deleting its installation directory.
It is no secret that I am not a fan of logs; Iโve baited (rapala in work lingo. Rapala is a Finnish brand of fishing lure, and used to mean baiting in this context) discussion in our work chat with things like:
If youโre writing log statements, youโre doing it wrong.
This is a pretty incendiary stat...
TLDR; The author argues that free-form logging is quite useless/expensive to use. They also argue that structured logging is less effective than tracing b/c of mainly the difficulty of inferring timelines and causality.
I find the arguments very plausible.
In fact I very rarely use logs produced by several services b/c most of the times they just confuse me. The only time that I heavily use logs is troubleshooting a single service and looking at its stdout (or kubectl log.)
However I have very little experience w/ tracing (I've used it in my hobby projects but, obviously, they never represent the reality of complex distributed systems.)
Have you got real world experience w/ tracing in larger systems? Care to share your take on the topic?
Turned out I didn't need to convert any series to gauges at all!
The problem was that I had botched my Prometheus configuration and it wasn't ingesting the probe results properly ๐คฆโโ๏ธ Once I fixed that, I got all the details I needed.
I'm using blackbox_exporter to monitor a dozen of websites' performance. And that is working just fine for measuring RTT and error rates.
I'm thinking about creating a single gauge for each website indicating whether it is up or down.
I haven't been able to find any convincing resource as to if it is mathematically correct to convert such series to guages/counters - let alone how to do that.
So my questions are
Have I missed a relevant option in blackbox_exporter configurations?