Hi all! I’m confused how Watchdog’s Replication Apdex works over time. For example, when I look at a large instance over the past 10 days I see days of an Apdex of about 55 for 6 of the 10 days on the left (2025-07-24 00:00:00 - now
):
However, when I zoom out to the Last 30 days
that 6 day lull is mostly gone, only to be represented by a dip to just above 80 for a short period. As well, the high of above 90 in the last day or so is gone as well, and it never gets above 88.
My concern is that if I look at the second graph I might think all is well, but a day or two so-so performance. If I look at the first graph I would think terrible things were happening for days and days.
Is there some averaging going on? I clicked on “Edit” on that graph but the calculation (see below and GH Deep link) isn’t very approachable:
(
(
sum(increase(cht_api_http_request_duration_seconds_bucket{instance=~"$cht_instance",route=~".*/get-ids",le="180",code=~"^2..$|^3..$"}[$__range]))
+ (
sum(increase(cht_api_http_request_duration_seconds_bucket{instance=~"$cht_instance",route=~".*/get-ids",le="360",code=~"^2..$|^3..$"}[$__range]))
- sum(increase(cht_api_http_request_duration_seconds_bucket{instance=~"$cht_instance",route=~".*/get-ids",le="180",code=~"^2..$|^3..$"}[$__range]))
) / 2
) / sum(increase(cht_api_http_request_duration_seconds_count{instance=~"$cht_instance",route=~".*/get-ids",code=~"^2..$|^3..$"}[$__range]))
) * 100