I am looking for a way to holistically evaluate the health of a Couch instance. I see that the
_stats and the
_system Couch endpoints give lots of interesting data as well as the CHT monitoring API. These would be most helpful if they were compiled into a longitudinal view so you can see changes over time.
My question is: can anyone share their general approach for polling and compiling this data? What software do you use? How often do you poll? Which endpoints do you collect data from? What values are most useful to watch?
(I think what I am dreaming of is the unfinished second half of these monitoring docs that gives the practical approach for what tech to use…)