Alerts and charts using Victoria Metrics #1633
Replace Netdata alerting with vmalert:
- add vmalert init script (vmalert.initd) to start/stop the vmalert service
- add vmalert UCI configuration file (vmalert.conf) with datasource settings
- add comprehensive alert rules
- update Makefile to install vmalert configuration and rules
- add detailed documentation of vmalert setup and metrics mapping
- support Mimir integration when configured via ns-plug
- add ns-plug-alert-proxy, which listens on 127.0.0.1:9095 and receives notifications from vmalert: the proxy checks whether an alert is firing or resolved, then translates selected alerts to the legacy portal format and forwards them to my.nethesis.it or my.nethserver.com
- if Mimir credentials are present in the ns-plug UCI config, the Mimir alertmanager endpoint is added as a second notifier alongside the proxy
- port the alert about non-encrypted backups to Victoria Metrics as well
- add telegraf-mwan Python script that reads /var/run/mwan3/iface_state/ to collect WAN interface connectivity state
- add telegraf-services Python script that queries ubus to collect the running state of all procd-managed services. Outputs JSON for

Assisted-by: Copilot:Sonnet4.6
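The firing/resolved check that the proxy performs could look roughly like the sketch below. It is not taken from ns-plug-alert-proxy: it assumes vmalert posts alerts in the Alertmanager-style shape (a JSON list of objects carrying `labels`, `startsAt` and `endsAt`), and that an alert counts as resolved once its `endsAt` is no later than the current time. The names `parse_ts` and `classify` are hypothetical.

```python
import re
from datetime import datetime, timezone

# RFC 3339 timestamp; the fractional part (possibly nanoseconds) is ignored.
_TS = re.compile(
    r"^(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2})(\.\d+)?(Z|[+-]\d{2}:\d{2})$"
)


def parse_ts(value):
    """Return an aware datetime, or None if the value is missing or malformed."""
    match = _TS.match(value or "")
    if not match:
        return None
    base, _fraction, tz = match.groups()
    if tz == "Z":
        tz = "+00:00"
    return datetime.fromisoformat(base + tz)


def classify(alert, now=None):
    """Classify one alert dict as 'firing' or 'resolved' (assumed semantics)."""
    now = now or datetime.now(timezone.utc)
    ends_at = parse_ts(alert.get("endsAt"))
    # A missing or zero-valued endsAt is treated as "still firing".
    if ends_at is None or ends_at.year == 1:
        return "firing"
    return "resolved" if ends_at <= now else "firing"


if __name__ == "__main__":
    sample = {"labels": {"alertname": "example"}, "endsAt": "2024-01-01T12:05:00Z"}
    print(classify(sample, datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)))
```

Passing `now` explicitly keeps the function easy to test; a real proxy would call it once per alert in the received list before deciding which ones to translate and forward.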
Changes:
- migrate ping monitoring from Netdata's fping plugin to Telegraf's native ping input plugin
- expose metrics to the UI

The ping plugin uses the native method (method="native"), which sends ICMP packets directly without an external ping command and requires the CAP_NET_RAW capability or root privileges. Metrics are tagged with influxdb_db="ping-metrics" for proper InfluxDB database routing.

Assisted-by: Copilot:Sonnet4.6
These plugins are required to replace all Netdata features
Netdata has been replaced by Victoria Metrics.
Tbaile left a comment
I really don't like that we use vm_query to bridge requests over to the API and format the data. Later on we could let the UI send its requests directly to the Victoria Metrics instance. That will need an edit to the ns-api-server, which I'm not that fond of. Hold off on the merge for the moment: I'm going to check for issues on the UI and then merge everything together.
Check my latest commits for a few changes. Nothing big, but another pair of eyes never hurts.
Me neither, but I think it is good enough for most cases. Let's keep it as is; we can improve the implementation if it's too heavy on real machines.
It's good for me. Please go ahead with the merge when the UI part is good for you (it's a bit ugly right now, but it works, and for me we can merge as is).
This pull request migrates network monitoring and reporting functionality from Netdata to Telegraf and Victoria Metrics.
It raises alerts using Mimir as the alertmanager for Victoria Metrics.
It keeps Netdata with its own alerting for backward compatibility.
Main changes:
Replaces #1601
Reference: #1638