WIP-健康检查与监控梳理
背景
健康检查,是几乎任意一个运行的项目都需要做的。
- 需要健康检查的场景
- 中间件
- 业务系统
- 现有方案
- promethues
- datadog
- ping
- 云拨测
- 错误检测平台
- 其他
- 健康检查的环节
- 检查
- 记录
- 告警配置
- 数据统计
- 检查出错的反馈机制
- 通知
- 看板
- 自愈处理
- 自动化故障巡检机制
- 健康检查的自我检查
- 资源受限环境的使用
调研的一些项目
- https://github.com/sourcegraph/checkup
- https://github.com/alexliesenfeld/health (example 有更多)
- https://www.aiops.com/docs/ca/
- https://github.com/dimiro1/health
- https://github.com/Purple-Devs/health_check rails 参考
- https://github.com/andreas-schroeder/kafka-health-check kafka
- https://github.com/lausser/check_nwc_health
- https://github.com/InVisionApp/go-health
- https://github.com/etherlabsio/healthcheck
- https://github.com/kuberhealthy/kuberhealthy
- https://github.com/megaease/easeprobe (强大)
- https://github.com/labring/lvscare lvs and ipvc health check
- https://github.com/toricls/pingbot base on serverless
- https://github.com/hootsuite/healthchecks microservice ,可形成 https://github.com/hootsuite/health-checks-api
- https://github.com/Talento90/go-health (依赖项健康检查)
- https://github.com/grpc-ecosystem/grpc-health-probe grpc
- https://github.com/tevino/tcp-shaker
- https://github.com/AppsFlyer/go-sundheit
- https://github.com/revsys/django-health-check django 参考
- https://github.com/orf/gping ping with a graph in rust
- https://github.com/smartping/smartping 互 ping 网络
- https://github.com/keen/pingpong with graph
- https://github.com/laixintao/pingtop ping time like top
- https://github.com/aprilmadaha/pingmesh 大规模互 ping 网络
If you seek truth, you will not seek victory by dishonorable means, and if you find truth you will become invincible.
— Epictetus
本博客所有文章除特别声明外,均采用 CC BY-SA 4.0 协议 ,转载请注明出处!