天天看點

SpringBoot+Actuator+Prometheus+Consul+Grafana監控系列(二)

上一篇搭建起了一個簡單的健康名額檢查.這一節繼續整合;

項目位址在下方

所需工具下載下傳

搭建監控平台所需要的工具:

grafana: 監控資料的視圖展示;

官網下載下傳位址: https://grafana.com/

prometheus: 監控資料采集;

官網下載下傳位址: https://prometheus.io/download/#prometheus

node_exporter : 資料導出器

官網下載下傳位址: https://prometheus.io/download/#node_exporter

consul: 服務發現

官網下載下傳位址: https://www.consul.io/

伺服器搭建監控平台

安裝解除安裝腳本編寫

為了友善環境遷移或者他人用起來友善,這裡我做成一鍵安裝部署, 一鍵啟動,一鍵解除安裝;

将下載下傳好的工具上傳至伺服器指定目錄.在這裡我的目錄是/data/monitor,便于管理;

目錄下有 install目錄, exporter-install兩個目錄;

1: 将grafana,prometheus, consul安裝包上傳至/install 目錄下, 在此處寫安裝腳本;
腳本名: install-monitor.sh
#!/bin/bash

installdir="/data/monitor"
shpath=$0
toolsdir=${shpath%/*}
echo "$toolsdir"
cd $toolsdir
echo "======================unpackaging monitor app to $installdir/app ========================="
mkdir -p $installdir/app
rm -r -f $installdir/app/*
mkdir -p $installdir/app/grafana
mkdir -p $installdir/app/prometheus
mkdir -p $installdir/app/consul

tar -zxvf grafana-6.1.3.linux-amd64.tar.gz -C $installdir/app/grafana
mv $installdir/app/grafana/grafana*/*   $installdir/app/grafana
rmdir $installdir/app/grafana/grafana*

tar -zxvf prometheus-2.8.1.linux-amd64.tar.gz -C $installdir/app/prometheus
mv $installdir/app/prometheus/prometheus*/* $installdir/app/prometheus
rmdir $installdir/app/prometheus/prometheus-*

unzip consul_1.4.4_linux_amd64.zip -d $installdir/app/consul

mkdir -p $installdir/data
rm -r -f $installdir/data/*
mkdir -p $installdir/data/consul
mkdir -p $installdir/data/prometheus
mkdir -p $installdir/data/grafana
mkdir -p $installdir/cfg
rm -r -f $installdir/cfg/*
mkdir -p $installdir/cfg/consul
mkdir -p $installdir/cfg/prometheus
mkdir -p $installdir/cfg/grafana

mkdir -p $installdir/log
mkdir -p $installdir/bin
echo "==============================unpackage monitor app success  ============================="

2: 順道把解除安裝腳本也編寫,解除安裝之前需要先一鍵停止, 啟動停止腳本下面會編寫;
腳本名: uninstall-monitor.sh
#!/bin/bash

installdir="/data/monitor"
shpath=$0
toolsdir=${shpath%/*}
echo "$toolsdir"
cd $toolsdir

echo "==========================uninstall app to $installdir/app=============================="

rm -r -f $installdir/app
rm -r -f $installdir/cfg
rm -r -f $installdir/data
rm -r -f $installdir/log

echo "==========================uninstall app success=============================="

3: 退回上一次目錄,在exporter-install目錄下編寫node-exporter安裝與解除安裝腳本:
腳本名: install-node-exporter.sh
#!/bin/bash

installdir="/data/monitor"
shpath=$0
toolsdir=${shpath%/*}
echo "$toolsdir"
cd $toolsdir
echo "======================unpackaging monitor exporter to $installdir/exporter ========================="
mkdir -p $installdir/exporter
mkdir -p $installdir/exporter/node_exporter
rm -r -f $installdir/exporter/node_exporter/*
mkdir -p $installdir/log
mkdir -p $installdir/bin

tar -zxvf node_exporter-0.17.0.linux-amd64.tar.gz -C $installdir/exporter/node_exporter
mv $installdir/exporter/node_exporter/node_exporter*/*   $installdir/exporter/node_exporter
rmdir $installdir/exporter/node-exporter/node_exporter-*


echo "==============================unpackage monitor exporter success  ============================="
4: exporter 解除安裝腳本:
腳本名: uninstall-node-exporter.sh
#!/bin/bash

installdir="/data/monitor"
shpath=$0
toolsdir=${shpath%/*}
echo "$toolsdir"
cd $toolsdir
echo "======================uninstall monitor exporter to $installdir/exporter ========================="

rm -r -f $installdir/exporter/node_exporter

echo "==============================unpackage monitor exporter success  ============================="
           

配置檔案

将安裝包中的配置檔案各複制一份放于data/monitor/cfg/下對應的目錄中, 保持原始配置檔案不變;

1: grafana配置檔案,這裡主要修改host,預設3000端口,這裡就先不修改了;需要修改的可以參考:
https://grafana.com/docs/installation/configuration/
2: prometheus 配置檔案,配置需要監控的任務: 參考:https://prometheus.io/docs/prometheus/latest/configuration/configuration/
核心配置代碼:
scrape_configs:
    - job_name: 'prometheus'
      static_configs:
          - targets: ['localhost:7002']
   # 監控伺服器的名額
    - job_name: 'node_server'
      static_configs:
          - targets: ['localhost:9100']
    # 通過consul 注冊中心擷取拉取位址
    - job_name: 'metrics'
      metrics_path: /monitor/actuator/prometheus
      consul_sd_configs:
          - server: localhost:7001
            tag: metrics
      relabel_configs:
          - source_labels: ["__meta_consul_service"]
            regex: "(.*)"
            replacement: $1
            action: replace
            target_label: "service"
3: consul 配置檔案: 這裡配置主要部分,對于key的描述,可以參考官網:
https://www.consul.io/docs/agent/options.html

{
  "bootstrap_expect" : 1,
  "data_dir" : "/data/monitor/data/consul",
  "pid_file" : "/data/monitor/data/consul/consul.pid",
  "node_name" : "agent-one",
  "bind_addr" : "127.0.0.1",
  "client_addr" : "0.0.0.0",
  "ports" : {
    "http" : 7001
  },
  "ui" : true
}
           

到此為止,配置檔案編寫完成;

一鍵啟動,停止腳本

1: grafana 啟動腳本
#!/bin/bash

monitorpath="/data/monitor"
cd ${monitorpath}/app/grafana
echo "start grafana at : `date` " >${monitorpath}/log/grafana_runtime.log
nohup ${monitorpath}/app/grafana/bin/grafana-server --config ${monitorpath}/cfg/grafana/custom.ini > ${monitorpath}/log/grafana.log 2>&1 &
echo $! > ${monitorpath}/log/grafana.pid
2:grafana 停止腳本
#!/bin/bash

echo "stop program at : `date` " > /data/monitor/log/grafana_runtime.log
if [ -f "/data/monitor/log/grafana.pid" ]; then
    kill -9 `cat /data/monitor/log/grafana.pid`
    rm /data/monitor/log/grafana.pid
else
    echo "file grafana.pid not exist"
fi

3: prometheus啟動腳本
#!/bin/bash
monitorpath="/data/monitor"
cd ${monitorpath}/app/prometheus

echo "start prometheus at : `date` " > ${monitorpath}/log/prometheus_runtime.log
nohup ${monitorpath}/app/prometheus/prometheus --web.listen-address=:7002 \
        --config.file=${monitorpath}/cfg/prometheus/prometheus.yml \
        --web.read-timeout=5m \
        --web.enable-admin-api \
        --web.max-connection=10 \
        --query.timeout=2m \
        --query.max-concurreny=20 \
        --storage.tsdb.path=${monitorpath}/data/prometheus/ > ${monitorpath}/log/prometheus.log 2>&1 &
echo $! > ${monitorpath}/log/prometheus.pid
4: prometheus停止腳本
#!/bin/bash
echo "stop prometheus at : `date` " > /data/monitor/log/prometheus_runtime.log
if [ -f "/data/monitor/log/prometheus.pid" ]; then
    kill -9 `cat /data/monitor/log/prometheus.pid`
    rm /data/monitor/log/prometheus.pid
else
    echo "file prometheus.pid not exist"
fi
5: node_exporter 啟動腳本
#!/bin/bash
monitorpath="/data/monitor"
cd ${monitorpath}/exporter/node_exporter
echo "start node_exporter at : `date` " > ${monitorpath}/log/node_exporter_runtime.log
nohup ${monitorpath}/exporter/node_exporter/node_exporter > ${monitorpath}/log/node_exporter.log 2>&1 &
echo $! > ${monitorpath}/log/node_exporter.pid
6: node_exporter 停止腳本
#!/bin/bash
echo "stop node_exporter at : `date` " > /data/monitor/log/node_exporter_runtime.log
if [ -f "/data/monitor/log/node_exporter.pid" ]; then
    kill -9 `cat /data/monitor/log/node_exporter.pid`
    rm /data/monitor/log/node_exporter.pid
else
    echo "file node_exporter.pid not exist"
fi
7: consul 啟動腳本
#!/bin/bash

monitorpath="/data/monitor"
cd ${monitorpath}/app/consul

echo "start consul at : `date` " > ${monitorpath}/log/consul_runtime.log
nohup ${monitorpath}/app/consul/consul agent -server -config-dir="${monitorpath}/cfg/consul" > ${monitorpath}/log/consul.log 2>&1 &
echo $! > ${monitorpath}/log/consul.pid
8: consul停止腳本
#!/bin/bash

echo "stop consul at: `date` " > /data/monitor/log/consul_runtime.log
if [ -f "/data/monitor/log/consul.pid" ]; then
    kill -9 `cat /data/monitor/log/consul.pid`
    rm /data/monitor/log/consul.pid
else
    echo "file consul.pid not exist"
fi
9: 一鍵啟動腳本
#!/bin/bash

cd `dirname $0`
# start node_exporter
echo "start node_exporter"
./start_node_exporter.sh

#start consul
echo "start consul"
./start_consul.sh

#start prometheus
echo "start prometheus"
./start_prometheus.sh

#start grafana
echo "start grafana"
./start_grafana.sh

echo "see logs at /data/monitor/log"

10: 一鍵停止
#!/bin/bash

cd `dirname $0`

# stop grafana
echo "stop grafana"
./stop_grafana.sh

# stop prometheus
echo "stop prometheus"
./stop_prometheus.sh

# stop consul
echo "stop consul"
./stop_consul.sh


# stop node_exporter
echo "stop node_exporter"
./stop_node_exporter.sh
echo "see logs at /data/monitor/log"
           

到此所有基礎腳本編寫完畢. 進入/data/monitor/bin目錄,一鍵啟動,可以檢查是否正常運作了,

ps -ef | grep consul ,

ps -ef | grep prometheus ,

ps -ef | grep grafana,

ps -ef | grep node_export,

也可以通路檢視資料, curl localhost:9100/metrics .

由于篇幅問題, 将緊接着的内容寫在下一篇;

GitHub: https://github.com/qinyunsurd/monitor