我常用的主機(jī)監(jiān)控Shell腳本
最近時不時有朋友問我關(guān)于服務(wù)器監(jiān)控方面的問題,問常用的服務(wù)器監(jiān)控除了用開源軟件,比如:cacti,nagios監(jiān)控外是否可以自己寫shell腳本呢?根據(jù)自己的需求寫出的shell腳本更能滿足需求,更能細(xì)化主機(jī)監(jiān)控的全面性。
下面是我常用的幾個主機(jī)監(jiān)控的腳本,大家可以根據(jù)自己的情況在進(jìn)行修改,希望能給大家一點(diǎn)幫助。
1、查看主機(jī)網(wǎng)卡流量
- #!/bin/bash
 - #!/bin/bash
 - #network
 - #Mike.Xu
 - while : ; do
 - time='date +%m"-"%d" "%k":"%M'
 - day='date +%m"-"%d'
 - rx_before='ifconfig eth0|sed -n "8"p|awk '{print $2}'|cut -c7-'
 - tx_before='ifconfig eth0|sed -n "8"p|awk '{print $6}'|cut -c7-'
 - sleep 2
 - rx_after='ifconfig eth0|sed -n "8"p|awk '{print $2}'|cut -c7-'
 - tx_after='ifconfig eth0|sed -n "8"p|awk '{print $6}'|cut -c7-'
 - rx_result=$[(rx_after-rx_before)/256]
 - tx_result=$[(tx_after-tx_before)/256]
 - echo "$time Now_In_Speed: "$rx_result"kbps Now_OUt_Speed: "$tx_result"kbps"
 - sleep 2
 - done
 - done
 
2、系統(tǒng)狀況監(jiān)控
- #!/bin/sh
 - #systemstat.sh
 - #Mike.Xu
 - IP=192.168.1.227
 - top -n 2| grep "Cpu" >>./temp/cpu.txt
 - free -m | grep "Mem" >> ./temp/mem.txt
 - df -k | grep "sda1" >> ./temp/drive_sda1.txt
 - #df -k | grep sda2 >> ./temp/drive_sda2.txt
 - df -k | grep "/mnt/storage_0" >> ./temp/mnt_storage_0.txt
 - df -k | grep "/mnt/storage_pic" >> ./temp/mnt_storage_pic.txt
 - time=`date +%m"."%d" "%k":"%M`
 - connect=`netstat -na | grep "219.238.148.30:80" | wc -l`
 - echo "$time $connect" >> ./temp/connect_count.txt
 
3、監(jiān)控主機(jī)的磁盤空間,當(dāng)使用空間超過90%就通過發(fā)mail來發(fā)警告
- #!/bin/bash
 - #monitor available disk space
 - SPACE='df | sed -n '/ \ / $ / p' | gawk '{print $5}' | sed 's/%//'
 - if [ $SPACE -ge 90 ]
 - then
 - fty89@163.com
 - fi
 
4、 監(jiān)控CPU和內(nèi)存的使用情況
- #!/bin/bash
 - #script to capture system statistics
 - OUTFILE=/home/xu/capstats.csv
 - DATE='date +%m/%d/%Y'
 - TIME='date +%k:%m:%s'
 - TIMEOUT='uptime'
 - VMOUT='vmstat 1 2'
 - USERS='echo $TIMEOUT | gawk '{print $4}' '
 - LOAD='echo $TIMEOUT | gawk '{print $9}' | sed "s/,//' '
 - FREE='echo $VMOUT | sed -n '/[0-9]/p' | sed -n '2p' | gawk '{print $4} ' '
 - IDLE='echo $VMOUT | sed -n '/[0-9]/p' | sed -n '2p' |gawk '{print $15}' '
 - echo "$DATE,$TIME,$USERS,$LOAD,$FREE,$IDLE" >> $OUTFILE
 
5、全方位監(jiān)控主機(jī)
- #!/bin/bash
 - # check_xu.sh
 - # 0 * * * * /home/check_xu.sh
 - DAT="`date +%Y%m%d`"
 - HOUR="`date +%H`"
 - DIR="/home/oslog/host_${DAT}/${HOUR}"
 - DELAY=60
 - COUNT=60
 - # whether the responsible directory exist
 - if ! test -d ${DIR}
 - then
 - /bin/mkdir -p ${DIR}
 - fi
 - # general check
 - export TERM=linux
 - /usr/bin/top -b -d ${DELAY} -n ${COUNT} > ${DIR}/top_${DAT}.log 2>&1 &
 - # cpu check
 - /usr/bin/sar -u ${DELAY} ${COUNT} > ${DIR}/cpu_${DAT}.log 2>&1 &
 - #/usr/bin/mpstat -P 0 ${DELAY} ${COUNT} > ${DIR}/cpu_0_${DAT}.log 2>&1 &
 - #/usr/bin/mpstat -P 1 ${DELAY} ${COUNT} > ${DIR}/cpu_1_${DAT}.log 2>&1 &
 - # memory check
 - /usr/bin/vmstat ${DELAY} ${COUNT} > ${DIR}/vmstat_${DAT}.log 2>&1 &
 - # I/O check
 - /usr/bin/iostat ${DELAY} ${COUNT} > ${DIR}/iostat_${DAT}.log 2>&1 &
 - # network check
 - /usr/bin/sar -n DEV ${DELAY} ${COUNT} > ${DIR}/net_${DAT}.log 2>&1 &
 - #/usr/bin/sar -n EDEV ${DELAY} ${COUNT} > ${DIR}/net_edev_${DAT}.log 2>&1 &
 
放在crontab里每小時自動執(zhí)行:
- 0 * * * * /home/check_xu.sh
 
這樣會在/home/oslog/host_yyyymmdd/hh目錄下生成各小時cpu、內(nèi)存、網(wǎng)絡(luò),IO的統(tǒng)計數(shù)據(jù)
如果某個時間段產(chǎn)生問題了,就可以去看對應(yīng)的日志信息,看看當(dāng)時的主機(jī)性能如何。















 
 
 








 
 
 
 