文章目录
- 安装
- 操作Flume测试监控
Ganglia : 发音 干个你儿
安装
- 安装httpd服务与php
yum -y install httpd php
- 安装其他依赖
yum -y install rrdtool perl-rrdtool rrdtool-devel
yum -y install apr-devel
- 安装ganglia
rpm -Uvh http://dl.fedoraproject.org/pub/epel/6/x86_64/epel-release-6-8.noarch.rpm
yum -y install ganglia-gmetad
出现报错
Error: Cannot retrieve metalink for repository: epel. Please verify its path and try again
重新执行
yum -y install ganglia-gmetad
yum -y install ganglia-web
yum install -y ganglia-gmond
Ganglia由gmond、gmetad和gweb三部分组成。
gmond(Ganglia Monitoring Daemon)
是一种轻量级服务,安装在每台需要收集指标数据的节点主机上。使用gmond,你可以很容易收集很多系统指标数据,如CPU、内存、磁盘、网络和活跃进程的数据等。
gmetad(Ganglia Meta Daemon)
整合所有信息,并将其以RRD格式存储至磁盘的服务。
gweb(Ganglia Web)
Ganglia可视化工具,gweb是一种利用浏览器显示gmetad所存储数据的PHP前端。在Web界面中以图表方式展现集群的运行状态下收集的多种不同指标数据。
- 修改配置文件/etc/httpd/conf.d/ganglia.conf
vim /etc/httpd/conf.d/ganglia.conf
注释掉其他的,增加Allow from all
#
# Ganglia monitoring system php web frontend
#
Alias /ganglia /usr/share/ganglia
<Location /ganglia>
Order deny,allow
# Deny from all
# Allow from 127.0.0.1
Allow from all
# Allow from ::1
# Allow from .example.com
</Location>
- 修改配置文件/etc/ganglia/gmetad.conf
vim /etc/ganglia/gmetad.conf
修改为:
data_source "note01" 192.168.18.100
- 修改配置文件/etc/ganglia/gmond.conf
vim /etc/ganglia/gmond.conf
修改为:
name 修改自己域名,注释 # mcast_join udp_send_channel 增加host,udp_recv_channel 增加bind
cluster {
name = "note01"
owner = "unspecified"
latlong = "unspecified"
url = "unspecified"
}
/* The host section describes attributes of the host, like the location */
host {
location = "unspecified"
}
/* Feel free to specify as many udp_send_channels as you like. Gmond
used to only support having a single channel */
udp_send_channel {
#bind_hostname = yes # Highly recommended, soon to be default.
# This option tells gmond to use a source address
# that resolves to the machine's hostname. Without
# this, the metrics may appear to come from any
# interface and the DNS names associated with
# those IPs will be used to create the RRDs.
# mcast_join = 239.2.11.71
host = 192.168.18.100
port = 8649
ttl = 1
}
/* You can specify as many udp_recv_channels as you like as well. */
udp_recv_channel {
# mcast_join = 239.2.11.71
bind = 192.168.18.100
port = 8649
bind = 239.2.11.71
retry_bind = true
# Size of the UDP buffer. If you are handling lots of metrics you really
# should bump it up to e.g. 10MB or even higher.
# buffer = 10485760
}
- 修改配置文件/etc/selinux/config
vim /etc/selinux/config
修改
SELINUX=disabled
尖叫提示:selinux本次生效关闭必须重启,如果此时不想重启,可以临时生效之:
setenforce 0
- 启动ganglia
service httpd start
service gmetad start
service gmond start
访问 http://192.168.18.100/ganglia/
尖叫提示:如果完成以上操作依然出现权限不足错误,请修改/var/lib/ganglia目录的权限:
chmod -R 777 /var/lib/ganglia
操作Flume测试监控
- 修改/opt/module/flume/conf目录下的flume-env.sh配置:
JAVA_OPTS="-Dflume.monitoring.type=ganglia
-Dflume.monitoring.hosts=192.168.1.102:8649
-Xms100m
-Xmx200m"
- 启动Flume任务
bin/flume-ng agent \
> --conf conf/ \
> --name a1 \
> --conf-file job/flume-netcat-logger.conf \
> -Dflume.root.logger==INFO,console \
> -Dflume.monitoring.type=ganglia \
> -Dflume.monitoring.hosts=192.168.18.100:8649
- 发送数据观察ganglia监测图
图例说明:
字段(图表名称) | 字段含义 |
EventPutAttemptCount | source尝试写入channel的事件总数量 |
EventPutSuccessCount | 成功写入channel且提交的事件总数量 |
EventTakeAttemptCount | sink尝试从channel拉取事件的总数量。这不意味着每次事件都被返回,因为sink拉取的时候channel可能没有任何数据。 |
EventTakeSuccessCount | sink成功读取的事件的总数量 |
StartTime | channel启动的时间(毫秒) |
StopTime | channel停止的时间(毫秒) |
ChannelSize | 目前channel中事件的总数量 |
ChannelFillPercentage | channel占用百分比 |
ChannelCapacity | channel的容量 |