Go to file
Trey Dockendorf 47d48d1f85 Release v0.2.1 2020-03-18 09:22:48 -04:00
.circleci Add codecov 2020-02-20 14:29:26 -05:00
test Add torque cgroup files for testing 2020-02-13 10:53:51 -05:00
.editorconfig Initial commit 2020-02-12 11:07:45 -05:00
.gitignore Add codecov 2020-02-20 14:29:26 -05:00
.promu.yml Remove broken crossbuilds 2020-02-12 12:22:37 -05:00
CHANGELOG.md Release v0.2.1 2020-03-18 09:22:48 -04:00
Dockerfile Fix Dockerfile 2020-03-18 08:50:44 -04:00
LICENSE Misc fixes 2020-02-12 12:26:18 -05:00
Makefile s390x is not supported and not building with CGO enabled 2020-03-18 09:09:43 -04:00
Makefile.common Initial commit 2020-02-12 11:07:45 -05:00
README.md Replace swap metrics with memsw to describe the raw values 2020-02-27 11:03:20 -05:00
VERSION Release v0.2.1 2020-03-18 09:22:48 -04:00
cgroup_exporter.go Replace swap metrics with memsw to describe the raw values 2020-02-27 11:03:20 -05:00
cgroup_exporter_test.go Replace swap metrics with memsw to describe the raw values 2020-02-27 11:03:20 -05:00
codecov.yml Release v0.2.1 2020-03-18 09:22:48 -04:00
go.mod Initial commit 2020-02-12 11:07:45 -05:00
go.sum Support getting user slice information 2020-02-12 12:41:36 -05:00

README.md

cgroup Prometheus exporter

Build Status GitHub release GitHub All Releases codecov

Check mount Prometheus exporter

The cgroup_exporter produces metrics from cgroups.

This exporter by default listens on port 9306 and all metrics are exposed via the /metrics endpoint.

Usage

The --config.paths flag is required and must point to paths of cgroups to monitor. If there is /sys/fs/cgroup/cpuacct/user.slice then the value for --config.paths would be /user.slice.

Docker

Example of running the Docker container

docker run -d -p 9306:9306 -v "/:/host:ro,rslave" treydock/cgroup_exporter --path.cgroup.root=/host/sys/fs/cgroup

Install

Download the latest release

Build from source

To produce the cgroup_exporter binaries:

make build

Or

go get github.com/treydock/cgroup_exporter

Metrics

Example of metrics exposed by this exporter when looking at /user.slice paths:

cgroup_cpu_system_seconds{cgroup="/user.slice/user-20821.slice"} 1.96
cgroup_cpu_total_seconds{cgroup="/user.slice/user-20821.slice"} 3.817500568
cgroup_cpu_user_seconds{cgroup="/user.slice/user-20821.slice"} 1.61
cgroup_cpus{cgroup="/user.slice/user-20821.slice"} 0
cgroup_info{cgroup="/user.slice/user-20821.slice",uid="20821",username="tdockendorf",jobid=""} 1
cgroup_memory_fail_count{cgroup="/user.slice/user-20821.slice"} 0
cgroup_memory_total_bytes{cgroup="/user.slice/user-20821.slice"} 6.8719476736e+10
cgroup_memory_used_bytes{cgroup="/user.slice/user-20821.slice"} 6.90176e+06
cgroup_memsw_fail_count{cgroup="/user.slice/user-20821.slice"} 0
cgroup_memsw_total_bytes{cgroup="/user.slice/user-20821.slice"} 9.223371968135295e+18
cgroup_memsw_used_bytes{cgroup="/user.slice/user-20821.slice"} 0

Example of metrics exposed by this exporter when looking at /slurm paths:

cgroup_cpu_system_seconds{cgroup="/slurm/uid_20821/job_12"} 0
cgroup_cpu_total_seconds{cgroup="/slurm/uid_20821/job_12"} 0.007840451
cgroup_cpu_user_seconds{cgroup="/slurm/uid_20821/job_12"} 0
cgroup_cpus{cgroup="/slurm/uid_20821/job_12"} 2
cgroup_info{cgroup="/slurm/uid_20821/job_12",jobid="12",uid="20821",username="tdockendorf"} 1
cgroup_memory_fail_count{cgroup="/slurm/uid_20821/job_12"} 0
cgroup_memory_total_bytes{cgroup="/slurm/uid_20821/job_12"} 2.147483648e+09
cgroup_memory_used_bytes{cgroup="/slurm/uid_20821/job_12"} 315392
cgroup_memsw_fail_count{cgroup="/slurm/uid_20821/job_12"} 0
cgroup_memsw_total_bytes{cgroup="/slurm/uid_20821/job_12"} 2.147483648e+09
cgroup_memsw_used_bytes{cgroup="/slurm/uid_20821/job_12"} 315392

Example of metrics exposed by this exporter when looking at /torque paths:

cgroup_cpu_system_seconds{cgroup="/torque/1182958.batch.example.com"} 26.35
cgroup_cpu_total_seconds{cgroup="/torque/1182958.batch.example.com"} 939.568245515
cgroup_cpu_user_seconds{cgroup="/torque/1182958.batch.example.com"} 915.61
cgroup_cpus{cgroup="/torque/1182958.batch.example.com"} 40
cgroup_info{cgroup="/torque/1182958.batch.example.com",jobid="1182958",uid="",username=""} 1
cgroup_memory_fail_count{cgroup="/torque/1182958.batch.example.com"} 0
cgroup_memory_total_bytes{cgroup="/torque/1182958.batch.example.com"} 1.96755132416e+11
cgroup_memory_used_bytes{cgroup="/torque/1182958.batch.example.com"} 5.3434466304e+10
cgroup_memsw_fail_count{cgroup="/torque/1182958.batch.example.com"} 0
cgroup_memsw_total_bytes{cgroup="/torque/1182958.batch.example.com"} 1.96755132416e+11
cgroup_memsw_used_bytes{cgroup="/torque/1182958.batch.example.com"} 5.3434466304e+10