Working knowledge with at least one of the following: Nagios, ELK (ElasticSearch, Logstash, Kibana), Graphite, Grafana, collectd, statsd, PagerDuty integration
Handson experience with supporting/troubleshooting the following a plus: OpenStack, Ceph, Nginx, MySQL, RabbitMQ, Apache, HAProxy, Cisco UCS
Working knowledge with platform automation tools (preferably Ansible, but Puppet also helpful)
Working knowledge with Git and other code repository tools a plus
Working knowledge with languages/tools like: Python, Shell scripting, Ruby, cron
Strong Linux system administration experience
Very familiar with OpenSource tools (supporting, compiling, configuring, modifying, etc.)
Experience with configuration and operation of operations support systems, including service assurance, capacity management, inventory, configuration management, etc.
Experience with obtaining, processing and analyzing data to generate useful metrics, alerts, etc.
Experience supporting large server environments desired
Ability to set architectural and tactical direction based on strategic needs
Ability to create custom solutions to monitoring/metrics/logging needs for distributed systems Knowledge of current industry best practices