Insight Global is looking for an Operations Site Reliability Engineer to help with global operational support for a leading infrastructure software product companys customer-facing Saas products. You will be part of a team of engineers that demonstrates superb technical competency, operates mission-critical infrastructure and ensures the highest levels of availability (24x7x365), performance and security. This SRE would be part of the critical operations function that is responsible for the monitoring, availability and performance of production services. They would be driving automation to reduce failures, manual tasks and therefore improving overall application performance and availability. As well as responding to stakeholder requests within agreed timescales or SLO, they will also be supporting maintenance activities, critical systems, and the planning of releases related to production applications. This is an opportunity to join an organization expanding dramatically, whilst also offering a highly competitive salary, bonus and equity package. Must haves: A degree in Systems Engineering, Computer Science or related fields Professional experience working in a large cloud operations setting Experience administering Linux systems Strong hands-on experience of variants of Linux distros Operational experience of working with Amazon Web Services or Google Cloud Platform Experience of working with an automation platform to automate repetitive actions that reduce manual effort Experienced and confident in at least one scripting language such as Perl, shell, Ruby, BASH or Python Familiarity with deployment tools such as Ansible Tower and Jenkins Experience in carrying out large deployments to global infrastructure Experience of system/application administration in a distributed, customer-facing, high-availability and large-scale environments
Job Title
Site Reliability Engineer