Job Title


Site Reliability Engineer


Company : Kindred Group plc


Location : Sydney, Australia


Created : 2025-04-05


Job Type : Full Time


Job Description

As a Site Reliability Engineer, you'll be at the heart of ensuring the resilience and observability of our Sports Book Platform. This isn't just about keeping the lights on; it's about building systems and solutions that thrive under extreme pressure. You'll collaborate with feature teams to extract rich telemetry and structured event data from their code, architect and build highly distributed observability solutions to handle massive data ingestion, and develop tools that turn complex signals into actionable insights for technical and business stakeholders alike.

Your mission: to ensure our systems stay fast, scalable, and reliable, even when millions of bets are placed simultaneously during major international sporting events. Whether it's a local derby in Europe, a championship game in North America, or a global tournament, your work will be critical to delivering seamless betting experiences across continents. If you love solving complex and novel problems, architecting scalable and distributed systems, and being a key player in a high-stakes environment, we'd love to hear from you.

Responsibilities:

Architect, build, and maintain large-scale, distributed telemetry pipelines and observability platforms that provide real-time insight into system performance and reliability.
Design innovative solutions for telemetry challenges in our uniquely asynchronous and distributed ecosystem, ensuring high visibility across services.
Act as a subject matter expert, collaborating with development teams to optimise instrumentation, observability tooling, and reliability strategies.
Drive capacity planning and proactive performance optimisation, always pushing the envelope to anticipate and meet evolving business and customer needs.
Partner with teams to define and refine the four golden signals, service levels, and error budgets, ensuring we measure and improve critical user journeys and business impact.
Take ownership in high-stakes production incidents, leading deep-dive investigations and implementing long-term solutions to prevent future disruptions.
Develop, refine, and automate reliability-focused tooling, reducing toil and increasing engineering efficiency across the platform.

Skills and Experience:

Deep expertise in site reliability engineering concepts and practices.
Advanced knowledge of observability and telemetry data principles, with hands-on experience in designing and implementing solutions at scale.
Experience with Linux system administration and fundamentals.
Solid understanding of network fundamentals, with an emphasis on Layer 7 protocols such as gRPC, DNS, and TLS.
Extensive experience with IaaS platforms, both cloud and on-prem.
Strong experience with containerisation principles, tooling, and orchestration.
Proficiency in one or more of the following: Go, Python, C#, Node.js, or similar programming languages.
Strong grasp of CI/CD automation and Infrastructure as Code (IaC) principles.

Bonus Skills and Experience:

Experience in fintech-style operations.
Experience working in large-scale, low-latency asynchronous systems.
Hands-on experience with the OpenTelemetry ecosystem.
A keen interest in new technologies and industry trends in the SRE and observability space.
Strong analytical and troubleshooting skills, with a systematic approach to problem-solving.
Excellent verbal and written communication skills, with the ability to document systems, processes, and troubleshooting steps clearly.

Benefits:

We are in a fantastic new office near Barangaroo, close to Wynyard station.
Our office has a sports hub, if you want to challenge a mate to a game of table tennis or darts.
Fancy a good cup of coffee? We have an in-house barista to get you that perfect cup!
Many social events to take part in (Melbourne Cup is just one of them).
Great work-life balance and flexibility.
A continued commitment to employee development.
Life insurance and income protection plans.
Wellness benefits.