Job Title


Site Reliability Engineer


Company : TELUS Digital AI Data Solutions


Location : Bengaluru, Karnataka


Created : 2025-04-04


Job Type : Full Time


Job Description

TELUS Digital (TD) Experience partners with the world’s most innovative brands, from tech startups to industry leaders in fintech, gaming, healthcare, and more. We empower businesses to scale and redefine possibilities with integrated customer experience and cutting-edge digital solutions. Backed by TELUS, our multi-billion-dollar parent company, we offer scalable, multi-language, and multi-shore capabilities. Our expertise spans digital transformation, AI-driven consulting, IT lifecycle management, and more – delivered with secure infrastructure, value-driven pricing, and exceptional service.

AI Data Solutions: Shaping the Future of AI

For nearly two decades, TELUS Digital AI Data Solutions has been a global leader in providing premium data services for the ever-evolving AI ecosystem. From machine learning to computer vision and Generative AI (GenAI), we empower the next generation of AI-powered experiences with high-quality data and human intelligence to test, train and improve AI models.

Backed by a community of over one million contributors and proprietary AI-driven tools, we deliver solutions designed to cover the training data needs of every project. From custom data collection to advanced data annotation and fine-tuning, our purpose-built tools deliver multimodal data for AI training projects of any complexity – from experimental pilots to ambitious large-scale programs. Examples include empowering GenAI models with human-aligned datasets and fine-tuning data across 20+ domains and 100+ languages, enabling autonomous driving and advancing extended reality applications with industry-leading data labelling.

Join us to be part of an innovative team shaping the future of AI and driving digital transformation to new heights!

About the role

We are seeking a skilled Site Reliability Engineer (SRE) to join our team and ensure the reliability, performance, and scalability of our production systems.
As an SRE, you will bridge the gap between development and operations by applying a software engineering mindset to system administration challenges. You’ll work closely with cross-functional teams to design, build, and maintain highly available systems while driving automation to minimise manual toil.

Key Responsibilities

- Design, implement, and maintain scalable, reliable, and fault-tolerant systems.
- Monitor system performance, availability, and reliability using tools like Prometheus, Grafana, New Relic, Datadog, and the ELK Stack.
- Automate repetitive operational tasks using scripting languages (e.g., Python, Bash) and tools like Ansible, Terraform, or Kubernetes.
- Collaborate with development teams to improve system design, deployment pipelines, and operational processes.
- Respond to incidents, perform root cause analysis, and implement preventive measures to reduce future downtime.
- Define and track Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs).
- Participate in on-call rotations to ensure 24/7 system reliability and rapid incident response.
- Optimise infrastructure costs while maintaining performance and reliability standards.
- Document processes, runbooks, and system architecture to ensure knowledge sharing across teams.
- Develop an in-depth understanding of the product, architecture, and supporting technologies.

Required Skills & Experience

- DevOps Expertise – 3+ years of experience in DevOps, SRE, or Cloud Engineering roles.
- Programming Skills – Proficiency in scripting and automation using Python and Shell.
- Cloud & Infrastructure – Experience with AWS, Azure, or GCP and infrastructure-as-code (IaC) tools like Terraform or Ansible.
- Containerisation & Orchestration – Expertise in Docker and Kubernetes for scalable deployments.
- Networking & Security – Good understanding of firewalls, VPNs, and IAM roles for secure infrastructure.
- Linux Mastery – Strong knowledge of Linux administration, troubleshooting, and shell scripting.
- Flexibility – Willingness to participate in on-call rotations on weekends.

Equal Opportunity Employer

At TELUS Digital, we are proud to be an equal-opportunity employer and are committed to creating a diverse and inclusive workplace. All aspects of employment, including the decision to hire and promote, are based on applicants’ qualifications, merits, competence and performance without regard to any characteristic related to diversity.