Job Description: Cloud Engineer Business Group: Platform and Data Operations Overview: In collaboration with the Customer Insights Data and Analytics teams and IT partners, the Data and Analytics Technology team supports the client''s data and analytics needs through tooling, projects, and IT operational support. Role Summary: The Cloud Engineer will be responsible for designing, building, monitoring, tuning, and troubleshooting platform-wide initiatives supported by Data and Analytics Technology. This includes platform building, tooling, integration, process automation, platform enhancement, and delivery of new projects. Candidate Value Proposition: The successful candidate will have the opportunity to work on a highly visible project, opening up numerous opportunities within the organization. Typical Day in Role: Manage GCP Cloud Infrastructure: Design, build, and implement solutions using standard public cloud services like Google Cloud Storage, BigQuery, Dataproc, Vertex AI Notebooks, Cloud Run, and Cloud Functions, utilizing Terraform Modules. Release Control Management: Maintain and enhance the Release Control Management pipeline using Terraform, Cloud Build, GitHub Actions, and Bitbucket/GitHub repositories. Client Pipeline Management: Implement CI/CD processes by designing industry-standard Cloud Build for deploying infrastructure and analytics workloads using Terraform, Docker, Cloud Build, GitHub Actions, Artifact Registry, and other build/deployment activities. Credential Security: Set up HashiCorp Vault and Secret Manager for secret management. Integrate security solutions with client interfaces. Client User Acceptance Testing: Lead clients in user acceptance testing for component and base image upgrades, ensuring smooth transitions and minimal disruptions. Advanced Monitoring and Troubleshooting: Troubleshoot and resolve performance issues to ensure optimal system performance. Vendor and Technical Support Interaction: Regularly meet with product vendors and technical support to fine-tune and troubleshoot software components, ensuring the highest level of system performance and reliability. Client Support: Assist tenants with troubleshooting issues related to GCP and its services. Mentorship: Mentor junior engineers in best practices for building, deploying, testing, and supporting services, fostering a culture of continuous learning and improvement. Hadoop Cluster Management: Provision, configure, and manage Hadoop clusters within the Dataproc environment. Optimize Dataproc cluster configurations for performance, cost-efficiency, and stability. Candidate Requirements/Must-Have Skills: 5+ years of experience managing a public cloud platform for an enterprise on GCP with technical expertise in foundational and data services. 3+ years of experience using Infrastructure as Code tooling like Terraform to manage large-scale infrastructure platforms with strong knowledge of best practices for access control and least-privilege policy. 5+ years of experience in DevOps , building CI/CD pipelines using GitHub, Artifactory, etc., to reduce cycle times and ensure quality. 10+ years of IT experience in managing and developing applications or platforms ensuring scalability, reliability, and security. 2+ years of experience in public cloud-managed services for data and analytics, including data warehousing, data lakes, ETL services, machine learning, or data governance and security . 5+ years of experience in languages like Python, Go, or Java , and scripting skills (shell scripting, Python, Perl, Ansible) to automate tasks, create scripts, and develop infrastructure as code. Nice-to-Have Skills: Experience managing and administering Hadoop clusters (e.g., Cloudera, Dataproc, Hortonworks) with a strong understanding of the Hadoop ecosystem and its core components. Certification in GCP (GCP Cloud Associate) is desirable. Experience with Docker/Container, including setting up and managing Docker registries and creating Docker files to create custom images. Knowledge of overlay networking for inter-container communications from different nodes and external servers/infrastructure. Experience setting up Kubernetes or similar platforms on-premises/cloud (On-prem Rancher experience is a plus). Soft Skills Required: Authentic, decisive, resilient, highly collaborative, curious, and results-oriented. Superior writing, research, presentation, and project management skills. Strong problem solver who supports the team in navigating challenges and/or asks for help when needed. Superior writing, communication, interpersonal, and project management skills. Team player with effective communication skills (verbal and written). Able to see tasks through to completion without significant guidance. Self-managed and results-oriented with a sense of ownership. Education & Certificates: A university degree in Mathematics, Science, Engineering, Management, or a relevant field.
Job Title
Cloud Engineer (GCP)