IN EmploymentAlert | Coffeee.io | ML Ops Engineer
Skip to Main Content

Job Title


Coffeee.io | ML Ops Engineer


Company : Coffeee.io


Location : Varanasi, Uttar pradesh


Created : 2025-01-07


Job Type : Full Time


Job Description

Job Title: AWS MLOps Engineer Experience: 8+ years of experience as an AWS Engineer, DevOps Engineer, or Cloud Engineer, with a focus on CI/CD pipelines and machine learning solutions.Important Note: Assessment is a mandatory criteria for shortlist :To complete the assessment, simply copy and paste the link into your browser, log in, and open the assessment.Kindly ensure that you attempt it before the deadline, which is 15th December.Assessment Link: Summary: Coffeee.io is looking for an experienced AWS Engineer to join our IT Department to manage and optimize the Continuous Integration/Continuous Deployment (CI/CD) pipeline for machine learning (ML) solutions. The successful candidate will work closely with Model Development team (ML Engineers), data scientists, and DevOps teams to ensure smooth deployment, scaling, and monitoring of ML models on AWS. This role requires a deep understanding of AWS cloud services, DevOps practices, and machine learning infrastructure. Key Responsibilities: 1. CI/CD Pipeline Management & Automation: 1. Design, implement, and maintain robust CI/CD pipelines for deploying machine learning models and solutions. 2. Automate and streamline deployment processes using AWS services such as Code Pipeline, Code Build, Code Deploy, and Code Commit. 3. Ensure seamless integration of model training, testing, and deployment stages within the CI/CD pipeline. 4. Set up and manage infrastructure as code (IaC) using tools like AWS Cloud Formation or Terraform for creating scalable and reliable environments for ML applications. 5. Automate deployment, scaling, and monitoring of machine learning models in AWS environments using AWS Lambda, ECS, EKS, and Sage Maker. 2. AWS Cloud Services Management & Security: a. Manage and configure AWS cloud services such as EC2, S3, Sage Maker, Lambda, and others to support machine learning pipelines and production environments. b. Use AWS Sage Maker for managing the ML lifecycle, including data preparation, training, tuning, and model deployment. c. Set up automated workflows for model retraining and versioning based on new data inputs and performance metrics. d. Ensure compliance with industry standards and internal policies regarding data privacy, security, and governance for machine learning solutions. e. Implement best practices in DevOps, including version control, code quality checks, and deployment automation using AWS services. f. Continuously improve infrastructure by staying up-to-date with new AWS features, best practices, and emerging technologies. 3. Monitoring & Optimization: a. Monitor the performance of deployed ML models and pipelines using AWS Cloud Watch, CloudTrail, and other monitoring tools. b. Implement automated testing, validation, and monitoring processes to ensure models perform as expected in production environments. c. Optimize costs and performance by automating resource scaling, ensuring high-availability, and improving pipeline efficiency.4. Collaboration & Support: a. Collaborate with data scientists, machine learning engineers, and DevOps teams integrate ML models into production systems. b. Provide support and troubleshooting expertise for pipeline issues, including model failures, deployment bottlenecks, and scaling problems. c. Work closely with security teams to implement best practices for security and compliance, ensuring that data and models are protected within AWS. Key Skills & Qualifications: a. Education: Bachelor’s degree in Computer Science, Information Technology, or a related field. b. Experience: 8+ years of experience as an AWS Engineer, DevOps Engineer, or Cloud Engineer, with a focus on CI/CD pipelines and machine learning solutions. c. Strong expertise in AWS cloud services (S3, EC2, Sage Maker, Lambda, Code Pipeline, etc.).d. Experience with CI/CD tools like AWS Code Pipeline, GitLab CI, or similar platforms. e. Proficiency with containerization tools such as Docker and Kubernetes for managing microservices architecture. f. Knowledge of infrastructure as code (IaC) tools such as AWS CloudFormation, Terraform etc. g. Familiarity with machine learning frameworks (e.g., TensorFlow, PyTorch) and deployment in production environments. h. Experience in setting up and managing CI/CD pipelines for machine learning or data science solutions. i. Familiarity with version control systems like Git and deployment automation practices. j. Strong knowledge of monitoring and logging tools (e.g., AWS CloudWatch) for real-time performance tracking. k. Ability to work collaboratively with cross-functional teams, including data scientists, ML engineers, and DevOps teams. l. Strong verbal and written communication skills for documentation and knowledge sharing.