Job Summary The Cloud Operations Engineer is responsible for ensuring the operational integrity, stability, and reliability of cloud-based systems and applications hosted on AWS. This role involves managing, monitoring, and automating AWS infrastructure and services, implementing configuration management practices, leveraging Infrastructure as Code (IaC) to streamline operations. This role involves collaboration with stakeholders and adhering to security and compliance standards. Duties & Responsibilities AWS Infrastructure Management Design, install, configure, and maintain AWS cloud infrastructure, including server and serverless architectures. Architect and design improvements using new native solutions in AWS and/or alternative Cloud Environment. Good knowledge of core cloud services such as VMs, Containers, App Services, Virtual Networks, ASGs, Application Gateways, Load balancers and S3 accounts. Regularly evaluate cloud applications, designs, and best practices. Implement secure networking solutions such as VPC, subnets, routing and security groups. Manage security groups, IAM roles, and policies for secure access control. Automation and Optimization Create and maintain automated solutions for repetitive tasks, including infrastructure provisioning, monitoring, and patching. Optimize cloud resources for cost management and performance. Implement auto-scaling and elasticity solutions to ensure infrastructure reliability and efficiency. Configuration Management and Infrastructure as Code (IaC) Use Infrastructure as Code tools to provision, manage and version AWS resources. Implement and maintain configuration management tools to standardize and automate configurations. Monitoring and Alerting Configure and maintain monitoring and alerting systems to ensure the health and performance of the infrastructure. Application Deployment and Support Deploy, configure, and support applications and services on AWS infrastructure. Collaborate with DevOps teams to optimize and automate CI/CD pipelines. Change and Incident Management Prepare and document Change Requests and Methods of Procedures (MOPs) for infrastructure changes. Troubleshoot and resolve incidents affecting cloud services in accordance with predefined Service Level Agreements (SLAs). Escalate issues to internal teams or third-party vendors as required. Incident Response and Post-Incident Reporting Participate in incident response processes and post-incident reporting (PIR). Identify and deploy fixes to prevent recurrence of issues. Security and Compliance Work with IT Security to ensure cloud infrastructure aligns with security best practices and compliance requirements. Work with internal architecture, Solution Delivery(PMO), governance, and security teams to ensure that all security, governance and business continuity requirements and best practices are integrated and implemented. Implement encryption, backup, and disaster recovery solutions. Safety and Compliance Actively engage in the companys Safety Management System (SMS) by reporting hazards and incidents encountered during daily operations. Collaboration and Documentation Coordinate with internal stakeholders, DevOps, and third-party vendors for deployments and fixes. Work within a cross-functional team of System Ops Engineers, App Admins, Data Engineers DevOps and DBAs to specify, design, develop, test, and implement AWS cloud services and solutions. Provide guidance and knowledge to other team members, and promote efficiency, productivity, innovations, and knowledge-sharing across multi-functional teams. Work closely with internal business partners to gather requirements, design and implement solutions, manage technical operations, and triage and resolve operational issues. Maintain detailed and accurate system, application, and infrastructure documentation. On-Call Support Participate in an after-hours on-call rotation for critical incident resolution. Other Responsibilities Perform additional related duties as assigned by management. Behavioural Competencies Concern for Safety: Identifying hazardous or potentially hazardous situations and taking appropriate action to maintain a safe environment for self and others. Teamwork: Working collaboratively with others to achieve organizational goals. Passenger/Customer Service: Providing service excellence to internal and/or external customers (passengers). Initiative: Dealing with situations and issues proactively and persistently, seizing opportunities that arise. Results Focus: Focusing efforts on achieving high quality results consistent with the organizations standards. Fostering Communication: Listening and communicating openly, honestly, and respectfully with different audiences, promoting dialogue and building consensus. Qualifications Strong experience with AWS services such as EC2, S3, RDS, Lambda, CloudFormation, and VPC. 5+ years of experience in IT Infrastructure Operations with a minimum of 3 years of experience in AWS. Proficiency in automation tools (e.g., AWS CLI, Terraform, or CloudFormation). Familiarity with monitoring tools like CloudWatch, Dynatrace. Experience with scripting languages such as Python, PowerShell, or Bash. Experience with implementing, supporting and monitoring servers and applications in both Windows and Linux environments. Experience integrating applications and systems. Understanding of ITIL practices and change management processes. Strong problem-solving skills and ability to perform under pressure. Excellent communication and documentation skills. Experience with DevOps, CI/CD pipelines and Configuration Management is an asset. Willingness to work flexible hours, including after-hours and on-call rotations. Location Toronto Downtown Office (250 Yonge Street) #LI-Hybrid Company Description Since 2006, Porter Airlines has been elevating the experience of economy air travel for every passenger, providing genuine hospitality with style, care and charm. Porters fleet of Embraer E195-E2 and De Havilland Dash 8-400 aircraft serves a North American network from Eastern Canada. Headquartered in Toronto, Porter is an Official 4 Star Airline in the World Airline Star Rating. Visit or follow @porterairlines on Instagram, Facebook and Twitter. #J-18808-Ljbffr
Job Title
Cloud Operations Engineer