Senior Site Reliability Engineer - AWS
Deutsche Telekom IT Solutions
- Košice, Košice Region
- Permanent employment
- Full-time
- Collaborate with architects and application engineers to ensure applications are maintainable, scalable, and follow appropriate disaster recovery
- Develop and automate standard operating procedures around common failure scenarios and manual operating tasks
- Work in scrum teams alongside architects and product managers to support and deploy new infrastructures and operational requirements
- Design, manage and maintain tools to automate manual operational processes
- Build and maintain production systems on AWS using ALB, ELB, WAFs, S3, Serverless, APIs, Route53, etc. with the use of IaC (Terraform and
- Leverage deep expertise to plan and lead the deployment of cloud solutions into production environments with the use of CI/CD pipelines
- Create practical demonstrations of proposed solutions and demonstrate them to other members of the team
- Contribute to the development of best practices for Infrastructure as Code, software build tools, and Continuous Integration
- Work and collaborate with multinational teams in an international environment
- Have strong hands-on experience with AWS cloud services, especially services such as ALB/ELB, WAF, S3, Route53, serverless components, and API integrations.
- Have solid experience designing, deploying, and operating cloud-native infrastructure in production environments.
- Have practical experience with Infrastructure as Code, particularly using Terraform and/or CloudFormation.
- Have experience building and maintaining CI/CD pipelines for automated infrastructure and application deployments.
- Have strong automation and scripting skills (e.g., Python, Bash, or similar) to eliminate manual operational work.
- Have experience designing high availability, scalability, and disaster recovery architectures in cloud environments.
- Have strong troubleshooting and incident management skills, with the ability to quickly identify root causes and minimize service impact.
- Have experience creating and maintaining runbooks, operational documentation, and architecture documentation.
- Have worked in Agile/Scrum teams, collaborating closely with architects, engineers, and product managers.
- Have experience implementing and promoting DevOps best practices, including infrastructure automation, monitoring, and continuous improvement.
- Have experience building and maintaining operational tooling and automation frameworks to support production systems.
- Have strong communication and collaboration skills, enabling effective work across distributed and international teams.
- Are proactive, solution-oriented, and comfortable demonstrating and explaining technical solutions to peers and stakeholders.
- Financial benefits
- Benefits with focus on learning and development
- Benefits with focus on health and sport
- Benefits with focus on family and work-life balance
- Other benefits
- Please note that remote working is only possible within Slovakia due to European taxation regulations.