In this role, you will be responsible for the availability, latency and performance of our customer's platforms.
You will work proactively to identify areas for improvement, implementing automation, DevOps tooling and observability to reduce the impact of failure, provide scale and improved cost efficiency.
You will triage monitoring events and service desk requests, following a well defined incident response process and provide guidance and expertise to customers in a clear, calm and concise manner.
Finally, you will work under the guidance and mentoring of our Solutions Architects to undertake cloud infrastructure and transformation projects in-line with Well-Architected principals.
AWS, Lambda, Kubernetes, ECS, Terraform, Packer, Helm, Jenkins, Puppet, Grafana, Prometheus, ElasticSearch, Aurora, Kinesis, DynamoDB, ElastiCache, KMS, SSM, IAM.
Key skills / experience