Job Description
- Ensure the availability of infrastructure and platform of the internal company and also each of the products released in a multi-company and hybrid (cloud and on-prem) environment.
- Work as an integrated part of the software engineering organization, understanding the application architecture.
- Orchestrate the provisioning, load balancing, dynamic configuration, monitoring, and resource optimization of servers across cloud providers, data centers, and availability zones.
- Manage development of internal engineering productivity tools and be responsible for development and operation of continuous integration and deployment pipeline.
- Implement and maintain full operational compliance against various security and compliance requirements.
- Consistently improve performance and reliability as the platform scales, driving continuous improvement through operational metrics.
- Familiarity with Cloud Platform (AWS, GCP, Aliyun, Azure, etc)
- Familiar with Infrastructure as Code Provisioning tool (Terraform, Ansible, Chef, Puppet, etc)
- Strong container-orchestration system background (K8s, EKS, GKE, etc)
- Experience in handling service mesh and its challenges (such as but not limited to : service discovery, load balancing, failure recovery)
- Experienced in Continuous Integration and Deployment process (Jenkins, GitLab CI, Github Actions, etc)
- Good understanding of system observability concept (such as but not limited to : metrics, monitoring, alerting, logging, tracing, etc)
byOrange
