Work location: Ho Chi Minh
Salary:
Industry: IT - Software
Deadline to apply:
Level: Experienced (Non - Manager)
Experience:
Key Responsibilities
· Manage, operate, and optimize CT Group’s hybrid infrastructure, including On-Premise systems, FPT Cloud, and Azure Cloud.
· Administer and maintain Kubernetes (K8s) clusters, including Helm, Ingress, and Service Mesh (Istio, Linkerd).
· Design, implement, and maintain CI/CD pipelines, Infrastructure-as-Code (IaC), and automation solutions using GitLab CI, Jenkins, ArgoCD, Terraform, and Ansible.
· Operate and optimize ERP systems (Odoo, SAP, Dynamics…) and web applications (PHP, Python, Node.js, .NET), ensuring performance, scalability, and security.
· Design and maintain system architecture for high availability, scalability, disaster recovery, and hybrid/multi-cloud operations.
· Implement system observability: monitoring, logging, alerting, and tracing (Prometheus, Grafana, ELK, Loki, Alertmanager).
· Ensure infrastructure and application security: IAM, RBAC, TLS, PKI, WAF, secrets management, web and API security.
· Collaborate with Data, Infra, and Security teams to integrate infrastructure with data platforms and internal applications.
· Support DevOps, SRE, and FinOps practices to optimize system reliability and cloud operating costs.
· Troubleshoot incidents, perform root cause analysis, and execute performance tuning for mission-critical systems.
Required Qualifications and Skills
Education & Experience
· Bachelor’s degree in Computer Science, Information Systems, or related field.
· 5–8 years of experience as a DevOps Engineer, System Engineer, or equivalent role.
· Strong foundation in DevOps, System Engineering, Networking, Security, Cloud, and Kubernetes.
· Proven track record in system design, cloud architecture, and infrastructure automation.
Technical Skills
· Kubernetes: cluster management, Helm, Ingress, Istio/Linkerd.
· Cloud Platforms: Azure, FPT Cloud, multi-cloud, hybrid cloud.
· On-Premise Systems: VMware, Proxmox, Bare Metal servers.
· CI/CD & IaC: GitLab CI, Jenkins, ArgoCD, Terraform, Ansible.
· Monitoring & Observability: Prometheus, Grafana, ELK, Loki, Alertmanager.
· Scripting & Automation: Bash, Python, Go.
· Networking: TCP/IP, VPN, Load Balancers, Firewalls, NAT, DNS.
· Security: IAM, RBAC, TLS, PKI, WAF, audit logging, secrets management, API/web app security.
· System & Cloud Architecture: High availability, disaster recovery, cost optimization, multi-region.
Preferred Qualifications
· Experience operating large-scale data systems (Big Data, Kafka, Airflow, Data Warehouses).
· Hands-on experience with AI/ML platforms and MLOps pipelines.
· Relevant certifications:
o Kubernetes: CKA, CKAD
o Cloud: Azure Administrator, Azure DevOps Engineer
o Infrastructure: Terraform Associate
o Security: Network Security, Cloud Security
Soft Skills
· Strong analytical and troubleshooting skills, with the ability to diagnose and resolve complex infrastructure issues.
· Proactive, system-oriented thinking with a focus on scalability, cost efficiency, and reliability.
· Excellent communication and collaboration skills to work across teams (Data, Infra, Security, Applications).
· Ability to manage multiple priorities and meet deadlines in a fast-paced, multi-project environment.
· Strong sense of responsibility for system quality and performance.
· Leadership mindset with the ability to mentor and guide junior engineers.
Benefits
· Competitive salary based on skills and experience.
· Performance-based bonuses and rewards for outstanding contributions.
· Opportunities to work on large-scale, diverse projects developed by the company.
· Professional working environment with a transparent and respectful culture.
· Access to training programs, certifications, and career development opportunities.
· Comprehensive health insurance and other employee benefits as per company policy
https://www.ctgroupvietnam.com/ Number of employees: 1.000-4.999