On-site Full Time
YALLO Group -
Saudi , Jeddah
--
YALLO Group

Job Details

We are seeking an experienced Dev Ops Administrator to manage, operate, and optimize our containerized platforms built on Kubernetes, Rancher, and Longhorn. The role focuses on ensuring high availability, performance, security, and reliability of Kubernetes clusters across development, staging, and production environments.
The ideal candidate will have strong hands-on experience with Kubernetes operations, Rancher management, persistent storage using Longhorn, and Linux system administration, along with a solid understanding of Dev Ops best practices.
Key Responsibilities
Kubernetes Administration Install, configure, upgrade, and manage Kubernetes clusters (on-premise and/or cloud). Operate and maintain RKE / RKE2 / K3s based clusters. Troubleshoot cluster-level issues related to API server, etcd, scheduler, controller-manager, and kubelet. Perform cluster scaling, upgrades, backup, and disaster recovery. Monitor cluster health, performance, and resource utilization. Rancher Management Deploy, configure, and maintain Rancher (HA and single-node setups). Manage downstream clusters via Rancher. Handle RBAC, authentication (AD/LDAP/OIDC), and multi-tenant access. Troubleshoot Rancher UI, API, and cluster connectivity issues. Perform Rancher upgrades and version compatibility assessments. Longhorn Storage Administration Deploy and manage Longhorn as a persistent storage solution. Monitor volume health, replicas, snapshots, and backups. Troubleshoot storage latency, degraded volumes, and replica rebuilds. Implement storage best practices and performance tuning. Plan and test backup and restore strategies for persistent workloads. CI/CD & Dev Ops Practices Integrate Kubernetes with CI/CD pipelines (Jenkins, Git Lab CI, Git Hub Actions, etc.). Support container image build, scan, and deployment workflows. Manage Helm charts and Kubernetes manifests. Implement Git Ops practices (Argo CD / Flux is a plus). Monitoring, Logging & Security Implement and maintain monitoring and alerting (Prometheus, Grafana). Manage centralized logging solutions (ELK / EFK). Apply Kubernetes security best practices, including:Network policies Pod Security Standards Secrets management Perform vulnerability assessments and patching.

Similar Jobs

About YALLO Group
Saudi, Jeddah
Information Technology and Services