Senior Engineer (Kubernetes Orchestrator) for NATO with security clearance
Would you like to join the leading international intergovernmental organization?
The Senior Engineer (Kubernetes Orchestrator) will be responsible for the design, implementation, and continuous evolution of scalable, secure, and resilient Kubernetes platforms using SUSE Rancher as the central management plane. The role supports key ITM and project initiatives (CUR3580, Artemis P3, AI Felix, SANDI 2, JICO), addressing capacity constraints and enabling the onboarding of new projects.
Responsibilities:
Design, build, and evolve highly scalable, secure, and resilient Kubernetes platforms utilizing SUSE Rancher as the central management plane.
Handle the end-to-end lifecycle of RKE/RKE2 clusters (provisioning, scaling, patching and upgrading) across hybrid, multi-cloud or bare-metal environments with minimal to zero downtime.
Develop and maintain automated CI/CD pipelines and implement GitOps workflows (using tools like Gitlab, ArgoCD or Rancher Fleet) to streamline continuous application delivery.
Monitor platform health using tools like Prometheus and Grafana, respond to system alerts and troubleshoot complex containerized workload issues.
Enforce security policy's, including network policies, Rancher RBAC configurations and automated container image scanning.
Proactively monitor resource utilization (CPU, memory, persistent storage) to right-size clusters, scale nodes dynamically and optimize cloud or on-prem infrastructure.
Create and maintain technical documentation, including runbooks and architecture diagrams
Essential Qualifications & Experience:
Minimum of 4 years of dedicated, hands-on experience in DevOps or Platform Engineering roles.
Over 4 years of sustained experience designing, deploying and maintaining highly available production Kubernetes clusters, heavily utilizing the SUSE Rancher ecosystem.
A solid track record of leading complex migrations, moving legacy virtualized applications into containerized, microservice-based architectures.
Expertly provision, secure, and lifecycle-manage large-scale, multi-cluster Kubernetes environments using SUSE Rancher UI/API and Rancher Manager.
Write modular, reusable code to automate infrastructure provisioning across hybrid environments using Ansible.
Architect and maintain sophisticated, zero-downtime deployment pipelines using GitOps principles with Gitlab, ArgoCD or Rancher Fleet.
Author/manage advanced Helm charts and manage complex stateful and stateless deployments.
Engineer comprehensive monitoring and logging stacks (Prometheus, Grafana, ELK) to establish SLIs/SLOs and configure automated remediation. Experience with Rancher Observability is a plus.
Develop robust automation scripts in Python or Bash.
Rapidly diagnose and resolve high-severity incidents across the entire stack, from kernel-level container issues to complex network routing failures.
Comprehensive understanding of the underlying architectures of RKE, RKE2, K8S /K3s and knowing exactly when to apply each distribution.
Knowledge of zero-trust models, Rancher RBAC, Pod Security Admissions, container scanning and enforcing governance.
CNIs tools (Calico, Cilium, Canal) deep understanding of Software Defined Network and Container Networking, network policies and managing complex Ingress and Service Mesh (Istio) architectures.
Expert knowledge of Container Storage Interfaces (CSI), disaster recovery planning and managing highly available storage solutions like SUSE Longhorn.
Extensive practical experience operating distributed Kubernetes environments across a mix of bare-metal, on-premises hypervisors (vSphere, Nutanix), and public clouds (AWS, Azure, GCP).
Experience acting as a technical escalation point, guiding junior team members, and establishing DevOps best practices across development teams.
If you've read the description and feel this role is a great match, we'd love to hear from you! Click "Apply for this job" to be directed to a brief questionnaire. It should only take a few moments to complete, and we'll be in touch promptly if your experience aligns with our needs.
- Department
- Infrastructure & Platform Support
- Locations
- Braine-l'Alleud