Platform Engineer
Are you the Kubernetes and Platform Expert that we need ?
We're looking for an experienced Platform Engineer to join the Consumer Centricity Platform Operations team. In this role, you'll be responsible for the operational stability, security, and reliability of a mission-critical Kubernetes platform powering customer-facing applications and APIs.
This is not a traditional DevOps or SRE position. You'll act as the operational owner of the platform, ensuring all environments (Development, Test, Acceptance, and Production) remain secure, available, and performant while supporting continuous improvements and operational excellence.
If you enjoy solving complex infrastructure challenges, working with cloud-native technologies, and maintaining highly available production platforms, we'd love to hear from you.
Role
Kubernetes Platform Operations
- Operate and maintain a production-grade Kubernetes platform.
- Manage platform components including ingress controllers, service discovery, and workload identity.
- Troubleshoot cluster issues involving networking, scheduling, DNS, storage, and pod lifecycle.
- Execute controlled platform upgrades and maintenance with rollback strategies.
Reliability & Incident Management
- Participate in a 24/7 on-call rotation for critical platform services.
- Lead incident response, troubleshooting, and recovery activities.
- Perform Root Cause Analyses (RCA) and implement preventive improvements.
- Maintain operational runbooks and documentation.
Observability & Monitoring
- Operate and improve the platform observability stack.
- Monitor metrics, logs, traces, and alerting.
- Improve monitoring quality while reducing alert fatigue.
- Support production troubleshooting through telemetry analysis.
Platform Change & Release Management
- Plan, execute, and document platform changes.
- Ensure changes follow structured change management processes.
- Communicate effectively with stakeholders during releases and maintenance windows.
Security & Compliance
- Manage RBAC, secrets, certificates, and network security controls.
- Apply platform patches and security updates.
- Support vulnerability remediation and audit activities.
Automation & Platform Improvements
- Automate operational processes using Infrastructure as Code and GitOps principles.
- Standardize platform operations and reduce manual effort.
- Contribute to continuous platform improvements.
Profile
Kubernetes Expertise
- Extensive production experience operating Kubernetes environments.
- Strong knowledge of:
- Multi-cluster architectures
- RBAC and security best practices
- Network Policies
- Stateful workloads
- Storage (CSI, PV/PVC)
- Autoscaling (HPA/VPA)
- Admission Controllers
- Performance tuning and troubleshooting
GitOps & CI/CD
Experience with:
- ArgoCD
- GitOps deployment models
- Harness.io pipelines
- Helm
- Kustomize
- Git-based release workflows
Containers & Artifact Management
Knowledge of:
- Docker
- Harbor
- JFrog Artifactory
- Secure image management and vulnerability scanning
Security
Experience with:
- HashiCorp Vault
- Secrets management
- TLS & certificate lifecycle
- Supply chain security
- Kubernetes security best practices
Observability
Hands-on experience with:
- OpenTelemetry
- Prometheus or VictoriaMetrics
- Grafana
- Loki
- Tempo
- SLI/SLO monitoring concepts
Networking
Strong understanding of:
- TCP/IP
- DNS
- Kubernetes networking
- Ingress
- Reverse proxies
- TLS/mTLS
- API routing
- Network Policies
Automation
- Advanced Bash scripting
- Automation mindset
- Infrastructure as Code
Nice to Have
Experience with:
- Kong API Gateway
- Redis
- PostgreSQL
- MongoDB
- Kargo
- Kubernetes database deployment patterns
Offer
Long-term Freelance Contract
3 jours de télétravail