We have 30 years of expertise in designing and building custom software systems. We provide software development services focusing on complex high-load applications, AI and BI solutions, and mobile apps.
We are looking for a Platform Operations / DevOps Engineer to join our platform team with a strong focus on operational excellence, maintenance, and execution.
This role is not about designing architecture from scratch; it is about running, improving, and scaling an existing platform reliably.
You will work with predefined designs and standards and will be responsible for their implementation, operational readiness, and long-term stability.
Working hours: This role requires availability between 6:00 AM and 2:00 PM Eastern Time (ET).
Core responsibilities:
- Runbooks and operational readiness work:
  - Write and maintain runbooks, upgrade checklists, rollback steps, and common troubleshooting guides.
  - Improve ticket triage and documentation, and reduce repeat incidents through follow-up fixes.
- Kubernetes maintenance and routine improvements:
  - Execute cluster upgrades using the agreed upgrade plan.
  - Maintain day-to-day cluster hygiene: node group updates, add-on updates, certificate rotation procedures, cluster tooling updates.
  - Implement GPU scheduling configuration and related operational tasks once the design is set.
- DevOps implementation work:
  - Implement CI/CD improvements from an agreed plan (pipeline reliability fixes, runner image updates, caching, build performance).
  - Build and maintain container images and delivery workflows following FTE-defined standards (hardening, scanning integration, reproducibility).
- Observability implementation:
  - Build dashboards and alerts from established guidelines.
  - Add instrumentation hooks and standard monitors to services and workloads.
  - Operationalize performance tuning loops by creating repeatable reports and dashboards.
Requirements:
- 5+ years of experience in a DevOps role.
- Hands-on experience with Kubernetes in production.
- Strong operational mindset (KTLO, maintenance, reliability).
- Experience with CI/CD pipelines and container-based delivery.
- Experience with observability tools (metrics, dashboards, alerts).
- Ability to work with existing designs, standards, and processes.
- Fluent English communication skills (B2 level or higher).
What we offer:
- Collaboration via a B2B contract with payments in EUR or USD, depending on your preference, or through a labor contract if you are based in Georgia, Serbia, or Kazakhstan.
- Flexible work schedule.
- Possibility to work remotely (excluding Russia and Belarus).
- Opportunities for professional growth.
- A company laptop to ensure a comfortable and efficient work setup.