Lead MLOps Engineer

2 дней назад


Львов, Львовская область, Украина Capgemini Полный рабочий день

Lviv, Odesa, Rivne, HITECHPARKODESA, Kyiv

Lead MLOps Engineer (Ukraine)

At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world's most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and engineering services across all industries. Join us for a career full of opportunities. Where you can make a difference. Where no two days are the same.

Your Client

Our client is at the forefront of revolutionizing AI computing by re-engineering infrastructure at the system level. Its architecture, combined with sophisticated software intelligence, abstraction, and an orchestration layer, enables developers to leverage a diverse array of compute resources, achieving efficient and reliable computing at a fraction of the cost. Founded by industry veterans from Nvidia, Apple, Tesla, Intel, and Zoox, it's shaping the future of AI.

As the Lead/Staff AI Runtime Engineer, you'll play a pivotal role in the design, development, and optimization of the core runtime infrastructure powering distributed training and deployment of large AI models. This is a hands-on leadership role - ideal for a systems-minded software engineer who thrives at the intersection of AI workloads, runtimes, and performance-critical infrastructure.

Your Role

  • Own the core runtime architecture supporting AI training and inference at scale.
  • Design resilient and elastic runtime features (for example, dynamic node scaling and job recovery) within the custom PyTorch-based stack.
  • Optimize distributed training reliability, orchestration, and job-level fault tolerance.
  • Profile and enhance low-level system performance across training and inference pipelines.
  • Improve packaging, deployment, and integration of customer models in production environments.
  • Design and maintain libraries and services that support the full model lifecycle: training, checkpointing, fault recovery, packaging, and deployment.
  • Implement observability hooks, diagnostics, and resilience mechanisms for deep-learning workloads.
  • Champion best practices in CI/CD, testing, and software quality across the AI Runtime stack.
  • Work cross-functionally with Research, Infrastructure, and Product teams to align runtime development with customer and platform needs.
  • Guide technical discussions, mentor junior engineers, and help scale the AI Runtime team's capabilities.

Your Profile

  • PyTorch, TensorFlow, JAX (Advanced)
  • Python, C++ (Go/Rust optional)
  • Distributed training frameworks
  • Multi-GPU, multi-node optimization
  • Container orchestration (Kubernetes, Docker)
  • CI/CD, fault recovery, job scheduling
  • TorchElastic, Ray, custom orchestrators
  • Runtime architecture and systems performance tuning

Nice to Have

  • Contributions to PyTorch internals or open-source deep learning infrastructure projects.
  • Intel OpenVINO
  • Familiarity with LLM training pipelines, checkpointing, or elastic training orchestration.
  • Experience with Kubernetes, Ray, TorchElastic, or custom AI job orchestrators.
  • Background in systems research, compilers, or runtime architecture for high-performance computing (HPC) or machine learning.
  • Start-up experience.
  • Ability to travel to the EU.

What You Will Love About Working Here

  • We care about all our employees and want them to feel as comfortable as possible. That's why we offer them health insurance from the first days, regardless of the probationary period.
  • The gift from the company - Christmas holidays from 25 December to 31 December.
  • Сooperation with Superhumans center and Veteran HUB. Capgemini Engineering has supported the launch of psychological rehabilitation department of Superhumans. Our team also donated over UAH prosthetics for three Ukrainian defenders. Currently, we support psychological counseling provided by the Veteran Hub, and we have implemented an internal policy making the company friendly to military and veterans with the assistance of the Hub.

Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.

Ref. code

360473-en_GB

Posted on

13 Nov 2025

Experience level

Experienced Professionals

Contract type

Permanent

Location

Lviv, Odesa, Rivne, HITECHPARKODESA, Kyiv

Business unit

Engineering and RandD Services

Brand

Capgemini Engineering

Professional communities

Software Engineering


  • Tech Lead

    2 недель назад


    Львов, Львовская область, Украина Shae Group Полный рабочий день 1 500 $ - 2 500 $

    Lead architecture for digital twin + clinical AI platforms — senior applied AI technical leadership roleTitle: Tech Lead (Platform Architect + Applied AI Engineering Lead) — Digital Twin & Clinical AI — Remote (Contractor)Location: Remote — Offshore-friendly. Ideal overlap with Americas/Europe time zones. Preferred regions include: Philippines,...

  • Tech Lead

    2 недель назад


    Львов, Львовская область, Украина Shae Group Полный рабочий день 20 000 $ - 30 000 $ в год

    Lead architecture for digital twin + clinical AI platforms — senior applied AI technical leadership roleTitle: Tech Lead (Platform Architect + Applied AI Engineering Lead) — Digital Twin & Clinical AI — Remote (Contractor)Location: Remote — Offshore-friendly. Ideal overlap with Americas/Europe time zones. Preferred regions include: Philippines,...

  • Senior Software Engineer

    4 дней назад


    Львов, Львовская область, Украина Robots & Pencils Полный рабочий день

    Location:Lviv, Ukraine (Remote-Friendly)Robots & Pencils is seeking a Senior Software Engineer for our Conversational AI & Agents practice. You'll build agentic chat experiences that enable effective customer interactions.As a senior contributor, you'll own full-stack development from design to release. You'll work with cross-functional teams to define...

  • Junior Frontend Engineer

    4 дней назад


    Львов, Львовская область, Украина DataRobot Полный рабочий день

    Job Description:DataRobot delivers AI that maximizes impact and minimizes business risk. Our platform and applications integrate into core business processes so teams can develop, deliver, and govern AI at scale. DataRobot empowers practitioners to deliver predictive and generative AI, and enables leaders to secure their AI assets. Organizations worldwide...

  • Lead ML/AI Engineer

    6 дней назад


    Львов, Львовская область, Украина AIDA Recruitment Полный рабочий день

    For our client, we are looking for a Senior AI/ML Engineerto join an AI project and drive the design, development, and optimization of cutting-edge retrieval-augmented generation (RAG) solutions.This role is ideal for a highly skilled engineer passionate about AI/ML systems, distributed architectures, and vector search technologies. You will play a key role...

  • Lead Machine Learning Engineer

    1 неделя назад


    Львов, Львовская область, Украина Just Answer Полный рабочий день 120 000 ₴ - 180 000 ₴ в год

    About UsJustAnswer is the leading AI + Human professional services platform, on a mission to revolutionize how people access expert help. Since 2003, we've connected millions of customers across 196 countries with verified professionals in real time—anytime, anywhere. With a powerful combination of human expertise and cutting-edge AI, we're transforming...

  • Data Engineer

    2 дней назад


    Львов, Львовская область, Украина Capgemini Полный рабочий день

    Lviv, Odesa, Rivne, HITECHPARKODESA, KyivData Engineer (AWS, Ukraine)At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world's most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology...

  • senior python engineer

    2 недель назад


    Львов, Львовская область, Украина UKEESS Software House Полный рабочий день 80 000 ₴ - 150 000 ₴ в год

    DESCRIPTIONThe UKEESS Software House team is looking for a Senior Python Engineer to join our team for a full-time position (remotely in Ukraine or in Lviv's office).If you are looking for a chance to switch technologies for Machine Learning, this is the opportunity for you. We will assist you during your journey from Python Development to Machine Learning...

  • Middle/Senior Engineer

    4 дней назад


    Львов, Львовская область, Украина Binariks Полный рабочий день

    Binariksis looking for a highly motivated and skilledSenior Engineerto join our team.Our project is a healthcare data platform for Medicare patient management. It addresses the problem of fragmented medical data across healthcare facilities by consolidating information into a complete, readable patient chart for providers and staff. Ultimately, it enables...

  • Senior C Engineer

    1 неделя назад


    Львов, Львовская область, Украина Capgemini Engineering Полный рабочий день 60 000 ₴ - 120 000 ₴ в год

    At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world's most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and...