CU(E)BES

Loading...

CU(E)BES

November 15, 2025

DevOps Engineer

About the AI startup

At this AI startup, we believe in the transformative power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to seamlessly integrate into daily working life, making AI accessible to everyone.

We democratize AI by offering high-performance, optimized, open-source, and cutting-edge models, products, and solutions. Our comprehensive AI platform caters to enterprise needs, whether on-premises or in cloud environments. Our offerings include Le Chat, the AI assistant for life and work.

We are a dynamic and collaborative team passionate about AI and its potential to revolutionize society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed across France, the USA, the UK, Germany, and Singapore. We are creative, low-ego, and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact.

Role Summary

We are building one of Europe’s largest AI infrastructure offerings, providing our customers with a private and integrated stack in various form factors, from bare-metal servers to fully-managed PaaS. As a DevOps Engineer, you will join a rapidly growing team to help build, scale, and automate our computing management stack. Your primary responsibility will be to build fault-tolerant and reliable infrastructure to support both our internal processes and customer platforms.

Location: France (🇫🇷) and the UK (🇬🇧) are the primary locations, but remote work is also possible under certain conditions (see below).

What you will do

As a Software/DevOps Engineer in our Compute team, your main responsibility will be to engineer robust and dependable infrastructure that supports both our internal operations and customer-facing platforms.

Key Responsibilities:

Design, build, and operate a scalable Kubernetes-based platform to host large-scale AI and HPC workloads, ensuring high performance, reliability, and security.

Own the entire lifecycle of cluster management, from initial setup and provisioning to global operations, by integrating and developing essential software components—including automation, monitoring, and orchestration tools.

Drive infrastructure innovation by designing workflows, tools (scripts, APIs, dashboards), and CI/CD pipelines to enhance system reliability, availability, and observability.

Champion a zero-trust security model by strengthening IAM, networking (VPC), and access controls to protect the platform.

Develop user-centric features that simplify operations for both sysadmins and end customers, reducing friction in daily workflows.

Lead incident resolution with thorough root-cause analysis to prevent recurrence and improve system resilience.

About you

• Proven experience in an Infrastructure Engineering role (Software Engineer, DevOps, Site Reliability Engineer, or Platform Engineer)

• Strong proficiency in software development (preferably Golang) and knowledge of software development best practices

• Deep understanding of Kubernetes internals and hands-on experience with containerization and orchestration tools (Docker, Kubernetes, Openstack, etc.)

• Familiarity with infrastructure-as-code tools like Terraform or CloudFormation

• Knowledge of monitoring, logging, alerting, and observability tools (Prometheus, Grafana, ELK, Datadog, etc.)

• Exposure to highly available distributed systems and site reliability issues in critical environments (issue root cause analysis, in-production troubleshooting, on-call rotations, etc.)

• Experience working against reliability KPIs (observability, alerting, SLAs)

• Excellent problem-solving and communication skills

• Self-motivation and ability to thrive in a fast-paced startup environment

Now, it would be ideal if you also had:

• Experience with HPC workload managers (Slurm) and distributed storage systems (Lustre, Ceph)

• Demonstrated history of contributing to open-source projects (e.g., code, documentation, bug fixes, feature development, or community support)

Location & Remote

This position offers both in-office and remote work options.

This role is primarily based in one of our European offices, either Paris, France, or London, UK. We prioritize candidates who either reside there or are open to relocating. We firmly believe in the importance of in-person collaboration to cultivate strong relationships and ensure seamless communication within our team.

In certain specific situations, we may also consider remote candidates based in one of the countries listed in this job posting, currently France, the UK, Germany, Belgium, the Netherlands, Spain, and Italy. However, we kindly request that all new hires visit our Paris HQ office for the following reasons:

– During the first week of their onboarding, we will provide accommodation and travel expenses.

– At least two days per month, they will be required to visit our office.

What we offer:

– Competitive salary and equity

– Health insurance

– Transportation allowance

– Sport allowance

– Meal vouchers

– Private pension plan

– Generous parental leave policy

– Visa sponsorship

We may utilize artificial intelligence (AI) tools to assist in various aspects of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools support our recruitment team but do not replace human judgment. Ultimately, final hiring decisions are made by humans. If you have any questions or concerns about how your data is processed, please don’t hesitate to contact us.

To apply for this job email your details to ufpuq2zjvsnh@n3plcpnl0241.prod.ams3.secureserver.net

Archives

No archives to show.

Categories

  • No categories