Observability Team Lead - Cloud Engineering

monday.com

Software Engineering
Tel Aviv-Yafo, Israel
Posted on May 9, 2025

  • R&D
  • Tel-Aviv, Israel
  • Management
  • Full-time

Description

We are monday.com, a global software company transforming how businesses run. Our product suite can adapt to the needs of diverse industries and use cases within one powerful platform, empowering ~245,000 customers worldwide to reimagine how work gets done, drive greater efficiency, and scale like never before.

With over 2,500 employees across the globe, we grow by prioritizing transparency and knowledge sharing. We care about the impact you make, not the hours you clock, so we encourage initiative, ownership, and fresh thinking. We back our people with flexible work, wellness and mental health support, and a work environment built on collaboration.

We're looking for an Observability Team Lead to join our Cloud Engineering Group. monday.com runs hundreds of microservices across multiple regions and a fast-growing cell-based architecture. Clear, cost-effective observability is critical for keeping our SLAs tight, reducing blast radius, and enabling developers to move fast with confidence. The Observability team owns the entire signal stack (metrics, logs, traces, profiling, alerting, and more) so every engineering team can “build it and observe it” out of the box.

About The Role

  • Lead the design, implementation, and maintenance of our observability infrastructure, which handles billions of observability events (logs, traces, metrics) per day.
  • Set vision & roadmap - Define a company-wide observability plan that is aligned with our production strategy.
  • Own the platform end-to-end - Operate and evolve Datadog, Coralogix, OpenTelemetry, PagerDuty, and more, while driving initiatives for internal systems.
  • Enable R&D teams - Provide auto-discovered dashboards, golden-signal templates, and tooling so every service ships with standard monitoring from day one.
  • Champion best practices - Run internal workshops, publish monthly insights, and contribute to monday’s R&D.

Requirements

  • 5+ years building or operating large-scale observability / SRE platforms, including 3+ years in a leadership role.
  • Deep hands-on experience with Datadog (or similar), distributed tracing, log pipelines, and observability tooling.
  • Familiarity with Kubernetes and microservice architectures (cell-based or multi-cluster experience is a plus).
  • Excellent communication skills, with the ability to translate dashboards into stories that matter to engineers and executives alike.
  • Software engineering background, with familiarity in Go/Python/TypeScript and IaC (Terraform/CDK).
