Infra Engineer
groundcover
- Engineering
- Tel Aviv
- Senior
- Full-time
Description
groundcover is a fast-growing B2D company set to reinvent the way developers monitor their cloud-native applications and impact their organization’s scale. We give teams the insights they need to troubleshoot better and faster when trouble hits. We believe that even the most complex systems can have a flawless and amazing user journey.
Our product is built on a unique Bring Your Own Cloud (BYOC) architecture, challenging the traditional SaaS model and introducing complex infrastructure challenges that require innovative solutions. We’re looking for a highly skilled Infrastructure Engineer to join our team and own the scaling, management, and automation of our platform’s distributed environments. If you’re excited about building high-scale distributed systems and solving deep DevOps and infrastructure challenges, let’s talk.
What You’ll Do
- Own and scale infrastructure – Design, build, and optimize the backbone of our observability platform, ensuring seamless deployment across hundreds of distributed environments.
- Solve complex scalability challenges – Tackle unique problems in multi-cluster Kubernetes environments, multi-cloud setups, and high-ingestion observability pipelines.
- Manage data at scale – Build and optimize configurable data pipelines, ensuring efficient ingestion, storage, and querying of large volumes of observability data with resilience, consistency, and analytical capabilities.
- Automate everything – Develop infrastructure as code, improve CI/CD processes, and automate environment provisioning for reliability and efficiency.
- Enhance system reliability – Design robust monitoring, alerting, and self-healing mechanisms for a high-scale production environment.
- Collaborate cross-functionally – Work closely with backend engineers, product teams, and customers to design scalable, developer-friendly infrastructure.
- Adopt and implement cutting-edge technologies – Continuously evaluate and introduce new tools and frameworks to improve scalability, performance, and cost efficiency.
- Improve deployment efficiency – Optimize Helm charts, Kubernetes operators, and Terraform configurations to streamline environment creation and lifecycle management.
Requirements
- 5+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.
- Strong expertise in Kubernetes, Terraform, Helm, and cloud environments (AWS, GCP, or Azure).
- Experience with scalable observability stacks (e.g., ClickHouse, VictoriaMetrics, OpenTelemetry) is a huge plus.
- Deep understanding of distributed systems, networking, and containerized workloads.
- Proficiency in at least one programming language (Go, Python, or similar) for automation and tooling.
- Passion for building scalable, reliable, and efficient infrastructure.
- A problem-solving mindset with the ability to tackle complex technical challenges independently.