Your Tasks
- You design and develop components of the Observability Suite as sovereign cloud services, focusing on the integration of the Grafana LGTM stack (Loki, Grafana, Tempo, Mimir).
- You extend our system architecture by identifying and implementing relevant open-source projects (e.g., OpenTelemetry, Prometheus) to capture logs, metrics, and traces.
- You develop Kubernetes operators to automate the lifecycle of cloud services.
- You develop and maintain REST APIs that allow our customers to programmatically access and control their monitoring and alerting lifecycles.
- You take ownership in an agile "You Build It - You Run It" environment, acting as a bridge between development and SRE.
- You conduct complex root cause analyses using logs, traces, and metrics (we eat our own dog food), identifying bottlenecks and issues and implementing sustainable fixes.
Your Profile
- You have a deep enthusiasm for software engineering, cloud-native observability, and SRE.
- You actively own the entire software development lifecycle, with Go and Kubernetes being your bread and butter; ideally, Kubernetes operators are part of your arsenal as well.
- You are experienced with, or eager to master, the Grafana ecosystem.
- You understand the observability needs of services in today's cloud environments and know how to instrument them effectively.
- You enjoy discovering new technologies (e.g., OpenTelemetry) and are excited about sharing your knowledge with the community and the team.
- You don't just look at dashboards; you understand the underlying data structures and how to optimize them for high-cardinality environments.