Senior/Semi Senior DevOps and Observability Specialist
At Zyzygy, we focus on providing the best solutions, practices, and methodologies through technological innovation, with the goal of generating tangible and sustainable value for the clients who trust us.
We are looking for a DevOps and Observability Specialist with a strong focus on tools such as Prometheus (PromQL), Grafana, Chronosphere, Datadog, Splunk, Dynatrace, and New Relic to develop high-impact, innovative projects, collaborating with major clients and startups.
Your responsibilities will be:
- Architect and implement scalable observability solutions across multiple tools: Design and build end-to-end monitoring strategies, from data collection to visualization and alerting, leveraging the full potential of each platform.
- Master PromQL and related observability tools: Gain deep expertise in these tools to analyze metrics, logs, and distributed traces, while integrating with other relevant ecosystem tools.
- Correlate data for actionable insights: Construct intelligent dashboards and alerts to proactively identify performance bottlenecks, anomalies, and potential issues.
- Optimize user experience: Leverage observability data to enhance system availability, latency, and overall performance.
- Collaborate with engineering teams: Partner with development teams to embed observability best practices and a data-driven mindset into the software development lifecycle.
- Customer-facing experience: Advise customers in defining their observability needs and guide them in adopting best practices.
We offer:
- Be part of a growing company that values teamwork, innovation, and diversity.
- Opportunities for personal and professional growth, with challenging projects and continuous learning.
- Work flexibility, with a mostly remote model and in-person workspaces to foster collaboration and team building.
We value profiles with:
- Proven experience in observability: Demonstrated ability to design, implement, and scale monitoring solutions in complex and distributed systems. Hands-on experience is essential.
- PromQL expertise: Practical experience writing and optimizing PromQL queries for dashboards, alerting, and troubleshooting, and the ability to build reliable, actionable observability workflows using Prometheus or Prometheus-compatible backends.
- Technical proficiency: Strong understanding of Kubernetes, Docker, and cloud-native environments.
Nice to have:
- Strong data analysis skills: Ability to extract meaningful insights from large datasets and translate them into actionable recommendations.
- Programming skills.
- Problem-solving mindset: A proactive ability to identify root causes of complex issues and propose effective, scalable solutions.
