Orion Innovation (Naukri)
Observability Lead
Role: Observability Lead
Job type: -
Posted: 23 hours ago
Job description
Summary: The Dynatrace Observability Lead/Architect is responsible for defining and driving the end-to-end observability strategy, architecture, integration, and implementation. This role ensures alignment with SRE and DevOps practices, enables AIOps capabilities, and leads the onboarding, optimization, and continuous improvement of observability across the organization.
Responsibilities:
- Define observability strategy, standards, and architecture (metrics, logs, traces, APM)
- Establish SLIs, SLOs, error budgets, and observability readiness standards
- Define tagging, metadata, and telemetry standards (naming, formats, traceability)
- Design integration architecture across CI/CD, observability, and ITSM platforms
- Define and enforce governance model, compliance, and audit requirements
- Establish KPIs and success metrics aligned with business and reliability goals
- Consolidate findings from assessments and identify risks and improvement areas
- Develop an observability maturity assessment (DevOps/SRE alignment)
- Define implementation roadmap (quick wins vs long-term initiatives)
- Drive stakeholder workshops and validations
- Lead integration of CI/CD pipelines with observability (logs, metrics, traces, deployment markers)
- Implement tagging strategy (app, env, version, commit ID)
- Integrate with incident, change, and release management systems
- Define and implement service-level KPIs and golden signals
- Build operational and business dashboards
- Configure event correlation and alert tuning to reduce noise
- Track reliability metrics (MTTR, SLOs, change failure rate)
- Define and implement AIOps architecture and use cases
- Drive AIOps (anomaly detection, RCA, event correlation, predictive insights)
- Implement automation, runbooks, and self-healing workflows
- Lead application onboarding and ensure standards adoption
- Act as primary stakeholder interface and drive best practices across teams
Required Skills & Qualifications:
- Proven hands-on experience with Dynatrace SaaS and Managed environments
- Strong knowledge of Azure, AWS, and GCP cloud infrastructures
- Proficiency in infrastructure monitoring (VMs, containers), application performance monitoring (APM), and log analytics
- Familiarity with ActiveGate, OneAgent, network zones, extension deployment, and SNMP device monitoring
- Strong experience in digital experience monitoring (DEM)
- Strong analytical skills and understanding of AIOps practices
- Excellent communication and documentation skills
- Experience in creating dashboards and defining alert strategies
- Strong experience with Dynatrace (APM, Infra, Logs, DEM, AIOps)
- Deep understanding of SRE, DevOps, and Observability principles
- Experience with CI/CD tools and cloud platforms (Azure/AWS/GCP)
- Knowledge of ITSM integrations and incident management workflows
- Experience in automation, scripting, and AIOps implementations
Preferred Qualifications:
- Dynatrace certification (Associate or Professional)
- ITIL or Service Management experience
- Exposure to enterprise security and compliance standards
- Experience working in multi-tenant or large enterprise environments
Experience Level: 10 to 13 years


