Orioninnovationnaukri

Observability Lead

Role

Observability Lead

Posted

23 hours ago

Salary

Not disclosed by employer

Job description

Summary: The Dynatrace Observability Lead/Architect is responsible for defining and driving the end-to-end observability strategy, architecture, integration, and implementation. This role ensures alignment with SRE and DevOps practices, enables AIOps capabilities, and leads the onboarding, optimization, and continuous improvement of observability across the organization.

Responsibilities:

  • Define observability strategy, standards, and architecture (metrics, logs, traces, APM)
  • Establish SLIs, SLOs, error budgets, and observability readiness standards
  • Define tagging, metadata, and telemetry standards (naming, formats, traceability)
  • Design integration architecture across CI/CD, observability, and ITSM platforms
  • Define and enforce governance model, compliance, and audit requirements
  • Establish KPIs and success metrics aligned with business and reliability goals
  • Consolidate findings from assessments and identify risks and improvement areas
  • Develop observability maturity assessment (DevOps/SRE alignment)
  • Define implementation roadmap (quick wins vs long-term initiatives)
  • Drive stakeholder workshops and validations
  • Lead integration of CI/CD pipelines with observability (logs, metrics, traces, deployment markers)
  • Implement tagging strategy (app, env, version, commit ID)
  • Integrate with incident, change, and release management systems
  • Define and implement service-level KPIs and golden signals
  • Build operational and business dashboards
  • Configure event correlation and alert tuning to reduce noise
  • Track reliability metrics (MTTR, SLOs, change failure rate)
  • Define and implement AIOps architecture and use cases
  • Drive AIOps (anomaly detection, RCA, event correlation, predictive insights)
  • Implement automation, runbooks, and self-healing workflows
  • Lead application onboarding and ensure standards adoption
  • Act as primary stakeholder interface and drive best practices across teams

Required Skills & Qualifications:

  • Proven hands-on experience with Dynatrace SaaS and Managed environments
  • Strong knowledge of Azure, AWS, and GCP cloud infrastructures
  • Proficiency in infrastructure monitoring (VMs, containers), application performance monitoring (APM), and log analytics
  • Familiarity with ActiveGate, OneAgent, network zones, extension deployment, and SNMP device monitoring
  • Strong experience in digital experience monitoring (DEM)
  • Strong analytical skills and understanding of AIOps practices
  • Excellent communication and documentation skills
  • Experience in creating dashboards and defining alert strategies
  • Strong experience with Dynatrace (APM, Infra, Logs, DEM, AIOps)
  • Deep understanding of SRE, DevOps, and Observability principles
  • Experience with CI/CD tools and cloud platforms (Azure/AWS/GCP)
  • Knowledge of ITSM integrations and incident management workflows
  • Experience in automation, scripting, and AIOps implementations

Preferred Qualifications:

  • Dynatrace certification (Associate or Professional)
  • ITIL or Service Management experience
  • Exposure to enterprise security and compliance standards
  • Experience working in multi-tenant or large enterprise environments

Experience Level: 10 to 13 years
