Yesenergy
Site Reliability Engineer
Company
Role
Site Reliability Engineer
Location
Job type
-
Found on Mokaru
14 hours ago
Salary
Job description
Join the Market Leader in Electric Power Data and Analytics Solutions
The electrical grid is the largest and most complicated machine ever built. Yes Energy’s industry-leading electric power trading analytics software provides real-time visibility into the massive amount of data generated by the North American electrical grid daily. Our unique and innovative view of the data informs real-time trading decisions and mid-to-long-term investment decisions that keep utility prices low, support the energy transition, and keep the grid running. It’s both challenging work and work with a purpose.
Be a part of our successful, growing business during international transformation.
Position Summary
We are hiring a Site Reliability Engineer to serve as a senior, hands-on reliability leader across all product lines. This role sits within the Systems Administration team, part of the Product Technology Services (PTS) group, and is focused squarely on operational excellence: incident response, systems availability, monitoring and alerting, release support, and reliability improvements across our production services.
During your working hours, you will be expected to take ownership of active incidents: respond to pages, coordinate response across engineering teams, diagnose production issues, restore service quickly, and drive clear communication through resolution. Incident response and operational readiness are central to the role, not occasional side responsibilities.
This is a senior individual contributor and team-lead role responsible for setting SRE standards, mentoring additional SREs as the function grows, unblocking engineering teams, and improving the systems, pipelines, and practices that keep Yes Energy products reliable at scale.
Position Details
- Salary Range: Net 14.000 – 18.000 RON/month
- Location: Hybrid (Bucharest, Romania)
- Schedule: Full-time; 2-3 days in the office
- Reporting to: Manager of Systems Administration
Primary Responsibilities
- Respond to pages across all product lines and lead incident response from initial detection through mitigation and recovery, while driving root-cause remediation that reduces repeat incidents, prevents similar future alerts, and improves overall service reliability.
- Serve as the incident owner when online, coordinating cross-functional responders and making clear decisions under pressure to restore service quickly.
- Build and improve monitoring, alerting, dashboards, service-level objectives (SLOs), runbooks, and escalation processes so issues are detected quickly and responders have useful context.
- Operate and troubleshoot Linux and Windows systems across AWS, Azure, OCI, and related hybrid or multi-cloud environments.
- Support production web applications, containers, and Kubernetes workloads, with a focus on reliability, scalability, and availability.
- Work with load balancers, forward and reverse proxies, DNS, networking, firewalls, security groups, and traffic-routing patterns to diagnose and resolve availability and performance issues.
- Unblock engineering teams by diagnosing and fixing Jenkins jobs, CI/CD pipelines, deployment failures, environment issues, and release blockers.
- Partner with Engineering, Security, DBA, and Product Technology Services teams to improve operational readiness, production support models, and reliability practices.
- Mentor SRE and Systems team members, establish practical standards, and help lead the growth of a stronger site reliability function.
Minimum Qualifications
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field; or equivalent practical experience.
- Minimum of five years of experience supporting mission-critical production infrastructure, SaaS platforms, web applications, or service-oriented systems.
- Deep hands-on AWS experience, including production operations for compute, networking, IAM, storage, load balancing, monitoring, and troubleshooting; greater depth is strongly valued.
- Proven incident management experience, including responding to pages, leading high-severity incidents, coordinating responders, writing postmortems and RCA, and driving corrective actions.
- Experience with containers and Kubernetes, monitoring and alerting systems, CI/CD tooling such as Jenkins and Bitbucket, and operational automation or scripting.
- Strong communicator and collaborator who can provide technical leadership, delegate effectively, mentor engineers, and unblock teams during high-pressure operational work.
- Working knowledge of scripting and automation tools such as Python, PowerShell, Bash, Terraform, CloudFormation, Azure CLI, or AWS CLI.
- Strong Linux and Windows systems administration and troubleshooting experience in production environments.
Key Competencies & Preferred Qualifications
- Problem Solving: Frames and solves complex, ambiguous production issues; surfaces cross-system or systemic failure modes.
- Systems Thinking: Takes a broad, holistic view of how software, infrastructure, and real-time data pipelines interact.
- System Design & Maintenance: Expert ability to build and sustain scalable infrastructure while enforcing strict reliability standards.
- Security & Compliance: Ensures system configurations adhere to robust access controls, encryption, and data compliance policies.
- Effective Communication: Translates high-pressure incident details into clear, calm, and actionable updates for stakeholders.
- Prioritization & Time Management: Balances fast-moving, high-urgency incident response with long-term strategic reliability projects.
- Team Focus: Fosters an inclusive environment built on a blameless postmortem culture and collaborative knowledge sharing.
At Yes Energy, we value connecting directly with candidates. We kindly ask that third-party recruiters and agencies not submit resumes, as we are not open to external recruiting partnerships.
ABOUT YES ENERGY
Overview
Yes Energy delivers real-time market data and electric power trading decision solutions. Over 1,000 market participants use Yes Energy solutions daily. The business is a leader in all aspects of information content collection and management, developing and delivering data and market analytics solutions. Since its inception in 2008, Yes Energy has become a trusted and respected supplier of innovative and reliable solutions focused on the needs of power market analysts, traders, and trade managers. Yes Energy has a team of over 350 amazing professionals in Boulder, CO (HQ); Boston, MA; Chicago, IL; Glendora, CA; Richmond, VA; London, United Kingdom; Auckland, New Zealand; and Bucharest, Romania.
Culture
Yes Energy has been named one of the Best Places to Work in Colorado, and we have the culture to prove it. At Yes Energy, we care about saying “Yes” to customers. We like to listen, learn, and develop our solutions in line with their needs. We think about customers as business partners, and when we help them be more successful … we are more successful, too.
Around the office, our culture is driven by some pretty fundamental values that we’re proud of:
- We love innovation and solving tough challenges;
- We are “high standards people” who combine passion and pride with hard work and rewards of all kinds-- in an ethic that is consistent across the company.
- We’re team-focused with a flat hierarchy-- we work in small teams on well-defined projects that directly impact the success of the business;
- We play to the strengths and experience of each person while each of us also works along a continuum of roles adjacent to our focus area. This presents the challenge of maintaining a broad set of skills as well as an opportunity to learn and contribute in many ways.
- We are constantly growing. Professional development happens every day and every year.
Compensation and Benefits
We offer highly competitive salaries and real bonuses that are achievable and that you can impact. Our benefits package includes private medical insurance, wellness/gym benefits, flexible vacation, and flexible work schedules. Yes Energy encourages and funds investment in both formal and informal professional development.
Yes Energy provides equal employment opportunities to all employees and applicants without regard to race, color, religion, sex, national origin, age, disability, genetics, or any other protected status. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, compensation, and training.


