JOIN THE TEAM BRINGING ADVANCED AUTONOMY TO THE BUILT WORLD

At Bedrock, we’re moving AI out of the lab and into the real world. Our team is composed of industry veterans who helped launch Waymo, scaled Segment to a $3.2B acquisition, and grew Uber Freight to $5B in revenue. Today, we’re deploying autonomous systems on heavy construction machinery across the country, accelerating project schedules of billion-dollar infrastructure projects and improving safety on job sites. Backed by $350M in funding, we’re working quickly to close the gap between America's surging demand for housing, data centers, manufacturing hubs, and the construction industry's growing labor shortage.

This is where algorithms meet steel-toed boots. You’ll collaborate with construction veterans and world-class engineers to solve physical-world problems that simulations can’t touch. If you're ready to apply cutting-edge technology to solve meaningful problems alongside a talented team—we'd love to have you join us.

THE ROLE

You'll own Bedrock's agentic AI platform: the infrastructure, frameworks, and tooling that power AI agents across the company. You'll also build and ship the first wave of production agents: from agents that help engineers understand our autonomous systems, to Field Triage agents that accelerate debugging of real-world issues, to workflow automation agents that keep our distributed team aligned.

This is a high-impact, high-autonomy role at the intersection of AI/LLM engineering, cloud infrastructure, and developer tooling. Part of the Cloud Platform team, you will work closely with the Autonomy team, and the Product/Ops teams to make agents a core part of how Bedrock operates.

WHAT YOU'LL DO

Build the agentic platform that enables Bedrock engineers to create, deploy, and monitor AI agents on AWS. This includes agent orchestration, tool/MCP server integration, evaluation and observability, and production deployment pipelines.
Ship the first agents grounded in Bedrock specificities. These include a Q&A agent for understanding system behaviors, a Field Triage agent for mapping real-world issues to spec nodes, and an Annotator agent that improves data labeling quality.
Build workflow automation agents for internal productivity—Slack thread summarization, Linear ticket management, cross-tool orchestration—that make every team at Bedrock faster.
Design and implement MCP servers that expose Bedrock's internal systems as tool interfaces for agents.
Build evaluation and observability infrastructure for agents—tracing, cost tracking, quality metrics, and automated regression testing to ensure agents are reliable and improving.
Iterate on agent UX patterns beyond simple chat—building interfaces that surface agent outputs in the right context (Slack, dashboards, CLI, pull requests) and enable human-in-the-loop workflows.

Also, you get to drive 100,000 lb excavators.

WHAT WE'RE LOOKING FOR

4+ years of professional experience in software engineering, with demonstrated ownership of production systems.
Strong Python skills and comfort with async patterns, API design, and cloud-native development on AWS.
Hands-on experience building LLM-powered applications including agents, tool use, RAG, prompt engineering, and evaluation. You've shipped something real with LLMs, not just experimented.
Systems thinking. You can design an agent platform that's modular, observable, and maintainable, not just a collection of scripts.
Comfort working across the stack. You'll interface with cloud infrastructure, data systems, frontend teams, and end users and you're energized by that breadth.
Strong written communication. Agents are only as good as their grounding context. You'll help shape how we encode domain knowledge for AI consumption.

PREFERRED QUALIFICATIONS

Experience with agent frameworks such as AWS Strands, LangGraph, CrewAI, Claude Agent SDK, or similar orchestration tools.
Familiarity with MCP (Model Context Protocol) or similar tool-use / function-calling patterns for LLMs.
Experience building developer platforms or internal tooling that other engineers use daily.
Experience with observability and evaluation frameworks for AI/ML systems (tracing, cost analysis, quality metrics).
Experience in an early-stage startup environment designing, building, and launching new products or major features from scratch.

Our roles are often flexible. If you don't fit all the criteria, or are in another location (especially one where we have an office like SF or NY) please apply anyway! We'd love to consider you.

Agentic Platforms & Agents Engineer

Job description

Explore more

Career resources

Similar jobs

Technical Program Manager - Cloud Infra & Data

Senior Director, Partner Accounts - Snowflake

Scientist, Immunology

Manager, Accounting

Job Fair NYC

Data Analyst – GTM & Customer Success