Nvidia
Senior Network Engineer, System Verification
Company
Role
Senior Network Engineer, System Verification
Location
Israel
Job type
Full time
Posted
2 hours ago
Salary
Job description
NVIDIA seeks an exceptional Network Engineer to join the Vertical Verification Group. As a senior engineer, you will contribute to the Vertical Verification work, concentrating on validating and improving complex Ethernet and InfiniBand systems within NVIDIA’s Data Center and HPC environments.
You will play a key role in testing advanced networking features, building complex customer-like topologies, and ensuring NVIDIA products meet industry-leading benchmarks of performance, efficiency, and quality. You will collaborate closely with architects, software engineers, and hardware teams across NIC, HCA, switches, CPUs, and GPUs in a fast-paced and highly technical environment. This is an outstanding opportunity to join a highly skilled team and contribute significantly to groundbreaking technology!
What you’ll be doing:
- Review architecture, build, and requirements for new networking features across Ethernet and InfiniBand portfolios
- Compose and build complex testbed topologies that emulate customer environments
- Implement, run and optimize integration test plans including functional, regression, and performance testing
- Identify, reproduce, and debug issues; work closely with R&D to drive root cause analysis and resolution
- Collaborate with automation teams
- Analyze and optimize network performance, latency, and efficiency
- Provide clear status updates, reports, and insights on system quality and performance
What we need to see:
- B.Sc. in Computer Science, Electrical Engineering, or related field (or equivalent experience)
- 8+ years of experience in networking, system validation, or related engineering roles
- Strong hands-on experience with Linux-based systems
- Deep understanding of networking protocols (TCP/IP, UDP, Ethernet, VLANs, L2/L3)
- Experience with routing, switching, and modern data center network architectures
- Strong troubleshooting, debugging, and analytical skills in distributed environments
- Experience with test methodologies (functional, regression, performance, scale)
- Scripting or programming experience (Python, Bash, etc.)
- Independent, fast learner with strong ownership and communication skills
Ways to stand out from the crowd:
- Experience with RDMA technologies (RoCE / InfiniBand)
- Knowledge of congestion control algorithms and performance tuning
- Familiarity with AI workloads and their networking requirements
- Experience with HPC environments and benchmarking tools
- Background with virtualization or container technologies (Kubernetes, KVM, etc.)