Principal Network Reliability Engineer

June 18, 2026

Job Description

Principal Network Reliability Engineer (IC4)

Company

Oracle Careers

Business Unit

Oracle Cloud Infrastructure (OCI)

Career Level

IC4 – Principal Engineer

Employment Type

Full-Time

Experience Required

Bachelor’s degree and 10+ years of network engineering experience
OR
Master’s degree and 8+ years of network engineering experience

About the Role

Oracle Cloud Infrastructure (OCI) is one of the world’s largest hyperscale cloud platforms, operating across more than 40 global regions and supporting mission-critical applications for enterprise customers worldwide.

As a Principal Network Reliability Engineer, you will be responsible for designing, building, automating, deploying, and operating highly scalable and reliable cloud networking systems that support hundreds of thousands of network devices and millions of servers across Oracle’s global infrastructure.

This role combines advanced networking expertise, software engineering, automation, observability, reliability engineering, and large-scale cloud operations.

You will play a critical role in driving operational excellence, reducing downtime, automating repetitive tasks, enhancing monitoring capabilities, and improving the overall reliability of OCI networking infrastructure.

Key Responsibilities
Network Reliability Engineering
Design, deploy, and operate large-scale cloud networking infrastructure.
Ensure maximum availability, scalability, and reliability of OCI network services.
Manage network operations across global cloud regions.
Network Architecture & Engineering
Participate in network architecture and solution design discussions.
Support large-scale data center and backbone network environments.
Contribute to network fabric design and optimization.
Automation & Software Development
Develop automation tools and frameworks to improve operational efficiency.
Build scripts and applications to automate routine networking tasks.
Integrate automation solutions into network operations workflows.
Serve as a subject matter expert for network automation projects.
Monitoring & Observability
Design and implement telemetry collection systems.
Develop monitoring dashboards and alerting frameworks.
Build network visibility tools to identify anomalies and performance issues.
Create data-driven solutions for proactive issue detection.
Incident Management & Reliability
Participate in operational on-call rotations.
Lead incident response and troubleshooting activities.
Provide break-fix support for critical network events.
Conduct root cause analysis (RCA) and implement preventive actions.
Collaboration & Leadership
Mentor junior engineers and support technical development.
Collaborate with project managers, automation teams, and operations teams.
Coordinate with networking vendors and quality assurance teams.
Influence technical roadmaps, priorities, and operational strategies.
Vendor & Firmware Management
Work with network equipment vendors on bug resolution.
Assist in qualification and testing of firmware upgrades.
Evaluate operating system releases and hardware improvements.
Required Technical Skills
Core Networking

Strong expertise in:

MPLS
BGP
OSPF
IS-IS
TCP/IP
IPv4
IPv6
DNS
DHCP
Preferred
VXLAN
EVPN
Network Engineering
Large-scale ISP environments
Cloud networking
Backbone networks
Data center networking
Clos network architectures
Internet routing infrastructure
Network Automation & Programming
Programming Languages
Python (Preferred)
Other scripting or compiled languages
Network Automation Technologies
Network Automation Frameworks
API Development & Integration
Infrastructure Automation
Network Modeling & Programmability
YANG
OpenConfig
NETCONF
Network Security
SSL/TLS
VPN Technologies
Secure Network Design
Monitoring & Telemetry
Network Monitoring Systems
Telemetry Platforms
Alerting Systems
Dashboard Development
Performance Analytics
Preferred Experience

Preference will be given to candidates with experience in:

Large ISP environments
Hyperscale cloud providers
Network Operations Centers (NOC)
Reliability Engineering (NRE/SRE)
Cloud Infrastructure Platforms
Automation-first operational environments
Required Soft Skills
Strong analytical and troubleshooting skills
Excellent verbal and written communication
Ability to work in fast-paced cloud environments
Strong ownership mindset
Ability to manage ambiguity effectively
Collaboration and mentoring capabilities
Continuous learning mindset
Education

Bachelor’s Degree in:

Computer Science
Computer Engineering
Electrical Engineering
Telecommunications
Related Technical Discipline

OR

Equivalent professional experience.

Key Success Indicators

Successful candidates will demonstrate:

Expertise in large-scale cloud networking
Advanced network automation capabilities
Strong software development skills
Operational excellence mindset
Ability to solve complex distributed systems problems
Experience managing hyperscale network environments
Career Progression Opportunities

This role can progress into:

Senior Principal Network Engineer
Cloud Network Architect
Principal Site Reliability Engineer
Distinguished Engineer
OCI Infrastructure Architect
Network Automation Architect
Cloud Infrastructure Leadership Roles
Why Join Oracle?
Work on one of the world’s largest cloud infrastructures.
Operate networking systems at hyperscale.
Build automation solutions used globally.
Collaborate with world-class engineers.
Access global career growth opportunities.
Receive competitive compensation, healthcare, retirement, and other employee benefits.
Ideal Candidate Profile

This role is ideal for professionals coming from:

Hyperscale Cloud Providers
ISP/Core Networking Environments
Network Reliability Engineering (NRE)
Site Reliability Engineering (SRE)
Data Center Networking
Cloud Infrastructure Engineering
Network Automation Engineering

Candidates with strong expertise in both networking and software development/automation will be particularly successful in this role.

Location