Oracle SRE/Architect in Sacramento, California


Preferred Qualifications

The Oracle Cloud Infrastructure (OCI) team can provide you

the opportunity to build and operate a suite of massive scale, integrated cloud

services in a broadly distributed, multi-tenant cloud environment. OCI is committed to providing the best in

cloud products that meet the needs of our customers who are tackling some of

the world’s biggest challenges.

We offer unique opportunities for smart, hands-on engineers

with the expertise and passion to solve difficult problems in distributed

highly available services and virtualized infrastructure.

At every level, our engineers have a

significant technical and business impact designing and building innovative new

systems to power our customer’s business critical applications

About Oracle Production Engineering

Oracle Cloud Production Engineering plays a critical role indelivering and supporting best-of-breed cloud solutions to Oracle customers.

Oracle Cloud is the industry’s broadest and most integratedpublic cloud. It offers best-in-class services across software as a service(SaaS), platform as a service (PaaS), and infrastructure as a service (IaaS),and even lets you put Oracle Cloud in your own data center. Oracle Cloud helpsorganizations drive innovation and business transformation by increasingbusiness agility, lowering costs, and reducing IT complexity.

The Oracle Cloud has shown strong adoption, supporting 70million users and more than 30 billion transactions each day. It runs in 19data centers around the world.

Our team provides 24/7/365, follow-the-sun coverage whilepushing the boundaries of what can be accomplished in the cloud. Advancingcloud computing means great growth opportunities, and highly rewardingexperiences working in our expanding computing environments and DEVOPS teams.

About The Job

A unique opportunity to join a rapidly growing world-class team asthe technical lead, and design engineer, for cutting edge Kubernetes andContainerization Framework solutions, built on Oracle Cloud technologies andinfrastructure, and the latest open source projects.

Are you the type of engineer that hears a problem statement andsays “we can do that” – even with recently released technology? If a problem needs to be solved, you just doit – even if that means solving in code? You have been the technical lead for teams developing and deployingsolutions for bare metal/physical servers, and multiple cloud providers?

Cloud Native technologies are rapidly changing and evolving – doyou relish diving in, learning what’s new, and delivering infrastructuresolutions that are on the bleeding edge, while being able to lead others inwhat you’ve just learned?

This is an opportunity to build with the latest technologies, inan environment that needs to be dynamic, yet scale, whist providing thetechnical direction, and leadership, for the other engineers on the team – anopportunity to help build out a world-class team as a critical member and thetechnical lead.

As part of the SRE team, you will be continually challenged, andhave an opportunity to contribute to the success of the Oracle Cloud every day,and will work closely with the product development partners leveraging yoursolutions.

As a lead DevOps engineer, you will solve interesting technicalchallenges by defining, designing deploying and troubleshooting key OracleCloud services, platforms, and infrastructure, always thinking aboutreliability, scalability, resilience, security, and performance, while leading,mentoring and guiding a team of engineers in the technologies used to deliversolutions.

What You'll Do

  • Kubernetes Architecture, Design,Build & Operations


As the technical lead forKubernetes, you will work to architect, then design, build and operateKubernetes deployments, in line with Oracle Architecture guidelines andstrategies

§ Envision the requirements and architecture to provide rock-solid, highlyavailable, and 100% automated Kubernetes Deployments

§ Ensure that deployments integrate with and leverage internal Oraclesystems

§ Write the code, lead the team in all technical matters

  • Kubernetes InfrastructureApplications & Toolchain


Just deploying Kubernetes is notenough – how do you measure performance, read aggregated log files, graph andalert on data that details the current state of the environment?

§ Designing and building supporting systems that run within Kubernetes forKubernetes – whether Elasticsearch, Prometheus, or other systems to aid in the managementof Kubernetes infrastructure

  • Continuous Integration/ContinuousDeployment (CI/CD)


Design CI/CD pipelines thatintegrate with Oracle’s internal development processes and toolsets, whilstleveraging technologies such as Jenkins and Spinnaker

§ Pipelines for Infrastructure development

§ Pipelines for Applications to run on Kubernetes deployments

  • TechnicalLeadership – You will be thetechnical lead for Kubernetes and the containerization framework utilized bydevelopment teams – you will be the go to person for:








Infrastructure Coding


Systems Engineering




Hardware Deployments



  • Service Ownership –You will be partof the SRE team, whose mission is the shared full stack ownership of acollection of services and/or technology areas, with our Development partners.

  • Ownership Scope – As an SRE, andtechnical lead, you will understand the end-to-end configuration, technicaldependencies, and overall behavioral characteristics of the production servicesyou own. In partnership with your Development partners, you will have theresponsibility to ensure that services are designed and delivered to be missioncritical with focus on security, resiliency, scale, and performance. SREs arethe ultimate authority and are accountable for the end-to-end performance andoperability of the services they own.

  • Service Design – As the OracleCloud evolves; you will partner with development teams in defining andimplementing improvements in service architecture, both current and future. Asan SRE, you will be an expert at articulating technical characteristics of yourservices and the dependencies between services, and guide Development teams toengineer and add premier capabilities to the Oracle Cloud service portfolio.

  • OperationsEngineering – You will understand and be able to communicate thescale, capacity, security, performance attributes and requirements of theservices you own. You are a Subject Matter Expert, able to understand andcommunicate every characteristic of the service stack, such as:


degradation and behavior under load of the services andtheir dependencies


end-to-end tuning needs, optimizing resource utilization,as load patterns fluctuate


Instrumentation and metrics that clearly describe theservice behaviors


scaling requirements and patterns


resiliency and recoverability, ensuring that backup /restore and disaster recovery capabilities are implemented, tested andmaintained

  • Automation – You will have aclear understanding of automation and orchestration principles, and will beeager to automate, wherever and whenever the possibility arises, whilesimultaneously eliminating technical debt. Automation must be part of your DNA.

  • Technical Experts - As serviceowner, you are the ultimate escalation point for complex or critical issuesthat have not yet been documented as SOPs for Level1 staff. You will usuallyget called in during major incidents as an SME, when the source of aproblem is unclear. You will have the deep understanding of service topologyand their dependencies required to troubleshoot issues and define mitigations.

  • Broad Interests - SREs are a raremix of sysadmins and development Engineers, and as such have the ability tounderstand and explain the affect of product architecture decisions on theability to run as distributed systems. They are driven by professionalcuriosity and a desire to a develop deep understanding of the their servicesand the technologies they depend upon.

What You Need to Have

A BS or MS in Computer Science, or equivalent experience

Knowledge of :

  • Defining and documenting technical architecture of complex andhighly scalable products

  • Kubernetes Infrastructure


Building a cluster, from scratch,without using kubeadm and other toolsets

§ High Availability, Security, Automation


Kubernetes networking andintegration with existing network topologies

§ Calico, Flannel, BGP


Community add-ons & repositoriesfor common mgmt. toolchains

§ Prometheus, Elasticsearch, Grafana

§ ISTIO, or similar service mesh technologies


Building components from sourcein a CI/CD pipeline

§ Hyperkube, and other components – built on Oracle Linux for Oracle Linux

  • Continuous Integration/Continuous Deployment pipelines (CI/CD)


Developing robust and consumablepipelines for developers to onboard applications to Kubernetes

§ Docker container build automation with Dockerfile

§ Jenkins

§ Spinnaker

§ Kapitan


§ Custom coded solutions

  • Development of RESTAPIs and Services


You can build REST APIs to perform complex tasks with thesimplicity of a HTTP call

  • Development ofad-hoc tools as required


The tool doesn’t exist? Then write it


You found a bug? Then fix it


You are already part of, or happy to join, open sourcecommunities, and contribute bugs, fixes and help to the community


You solve problems with code, not just people and process

  • Server hardwareconfiguration

  • Linux internals

  • General Networkingand TCP/IP

  • Standard Internetservices, such as DNS, HTTP, etc.

  • Programming andscripting languages, such as Go, Python, Ruby, Bash, etc.

  • Configurationmanagement systems such as Ansible, Chef and Puppet

  • DevOps toolchains –CI/CD – Jenkins, GIT, other automation frameworks

  • Cloud computingpatterns

  • IT Security andcompliance

  • 10 yearsexperience of running large scale customer facing web services

  • Most importantly,the aptitude to be a good team player and the willingness to learn andimplement new Cloud technologies as needed, and then lead others in the same

  • Methodical approachto troubleshooting complex problems

What The Perfect Candidate Will Have

Understanding of:

  • Service Mesh forKubernetes – ISTIO, Consul, others

  • Load balancingtechnologies, including L7 routing

  • Industry standardsand futures for containerization technologies

  • Machine learning,and real-world applications of such

Experience with:

  • Open Sourcedevelopment – leading, or contributing to open source projects

  • Development inlanguages, such as C, C , Java

  • Databases, and bigdata stores

  • Previous TechnicalLeadership roles

Detailed Description and Job Requirements

Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc.

As a member of the software engineering division, you will take an active role in the definition and evolution of standard practices and procedures. Define specifications for significant new projects and specify, design and develop software according to those specifications. You will perform professional software development tasks associated with the developing, designing and debugging of software applications or operating systems.

Provide leadership and expertise in the development of new products/services/processes, frequently operating at the leading edge of technology. Recommends and justifies major changes to existing products/services/processes. BS or MS degree or equivalent experience relevant to functional area. 8 or more years of software engineering or related experience.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans status or any other characteristic protected by law.

Job: Product Development

Location: US-CA,California-Santa Clara

Other Locations: US-CA,California-Redwood City, US-CO,Colorado-Broomfield

Job Type: Regular Employee Hire

Organization: Oracle