Oracle SRE/Architect in Sacramento, California
The Oracle Cloud Infrastructure (OCI) team can provide you
the opportunity to build and operate a suite of massive scale, integrated cloud
services in a broadly distributed, multi-tenant cloud environment. OCI is committed to providing the best in
cloud products that meet the needs of our customers who are tackling some of
the world’s biggest challenges.
We offer unique opportunities for smart, hands-on engineers
with the expertise and passion to solve difficult problems in distributed
highly available services and virtualized infrastructure.
At every level, our engineers have a
significant technical and business impact designing and building innovative new
systems to power our customer’s business critical applications
About Oracle Production Engineering
Oracle Cloud Production Engineering plays a critical role indelivering and supporting best-of-breed cloud solutions to Oracle customers.
Oracle Cloud is the industry’s broadest and most integratedpublic cloud. It offers best-in-class services across software as a service(SaaS), platform as a service (PaaS), and infrastructure as a service (IaaS),and even lets you put Oracle Cloud in your own data center. Oracle Cloud helpsorganizations drive innovation and business transformation by increasingbusiness agility, lowering costs, and reducing IT complexity.
The Oracle Cloud has shown strong adoption, supporting 70million users and more than 30 billion transactions each day. It runs in 19data centers around the world.
Our team provides 24/7/365, follow-the-sun coverage whilepushing the boundaries of what can be accomplished in the cloud. Advancingcloud computing means great growth opportunities, and highly rewardingexperiences working in our expanding computing environments and DEVOPS teams.
About The Job
A unique opportunity to join a rapidly growing world-class team asthe technical lead, and design engineer, for cutting edge Kubernetes andContainerization Framework solutions, built on Oracle Cloud technologies andinfrastructure, and the latest open source projects.
Are you the type of engineer that hears a problem statement andsays “we can do that” – even with recently released technology? If a problem needs to be solved, you just doit – even if that means solving in code? You have been the technical lead for teams developing and deployingsolutions for bare metal/physical servers, and multiple cloud providers?
Cloud Native technologies are rapidly changing and evolving – doyou relish diving in, learning what’s new, and delivering infrastructuresolutions that are on the bleeding edge, while being able to lead others inwhat you’ve just learned?
This is an opportunity to build with the latest technologies, inan environment that needs to be dynamic, yet scale, whist providing thetechnical direction, and leadership, for the other engineers on the team – anopportunity to help build out a world-class team as a critical member and thetechnical lead.
As part of the SRE team, you will be continually challenged, andhave an opportunity to contribute to the success of the Oracle Cloud every day,and will work closely with the product development partners leveraging yoursolutions.
As a lead DevOps engineer, you will solve interesting technicalchallenges by defining, designing deploying and troubleshooting key OracleCloud services, platforms, and infrastructure, always thinking aboutreliability, scalability, resilience, security, and performance, while leading,mentoring and guiding a team of engineers in the technologies used to deliversolutions.
What You'll Do
- Kubernetes Architecture, Design,Build & Operations
As the technical lead forKubernetes, you will work to architect, then design, build and operateKubernetes deployments, in line with Oracle Architecture guidelines andstrategies
§ Envision the requirements and architecture to provide rock-solid, highlyavailable, and 100% automated Kubernetes Deployments
§ Ensure that deployments integrate with and leverage internal Oraclesystems
§ Write the code, lead the team in all technical matters
- Kubernetes InfrastructureApplications & Toolchain
Just deploying Kubernetes is notenough – how do you measure performance, read aggregated log files, graph andalert on data that details the current state of the environment?
§ Designing and building supporting systems that run within Kubernetes forKubernetes – whether Elasticsearch, Prometheus, or other systems to aid in the managementof Kubernetes infrastructure
- Continuous Integration/ContinuousDeployment (CI/CD)
Design CI/CD pipelines thatintegrate with Oracle’s internal development processes and toolsets, whilstleveraging technologies such as Jenkins and Spinnaker
§ Pipelines for Infrastructure development
§ Pipelines for Applications to run on Kubernetes deployments
- TechnicalLeadership – You will be thetechnical lead for Kubernetes and the containerization framework utilized bydevelopment teams – you will be the go to person for:
Service Ownership –You will be partof the SRE team, whose mission is the shared full stack ownership of acollection of services and/or technology areas, with our Development partners.
Ownership Scope – As an SRE, andtechnical lead, you will understand the end-to-end configuration, technicaldependencies, and overall behavioral characteristics of the production servicesyou own. In partnership with your Development partners, you will have theresponsibility to ensure that services are designed and delivered to be missioncritical with focus on security, resiliency, scale, and performance. SREs arethe ultimate authority and are accountable for the end-to-end performance andoperability of the services they own.
Service Design – As the OracleCloud evolves; you will partner with development teams in defining andimplementing improvements in service architecture, both current and future. Asan SRE, you will be an expert at articulating technical characteristics of yourservices and the dependencies between services, and guide Development teams toengineer and add premier capabilities to the Oracle Cloud service portfolio.
OperationsEngineering – You will understand and be able to communicate thescale, capacity, security, performance attributes and requirements of theservices you own. You are a Subject Matter Expert, able to understand andcommunicate every characteristic of the service stack, such as:
degradation and behavior under load of the services andtheir dependencies
end-to-end tuning needs, optimizing resource utilization,as load patterns fluctuate
Instrumentation and metrics that clearly describe theservice behaviors
scaling requirements and patterns
resiliency and recoverability, ensuring that backup /restore and disaster recovery capabilities are implemented, tested andmaintained
Automation – You will have aclear understanding of automation and orchestration principles, and will beeager to automate, wherever and whenever the possibility arises, whilesimultaneously eliminating technical debt. Automation must be part of your DNA.
Technical Experts - As serviceowner, you are the ultimate escalation point for complex or critical issuesthat have not yet been documented as SOPs for Level1 staff. You will usuallyget called in during major incidents as an SME, when the source of aproblem is unclear. You will have the deep understanding of service topologyand their dependencies required to troubleshoot issues and define mitigations.
Broad Interests - SREs are a raremix of sysadmins and development Engineers, and as such have the ability tounderstand and explain the affect of product architecture decisions on theability to run as distributed systems. They are driven by professionalcuriosity and a desire to a develop deep understanding of the their servicesand the technologies they depend upon.
What You Need to Have
A BS or MS in Computer Science, or equivalent experience
Knowledge of :
Defining and documenting technical architecture of complex andhighly scalable products
Building a cluster, from scratch,without using kubeadm and other toolsets
§ High Availability, Security, Automation
Kubernetes networking andintegration with existing network topologies
§ Calico, Flannel, BGP
Community add-ons & repositoriesfor common mgmt. toolchains
§ Prometheus, Elasticsearch, Grafana
§ ISTIO, or similar service mesh technologies
Building components from sourcein a CI/CD pipeline
§ Hyperkube, and other components – built on Oracle Linux for Oracle Linux
- Continuous Integration/Continuous Deployment pipelines (CI/CD)
Developing robust and consumablepipelines for developers to onboard applications to Kubernetes
§ Docker container build automation with Dockerfile
§ Custom coded solutions
- Development of RESTAPIs and Services
You can build REST APIs to perform complex tasks with thesimplicity of a HTTP call
- Development ofad-hoc tools as required
The tool doesn’t exist? Then write it
You found a bug? Then fix it
You are already part of, or happy to join, open sourcecommunities, and contribute bugs, fixes and help to the community
You solve problems with code, not just people and process
General Networkingand TCP/IP
Standard Internetservices, such as DNS, HTTP, etc.
Programming andscripting languages, such as Go, Python, Ruby, Bash, etc.
Configurationmanagement systems such as Ansible, Chef and Puppet
DevOps toolchains –CI/CD – Jenkins, GIT, other automation frameworks
IT Security andcompliance
10 yearsexperience of running large scale customer facing web services
Most importantly,the aptitude to be a good team player and the willingness to learn andimplement new Cloud technologies as needed, and then lead others in the same
Methodical approachto troubleshooting complex problems
What The Perfect Candidate Will Have
Service Mesh forKubernetes – ISTIO, Consul, others
Load balancingtechnologies, including L7 routing
Industry standardsand futures for containerization technologies
Machine learning,and real-world applications of such
Open Sourcedevelopment – leading, or contributing to open source projects
Development inlanguages, such as C, C , Java
Databases, and bigdata stores
Previous TechnicalLeadership roles
Detailed Description and Job Requirements
Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc.
As a member of the software engineering division, you will take an active role in the definition and evolution of standard practices and procedures. Define specifications for significant new projects and specify, design and develop software according to those specifications. You will perform professional software development tasks associated with the developing, designing and debugging of software applications or operating systems.
Provide leadership and expertise in the development of new products/services/processes, frequently operating at the leading edge of technology. Recommends and justifies major changes to existing products/services/processes. BS or MS degree or equivalent experience relevant to functional area. 8 or more years of software engineering or related experience.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans status or any other characteristic protected by law.
Job: Product Development
Location: US-CA,California-Santa Clara
Other Locations: US-CA,California-Redwood City, US-CO,Colorado-Broomfield
Job Type: Regular Employee Hire