Oracle Autonomous Database Reliability Engineer (Join-Ns2) in Frisco, Texas
Autonomous Database Reliability Engineer (Join-Ns2)
Minimum requirement: US citizen with at least an active TS clearance that is SCI eligible
Preferred Location: Seattle, WA; Reston, VA; or Dallas, TX
No Visa Sponsorship is available for this position.
Are you interested in the exciting challenges of building and operating large-scale distributed infrastructure for the cloud? Oracle’s Cloud Infrastructure (OCI) National Security Sector Group is building its next generation of Cloud IaaS/PaaS/SaaS technologies that operate at high scale in a broadly distributed multi-tenant environment. Our mission is to provide our customers with an enterprise-level cloud infrastructure platform that delivers unmatched reliability, scalability, and performance for mission-critical databases, applications, and workloads.
The Autonomous Database Team is responsible for building the cloud service framework powering various Oracle Autonomous Database cloud services, including Autonomous Data Warehouse (ADW) and Autonomous Transaction Processing (ATP). The framework automates deployment, scaling and management of databases in the cloud. It is built on top of Oracle's Cloud Infrastructure (OCI) Layer.
The autonomous database cloud service framework features APIs to handle all lifecycle management operations of databases. It also performs operations autonomously based on internal and external events. The team has the unique opportunity to make significant contributions to the full stack of Oracle technology, from database kernel to cloud service platform and to customer-facing portals.
As an SRE, you will be responsible for defining and deploying autonomous database services with a deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams to deliver new and outstanding experiences to our stakeholders while ensuring reliability and performance.
This role will support Oracle’s Government customers.
We are a dynamic, enthusiastic team that places great emphasis on proactive go-getters.
In this role you will need to:
Act as a point of escalation for incidents and other issues affecting cloud database services within the region.
Operate and perform maintenance on cloud database services running within the region.
Deploy code and execute other changes within the region.
Take ownership of the implementation and production operations of a wide array of core system platform solutions.
React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring for production systems.
Ensure thorough documentation of incidents through company-standard reporting methods.
Stay informed about cloud infrastructure stacks.
Drive and actively participate in the resolution of complex technical issues spanning various services.
Ability to maintain a US government security clearance.
At least a Bachelor’s degree, in Computer Science, MIS or another technical field, or equivalent work experience.
Solid experience with Linux.
Experience troubleshooting complex software and/or networking issues.
Strong understanding of cloud concepts and platforms.
Experience in cloud technical support, operations, NOC or similar is preferred, but not required.
Expert-level experience in understanding, implementing, and troubleshooting Oracle Database technology, including RAC, Data Guard, ASM, and RMAN, is preferred.
Development skills in Python, shell, and SQL.
Expert knowledge and in-depth experience of Oracle Engineered Systems and subsystems, especially Exadata.
Ability to troubleshoot and resolve complex hardware/software issues, restore environments to an operational state, perform root cause analysis, and provide forward-thinking mitigation strategies.
Strong communication and analytical skills.
Familiarity with security practices in web application delivery and general knowledge of network topology.
Experience working with government customers is preferred, but not required.
Proven ability to quickly learn new technical domains and then train others.
Detailed Description and Job Requirements
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Take responsibility for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Hold authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Bring professional curiosity and a desire to develop a deep understanding of services and technologies.
A BS or MS in Computer Science, or equivalent. Ability to identify and implement complex solutions, drawing on knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, and technology security and compliance. Experience running large-scale, customer-facing web services. Understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting the technical architecture of complex and highly scalable products. A minimum of 8 years of experience running large-scale, customer-facing web services.
Oracle is an Affirmative Action-Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, protected veteran status, age, or any other characteristic protected by law.
Job: Product Development
Other Locations: US-TX,Texas-Frisco, US-WA,Washington-Seattle
Job Type: Regular Employee Hire
- Oracle Jobs