Oracle Senior Manager - Cloud Service Reliability Engineering in Belmont, California
Manage a team that designs, develops, troubleshoots and debugs software programs for databases, applications, tools, networks etc.
As a manager of the software engineering division, you will apply your knowledge of software architecture to manage software development tasks associated with developing, debugging or designing software applications, operating systems and databases according to provided design specifications. Build enhancements within an existing software architecture and suggest improvements to the architecture.
Manages and controls activities in multi-functional areas of sections. Ensures appropriate operational planning is effectively executed to meet Corporate specifications. Demonstrated leadership and people management skills. Strong communication skills, analytical skills, thorough understanding of product development. BS or MS degree or equivalent experience relevant to functional area. 4 years of software engineering or related experience.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, protected veteran status, or any other characteristic protected by law.
Senior Manager - Cloud Service Reliability Engineering
Do you like thinking about how to make large-scale deployments with a lot of moving parts more reliable?
Are you comfortable attempting a problem that has never been solved before?
Are you someone who thinks about how you can make things better?
Are you hands-on and driven toward excellence, and do you thrive on challenging high-scale problems?
The Hospitality Cloud DevOps team is a newly formed group within Oracle working on solving difficult challenges in maximizing our service availability while working in parallel to evolve applications from SaaS to True Cloud Native Solutions. We are like a start-up inside a large company with a big charter and with room for creative freedom. We are looking to assemble some of the smartest people in the industry in growing this team.
As an SRE Sr. Manager, you'll define how to use the latest technologies to identify and optimize operational efficiency. You will be responsible for the infrastructure and reliability of SaaS services. You will work with a team pushing the boundaries toward scalable, self-healing, autonomous platform solutions.
We are looking for someone who is passionate about:
Owning end-to-end availability, reliability, and performance of our Hospitality SaaS services on Oracle Industry Cloud and Oracle IaaS.
Leading, managing and developing a distributed team of Service Reliability Engineers
Providing training for all personnel to ensure highest level of support and customer satisfaction
Leading teams in design and implementation of processes for rolling out software and security updates to deployments with near zero downtime
Building and maintaining our platform and automation frameworks to ensure maximum up-time and predictability while preventing outages and service interruptions or degradation
Analyzing system failures and developing rapid response processes to ensure such failures do not reoccur
Working cross-functionally with product development, product management, program management and cloud infra operations teams
Partnering with engineering to provide the infrastructure and services required to enable innovation and ensure the highest level of quality and service
Predicting and providing notice of potential system vulnerabilities for current and future solutions and implementations; providing specific recommendations and guidance to address such vulnerabilities
Developing and managing processes and metrics that ensure maximum reliability and up-time for our customers
Overseeing, in partnership with the Hospitality Cloud Engineering peer, the analysis, building, and maintenance of all automation tools and processes to ensure the highest standards of reliability and robustness
Fully understanding our customers' service needs and ensuring we meet those needs
Participating in escalation workflows as required
5 years of experience supporting large-scale distributed systems globally
2 years of technical operations leadership
2 years of experience with post-mortem assessments and corrective action plans
Understanding of Hospitality Product Portfolio
Experience working cross-functionally with internal customers and executives
2 years of people management and team leadership experience including headcount planning and developing strong and motivated teams
Experience creating effective resource plans that ensure a high level of performance
Experience developing repeatable processes and metrics that maximize uptime, reliability, and predictability
Experience managing complex deployments
Successful track record of building and maintaining infrastructure, including the required monitoring, testing, and tooling
Experience with Agile and DevOps methodologies
Effective verbal, written communication and interpersonal skills including interfacing with customers on a professional and cooperative level
Able to develop and maintain strong relationships with Oracle Stakeholders & Leadership Peers
BS degree in Computer Science or related degree or equivalent experience
No matter your role on our team, you'll find yourself in an exciting and challenging environment where every person is empowered to show initiative, be outspoken, and be proactive, not reactive. Oracle is dedicated to the continual growth and development of its staff, striving constantly to strengthen our expertise as well as develop new skills. Our team is spread all around the world on four continents. We provide a full range of opportunities and challenges to apply your skills and grow your career in this new and exciting arena.
Job: Product Development
Title: Senior Manager - Cloud Service Reliability Engineering
Location: United States
Requisition ID: 18001BQO