Job was saved successfully.
Job was removed from Saved Jobs.

Job Details


Cloud Infrastructure Site Reliability and Automation Engineer (362265BR)

Aerospace and Aviation





IntroductionAre you passionate about technology? Do you love building new things? Do you want to develop the future of IBM's Cloud offerings? If you answered YES, then we have the right opportunity for you!The shift toward the consumption of IT as a service, i.e., the cloud, is one of the most important changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in analytics, security, commerce, and cognitive computing and with unmatched hardware and software design and industrial research capabilities, no other company is as well positioned to address the full opportunity of cloud computing.Your Role and ResponsibilitiesWe are looking for a dynamic, Site Reliability and Automation Engineer to join our Cloud Operations Team, who is responsive to market needs, to deliver value to our clients in a fast-changing cloud landscape. The Cloud team is dedicated to ensuring the IBM Cloud is at the forefront of cloud technology, from data center design to network architecture to storage and compute clusters to flexible infrastructure services. We are building and operating IBM's next generation cloud platform to deliver performance and predictability for our customers' most demanding workloads, at global scale and with leadership efficiency, resiliency and security. It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.In this Site Reliability and Automation Engineer role, you will work closely with the Data Center, the entire Cloud development organization and IBM vendors to support, maintain and operationally improve the cloud infrastructure. Your focus will be the following key responsibilities:* Automate health monitoring of the production and test systems* Automate return to service procedures for Cloud Platform Components* Support the compliance and security integrity of the environment through your work* Partner with other teams, functional managers and program managers to deliver mission-critical services to the market* Support development of new and existing capabilities for our compute, storage and network services* Integrate automation with operational requirements* Work with Engineering to:o Define operational requirementso Automate operational requirementso Participate in the full deployment pipeline* Work with Support and Development to:o Identify and resolve issueso Discuss and plan integration requirementsRequired Professional and Technical Expertise* Minimum of 5 years' experience in hands-on production administration of large system environments, including virtual platforms.* Experience in establishing, following, and improving operational procedures within a mission critical environment* 5+ years of experience in data center infrastructure or relevant work experience* 5+ years of experience in large-scale infrastructure design, engineering, and support* 5+ years of experience in IT Change, Incident, Problem, Asset management* 5+ years of infrastructure engineering with proven record for delivering high-quality, large-scale solutions. Experience designing architectures for scale and performance* Must be efficient in writing, debugging and maintaining scripts (Bash and Python)* Must be extremely comfortable using and navigating within a Linux environment* Ability to do low level debugging and problem analysis by examining logs and running Unix commands* 2-3 years of extensive experience with open-source products* 3-5 years of experience with configuration management systems (Ansible / Chef)* Hands on knowledge of using Splunk or ELK* Must have the ability to perform debugging and problem analysis by examining logs and running Unix commands* Working knowledge with Network and Storage technologies* Working knowledge with ServiceNow, JIRA, Confluence, and GitHub* Excellent written and verbal communication skills* Comfortable operating in fast paced environmentThis role is based on a shift pattern of Tuesday - Saturday 08:00 - 16:00Required Technical and Professional ExpertiseAs per Job DescriptionPreferred Technical and Professional ExpertisePreferred Skills:* 2+ years of experience with Kubernetes* 4+ years of experience with GitHub, Perl and Python* 5+ years of experience with configuration management systems (SaltStack/Ansible/Chef)* 8+ years of experience in virtualization environments such as AWS /Softlayer/Zen/VMWARERequired Education:BS or equivalent in computer science or electrical engineering or relevant experiencePreferred Education:Masters' DegreeAbout Business UnitDigitization is accelerating the ongoing evolution of business, and clouds - public, private, and hybrid - enable companies to extend their existing infrastructure and integrate across systems. IBM Cloud provides the security, control, and visibility that our clients have come to expect. We are working to provide the right tools and environment to combine all of our client's data, no matter where it resides, to respond to changing market dynamics.Your Life @ IBMWhat matters to you when you're looking for your next career challenge?Maybe you want to get involved in work that really changes the world? What about somewhere with incredible and diverse career and development opportunities - where you can truly discover your passion? Are you looking for a culture of openness, collaboration and trust - where everyone has a voice? What about all of these? If so, then IBM could be your next career challenge. Join us, not to do something better, but to attempt things you never thought possible.Impact. Inclusion. Infinite Experiences. Do your best work ever.About IBMIBM's greatest invention is the IBMer. We believe that progress is made through progressive thinking, progressive leadership, progressive policy and progressive action. IBMers believe that the application of intelligence, reason and science can improve business, society and the human condition. Restlessly reinventing since 1911, we are the largest technology and consulting employer in the world, with more than 380,000 IBMers serving clients in 170 countries.Location StatementFor additional information about location requirements, please discuss with the recruiter following submission of your application.Being You @ IBMIBM is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, pregnancy, disability, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.