Oracle Site Reliability Engineer in Prague, Czech Republic
Site Reliability Engineer
Build and own the vision for cloud-native applications. Identify, pilot and develop tools, automation, processes and software changes to address top runtime & scalability issues. Work with Kubernetes, Docker, Continuous Integration. Run large-scale, massively distributed systems.
SRE (what we do)
SRE works on improving the developer experience - we’re trying to make the developer’s life easier by automating everything that can be automated, providing reliable monitoring of the running services (Prometheus, ELK stack) and keeping everything we do well documented (Confluence, Sphinx). We also keep an eye on our infrastructure and network configuration, making use of Infrastructure as Code principles (Terraform, Terratest). SRE helps smoothen the continuous integration and continuous delivery process (TeamCity, CircleCI). Our team also coordinates everything related to an incident response (PagerDuty, Slack), helping our engineers resolve any issues related to our production.
Talking about production: we have to make sure we’re live and running at all times. If there’s a production issue, we need to make sure it’s handled properly. SRE team is on top of the whole incident workflow - from the initial alert through the impact analysis, customer response to creating a post mortem - we coordinate everything related to an incident response.
Our range is wide so we learn new things every day - which is the extra super cool part about being an SRE.
Your main task will be helping other engineers to keep production in a good shape. You will be developing code on the top of our infrastructure to ensure monitoring, continuous delivery process, reliability and security of all our products. You will be the one who will help others to do the things the best way possible. You will have the freedom to design solutions to achieve the goals above.
EU citizenship (or EU permanent residency), willing to relocate to Prague
Willingness to run what you've build (and carrying pager to prove it; rotating on platform shifts)
Good Linux and shell-scripting skills (Red Hat or Oracle Linux knowledge is an advantage)
General laziness towards manual work and tendency to automate everything
Interest in all things observability (monitoring, logging, tracing), we’d welcome experience with Prometheus, Grafana, ELK stack
Very good level of written and spoken English
Genuine love for creating documentation
Detailed Description and Job Requirements
Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc.
As a member of the software engineering division, you will take an active role in the definition and evolution of standard practices and procedures. You will be responsible for defining and developing software for tasks associated with the developing, designing and debugging of software applications or operating systems.
Work is non-routine and very complex, involving the application of advanced technical/business skills in area of specialization. Leading contributor individually and as a team member, providing direction and mentoring to others. BS or MS degree or equivalent experience relevant to functional area. 7 years of software engineering or related experience.
As part of Oracle's employment process candidates will be required to successfully complete a pre-employment screening process. This will involve identity and employment verification, professional references, education verification and professional qualifications and memberships (if applicable).
Job: Product Development
Location: CZ-CZ,Czech Rep-Prague
Job Type: Regular Employee Hire
- Oracle Jobs