Principal Site Reliability Developer (SRE/SRD)
Oracle
Romania
Onsite
staff
June 1, 2026
€155,000
€64,000
Job Description
At Oracle Cloud Infrastructure (OCI), we build the more intelligent future of cloud. OCI EMEA Operations is a team of smart, motivated, and diverse people who are focused on bringing the world's most important work to OCI. We build and operate our commercial and sovereign cloud regions to be reliable and high-performing. Our customers and their mission are at the centre of what we do. We strive to deepen our understanding of the challenges our customers face, use that understanding to enhance our cloud capabilities, and work together to deliver their mission.
As a Site Reliability Engineer, you will be responsible for the operation of production environments, including systems and databases, that support critical business operations for a commercial and sovereign cloud environment. You will focus on automating and optimizing operations across multiple production environments, and you will recommend new solutions to improve availability, performance, and supportability. This is an opportunity to combine deep technical knowledge with administration and analysis knowledge of Oracle's Cloud Infrastructure, providing escalation support for a wide range of complex production environment problems related to immense growth, scaling, leveraging the cloud, extremely high performance, and high availability requirements. You will also guide junior engineers in solving complex problems, take part in large-scale incident bridges, and help build and optimize processes and procedures.
- Develop automation and optimizations focused on operational excellence.
- Deep-dive into, root-cause, and resolve systemic issues.
- Enhance Operations quality outcomes through scalable automations.
- Install, monitor, maintain, support, and optimize all production server hardware and software.
- Provide escalated technical support for complex technical issues which may include leading problem management cases and providing management status.
- Coordinate escalated support cases, leading the appropriate internal technical resources and/or third-party vendors to resolution, and coordinate the storage infrastructure of Oracle system and database appliances.
- Take responsibility for Oracle production environments: assist with server operating system and application upgrades, bug fixes, and patching, and work on standardization projects for both hardware and software under the Oracle technology stack, while maintaining the consistent system uptime expected in a cloud environment.
- Lead communications with key partners in solving complex technical problems.
- Provide technical guidance and leadership to junior members to enable them to grow in their careers.
Requirements:
- Permanently resident in Romania.
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
- 6+ years of experience in systems engineering, software development, cloud operations, or site reliability engineering roles.
- Strong proficiency in at least one programming or scripting language (e.g., Python, Go, Java, Bash).
- Solid understanding of Linux/Unix systems, networking (TCP/IP, DNS, load balancing), and storage technologies.
- Experience with monitoring, observability, and operational analytics platforms.
- Understanding of cloud-native technologies such as Docker, Kubernetes, and orchestration frameworks.
- Experience participating in and leading large-scale incident response and operational bridges.
- Experience developing automation solutions focused on operational excellence, reliability, and scalability.
- Familiarity with AI-assisted engineering tools such as Codex, GitHub Copilot, Cursor, Claude Code, or similar technologies to improve engineering productivity.
- Understanding of Large Language Models (LLMs) and their application in troubleshooting, automation, incident management, operational workflows, and knowledge management.
- Familiarity with agentic workflows, AI agents, and intelligent automation frameworks to streamline operations and improve service reliability.
- Strong operational mindset with a focus on ownership, customer impact, continuous improvement, automation, and operational excellence.
- Experience leveraging data-driven insights, observability platforms, and automation to proactively identify, investigate, and resolve reliability and performance issues.
- Customer focus, with a passion for delivering reliable and scalable cloud services.
- Experience in SRE, cloud technical support, cloud operations, large-scale events management, or similar environments.
- Demonstrated ability to quickly learn new technical disciplines and effectively mentor and train others.
Career Level - IC4