Talent.com
Site Reliability Developer

Oracle, Romania
Posted 16 days ago
Job description

As a Principal Site Reliability Developer, you will play a key role in designing, automating, and managing enterprise-grade cloud services that support Oracle’s Business Units and customers. This role focuses on ensuring the stability, scalability, security, and performance of Oracle Cloud Infrastructure (OCI) services while driving Infrastructure as Code (IaC), automation, and reliability engineering best practices.

Career Level - IC4

Responsibilities:

We are looking for a self-driven professional who combines strong IT engineering and IT operations skills with a sound knowledge of cloud services and Infrastructure as Code.

You will solve technical challenges by defining, designing, deploying, and troubleshooting Oracle Cloud services, platforms, and infrastructure, with a focus on reliability, scalability, resilience, security, and performance.

Duties are varied and complex, requiring independent judgment and proactive action. The role includes, but is not limited to, the following:

  • Ensure the stability, reliability, and proper documentation of production services.
  • Plan and coordinate release deployments across multiple phases and OCI realms, ensuring timely completion of tasks.
  • Lead the deployment of new service regions across different OCI realms.
  • Troubleshoot and debug service incidents, providing efficient resolutions.
  • Participate in the annual Disaster Recovery testing to ensure the resilience and reliability of cloud services.
  • Develop automation tools to enhance deployment speed and monitoring capabilities in a large-scale environment.
  • Collaborate and communicate effectively within a global team.
  • Deliver accurate and creative solutions to user issues, ensuring productivity and satisfaction.
  • Provide on-call support for critical production applications in a 24/7 environment.
  • Maintain service quality by resolving issues efficiently to meet performance metrics and SLAs.
  • Assess and prioritize workload, escalating tickets through appropriate channels for timely resolution.
  • Perform database management tasks, including software installations, version upgrades, security patching, configuration management, and backup/recovery.
  • Work independently while coordinating effectively with geographically distributed teams.
  • Implement and collaborate on security measures to protect data and infrastructure.
  • Research and analyze emerging threats and vulnerabilities to enhance system security.

Required Skills and Experience

  • European Union citizen residing within a European Union country.
  • Bachelor’s degree in computer science, related fields, or equivalent experience.
  • 7+ years of experience in Infrastructure as Code (IaC) with focus on Terraform, Site Reliability Engineering (SRE), DevOps tooling, or automation, supporting large-scale production systems.
  • Strong understanding of cloud concepts and platforms, preferably Oracle Cloud Infrastructure (OCI).
  • Strong expertise in Site Reliability Engineering (SRE) methodology.
  • Proficiency in programming and scripting languages such as Python, Ruby, Perl, Shell scripting, or Java.
  • Good knowledge of source code management concepts and tools, especially Git, Bitbucket, and GitLab.
  • Experience in SQL and PL/SQL scripting, including Oracle packages, procedures, functions, Linux Bash scripting, and SQL tuning.
  • Strong system administration skills, including Linux internals, TCP/IP, DNS, and load balancing technologies.
  • Experience in developing scripts to automate application deployments and installations.
  • Excellent verbal and written English communication skills, with the ability to engage effectively with middle to senior management.
  • Experience with the following is an advantage:
      • Oracle APEX
      • Resolving common security vulnerabilities such as Cross-Site Scripting (XSS), SQL Injection, Cross-Site Request Forgery (CSRF), and HTTP Response Splitting
      • Configuration management tools such as Chef and Puppet
      • Continuous deployment and source control tools such as Jenkins, Git, and Maven
      • Knowledge of Kubernetes and Docker technologies

Even if you don’t meet all the required skills, we encourage you to apply: enthusiasm and a willingness to learn are just as important.
