fbpx

Site Reliability Engineer

Essentials 

Job title: Site Reliability Engineer
Location: Bucharest
Type: CIM, hybrid (2 days/week office)

Technologies          

Docker/Kubernetes, Grafana / Elasticsearch
AWS, Terraform, SQL

Offer

Long-term development plan
Clear career progression paths.
25 days of holiday
Health insurance
Meal tickets
Udemy access
Various bonuses (Christmas, Easter)

Client

Our client is  the Romanian subsidiary of a well known  France based company, encompassing over 23,000 customer companies and administration, 90% being CAC 40 companies with more than 5,000 employees (1,000 IT engineers). Our client solutions range from consultancy to the co-creation of innovative offers and service operations, as well as professional and sector-based solutions. The company, by combining collaborative platforms, professional expertise, digital and industrial capacities, has asserted itself as a trusted business partner in digital and mobile transformation for organizations.

Responsibilities

  • Understand functional flows of platform managed and being able to support customer on onboarding or support phase
  • Harden platforms before they go live by reviewing their design and implementation, tuning configuration as well as developing auxiliary tools and necessary monitoring of critical health indicators
  • Maintain platforms after go live by measuring and monitoring their availability, performance and overall system health
  • Recover platforms during production incidents to meet targeted SLA; perform detailed root cause analysis to prevent regressions. Business hours work model plus additional on calls possible.
  • Proactively seek improvements of non-functional requirements; cooperate with development teams to improve operational aspects of platforms under your responsibility
  • Validate readiness and maturity of new rollouts through development, execution and verification of automated smoke test suites
  • Provide technical expertise on company’s products and support processes to internal and external customers

Education and Experience

MS Computer Engineering
First experience of 5 years in SRE position or integrator role in a software company

Technical Skills

Must have

  • Good knowledge of Unix ecosystem and tools
  • Experience with Docker/Kubernetes
  • Bash scripting and automation skills
  • Understanding of networking topology and components of distributed web applications
  • Working experience with monitoring tools: Grafana / Elasticsearch
  • Familiar with CI/CD
  • Knowledge about REST API and testing tools: Postman

Nice to have

  • Good understanding of security and key encryption mechanisms
  • Experience with Public Cloud tools and deployments (AWS, Azure)
  • Understanding of SQL database design and operations; SQL syntax

Soft-Skills

  • Communicative in English
  • Strong analytical skills, systematic problem solving approach
  • Communicate effectively and professionally with customers and other third party companies
  • Ability to work and interact effectively in a distributed team environment.

Apply today

If you meet the minimum requirements and are interested in applying for this position, please send your details to careers@key-talents.com   with “Site Reliability Engineer”, in the subject line.