fbpx

Site Reliability Engineer

Essentials 

Job title: Site Reliability Engineer
Location: Bucharest, Hybrid
Type: employment contract

Tech Stack  

AWS services, including ALB, CloudWatch, DynamoDB, EC2, IAM, RDS, S3

Offer

Fun, people-first work culture built on the principles of lean and flat structure
Work with latest tech powering global businesses
Hybrid working model based on both office environment and working from home ( 1-2 days/ week from the office)
High-quality hardware and tools to support your best work
25 days of paid vacation to recharge and explore
Net annual well-being bonus of 360 EUR to support your personal growth and happiness

Client

Our client is a Nordic based company dedicated to building trust and adding value for their customers. They secure and automate their client’s most advanced workflows and onboarding processes using their API-first SaaS & PaaS products, enabling businesses with powerful but simple electronic signing, electronic identification, Anti-Money Laundering and Smart Forms capabilities.

Their culture is based on a lean and self-organizing attitude, trust, respect and flexibility with your way of life. Our client is looking for people without ego, who do not hide their strengths or their weaknesses, ready to help their teammates and share their ideas with the team. Their ambition is to become worldwide leaders in the digital eID market and to offer their partners the best experiences to suit their needs.

They are a growing, profitable and well-established company in Scandinavia with 100+ employees, and they are looking for a passionate, creative and tech-savvy colleague to join their new office in Bucharest, Romania.

Role

As a Site Reliability Engineer , you’ll play a key role in ensuring our systems remain secure, scalable, and resilient as we grow. You get to work with a bunch of great people, where the whole team is passionate about technology. You’ll design and maintain the infrastructure, security, and automation that power our 24/7 production environment.

Responsibilities

Take ownership of the infrastructure, security, and automation that underpin our 24/7 production operations, ensuring we remain robust and efficient as we scale.

Monitor availability and scalability, taking a holistic view of system health.

Collaborate closely with development teams to enhance services through effective communication, testing, and release practices.

Build and maintain software and systems that manage our platform infrastructure.

Measure, analyse, and optimise system performance – identifying opportunities to evolve our architecture and stay ahead of customer needs.

Continuously improve the reliability, quality, and delivery speed of our systems through proactive automation and monitoring.

Contribute to our reliability roadmap by sharing insights on the evolution and scaling of our environments.

Requirements

Strong focus on learning & innovation.
You share the desire to grow into improving security, resilience, and reducing vulnerabilities.
You are investigative in the tech approach.
You are a firm believer in the power of knowledge sharing and like helping others grow, perform better, and become stronger as professionals.

Soft skills:

You have an excellent level of English.
Curiosity, pragmatism, open-mindedness, adaptability and resourcefulness.
You are an excellent listener and able to relate to a wide range of stakeholders.
You have a good team spirit, you like to share what you learn and also learn from others.
You can work autonomously in small, distributed teams.

Current Tech Ecosystem – we’ll welcome your influence:
AWS | Amazon EKS | Event Store| Elasticsearch| BitBucket| MongoDB| Datadog Node.js,
JavaScript

You have the following skills and qualifications:

A minimum of 3 years of site reliability, DevOps, or similar experience, including responsibility for supporting Cloud production systems.
Comprehensive ability to develop, run and monitor production services on AWS (Experience with the core AWS services, including ALB, CloudWatch, DynamoDB, EC2, IAM, RDS, S3).
Expert knowledge of Infrastructure as Code, Config as Code.
Automation – you’re someone who loves to build efficient automations.
Experienced in planning and executing backup, restore and rollback operations.
A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
Problem-solving attitude.
Self-responsibility.
Strong communication and interpersonal skills.

Nice to have:

Exposure to microservice design
An appreciation of CI/CD principles
Analytical capabilities & customer focus

Apply today

If you meet the minimum requirements and are interested in applying for this position, please send your details to careers@key-talents.com with “SRE”, in the subject line.