SRE Manager
Radisson Hotel Group, Madrid Office - Information Technology
Full Time
Hybrid remote
7 Years Experience
Salary: To be discussed
Skills
Team Leadership
Site Reliability Engineering (SRE)
Scripting (Python, Go, Bash)
Cloud Platforms (Azure, AWS, GCP)
Fluent in English
Description

Radisson Hotel Group is a leading hospitality company serving as a true host and best partner to guests, owners, business partners and talent. Our unique hotel brands offer award-winning and exceptional hotel experiences, originating from our strong Scandinavian heritage of design and innovation. Our brands embody our modern vision of hospitality, including authentic local tastes, stylish living design, unique locations and vibrant social scenes.

Radisson Hotel Group brings a refreshed commitment to hospitality leadership to meet the changing travel industry and the bespoke needs of our guests. We provide exceptional service in all of our hotels across the globe and strive to deliver a hospitality experience that is beyond guest expectations.

Role purpose:

The SRE Manager ensures the reliability, scalability, and performance of Radisson Hotel Group’s digital web and app platforms.

To achieve this, the role will:

              1. Lead and mentor the SRE team to design, implement, and operate resilient systems.

              2. Establish and enforce best practices for monitoring, incident response, automation, and capacity planning.

              3. Partner with product, engineering, and infrastructure teams to embed reliability into the software development lifecycle.

Resulting in:

              1. Highly available and performant digital platforms that enhance guest experience.

              2. Reduced downtime and faster incident resolution across services.

              3. A culture of reliability, automation, and continuous improvement within Digital Services.

Roles/Responsibilities

·        Lead, coach, and grow a team of SREs, fostering a culture of ownership, collaboration, and innovation.

·        Drive automation of operational tasks, deployments, and monitoring to reduce manual effort and human error.

·        Oversee incident management processes, ensuring timely communication, root cause analysis, and postmortems.

·        Collaborate with software engineering, product, and infrastructure teams to design scalable, secure, and reliable systems.

·        Report on system health, reliability metrics, and operational risks to senior leadership.

Job requirements and qualifications:

Location:

·        Madrid, Spain.

Language skills:

·        Fluency in English is a must.

Must have experience

·        7+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles.

·        2+ years in a leadership or managerial role, leading distributed teams.

·        Proven track record of managing mission-critical, customer-facing digital platforms.

·        Experience with hybrid cloud environments (Azure, AWS, GCP).

·        Strong knowledge of observability tools (Dynatrace, Prometheus, Grafana, Splunk, etc.).

·        Expertise in automation and Infrastructure-as-Code (Terraform, Ansible, Pulumi).

·        Familiarity with CI/CD pipelines, Kubernetes, and microservices architectures.

Desirable experience

·        Hospitality, travel, or e-commerce industry background

·        Solid understanding of networking, security, and distributed systems.

·        Expertise in scripting languages (Python, Go, Bash).

Travel needs

·        Approximately 10% travel to Madrid and/or Brussels HQ.

Soft skills:

·        Strong leadership and people management skills

·        Excellent communication and stakeholder management

·        Strategic thinker with hands-on problem-solving ability

·        Ability to thrive in a fast-paced, global, customer-centric environment

Education:

·        University Degree in Computer Science, Engineering, or related field

·        Cloud, Agile, and/or DevOps certifications preferred.


