Site Reliability Engineer

3 weeks ago


København, Denmark SPEEDECK Full time
Site Reliability Engineer (SRE) (Contract) Description Job Title : Production Environment Manager / Site Reliability Engineer (SRE) Type: Hybrid Term: Daily Rate Contract Long Term - Starts with 6 months Multiple number of positions Role Overview

The Production Environment Manager will be responsible for overseeing and improving the stability, performance, and reliability of production systems. This role focuses on ensuring efficient operations, resolving incidents, automating processes, and enhancing monitoring strategies to optimize platform performance. The candidate will be a key player in managing CI/CD pipelines, automating infrastructure processes, and mentoring junior resources.

This position requires a blend of hands-on technical expertise, problem-solving skills, and the ability to collaborate with global teams to drive platform resilience and reliability.

Key Responsibilities Production Management
  • Plan, manage, and oversee all aspects of the Production Environment to ensure system stability, availability, and performance.
  • Define and implement strategies for Application Performance Monitoring (APM), optimization, and proactive performance improvements.
  • Respond to production incidents, conduct root cause analysis, and implement fixes to reduce incident recurrence.
  • Measure and document incident reduction trends over time while enhancing system reliability.
Monitoring & Optimization
  • Design, develop, and standardize monitoring and alerting mechanisms to provide end-to-end visibility for production applications.
  • Take a holistic approach to problem-solving during production incidents, diagnosing issues across the entire technology stack to minimize recovery time.
  • Continuously analyze platform performance, identify operational gaps, and recommend improvements.
DevOps & CI/CD Support
  • Support the deployment of code across multiple environments (dev, staging, production).
  • Maintain and optimize CI/CD pipelines using tools like Jenkins and scripting languages (Groovy, YAML, Shell).
  • Ensure seamless software promotion into higher environments with operational gating and validations.
  • Lead automation initiatives across infrastructure, deployment, and monitoring to improve speed and efficiency.
System Reliability & Scaling
  • Improve system scalability through automation and sustainable system evolution.
  • Proactively measure and monitor availability, latency, and system health, ensuring high standards of performance.
  • Engage in end-to-end lifecycle management of services—from inception and design to deployment, operation, and optimization.
  • Participate in system design consulting, capacity planning , and launch readiness reviews.
  • Collaborate with globally distributed teams across multiple time zones and tech hubs.
  • Share knowledge with team members, mentor junior engineers, and foster a culture of learning and collaboration.
  • Conduct training sessions and workshops as needed to improve team understanding of processes, tools, and systems.
On-Call & Off-Hours Support
  • Perform on-call duties on a rotational basis, ensuring swift incident response and resolution.
  • Willingness to work off-hours for urgent incidents, deployments, or planned maintenance activities.
Requirements Must-Have Skills
  • Production Support Experience :
    • Proven experience in supporting cloud-based applications (AWS, Azure, GCP, etc.) in a production environment.
  • Automation & Configuration Management :
    • Expertise with Ansible or Chef for automating infrastructure and application processes.
  • Proficiency in managing CI/CD pipelines using tools like Jenkins .
  • Experience writing and troubleshooting Groovy scripting and YAML configurations.
  • Strong knowledge of Linux operating systems, including system troubleshooting, performance tuning, and shell scripting.
  • Scripting & Automation :
    • Proficiency in Shell scripting for automating workflows and resolving incidents.
  • Monitoring & Troubleshooting :
    • Hands-on experience designing monitoring solutions and resolving complex system issues across distributed systems.
  • Experience in responding to incidents, performing root cause analysis, and driving incident resolution processes.
Good-to-Have Skills
  • Experience working with observability platforms (e.g., Prometheus, Grafana, Splunk, Datadog).
  • Knowledge of Infrastructure-as-Code (IaC) tools like Terraform.
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Exposure to Agile methodologies and DevOps best practices.
  • Experience with capacity planning and system design consulting.
Soft Skills
  • Strong problem-solving and analytical skills, with the ability to diagnose issues across the technology stack.
  • Excellent verbal and written communication skills to collaborate with global teams.
  • Ability to mentor and train junior resources effectively.
  • Team player with a proactive mindset and passion for automating repetitive tasks.
  • Flexibility to work in a dynamic, fast-paced environment with occasional off-hours support.
Qualifications
  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience in Production Support, DevOps, or Site Reliability Engineering roles.
  • Relevant certifications (e.g., AWS/GCP/Azure, Ansible, Jenkins) are a plus.
Key Performance Indicators (KPIs)
  • Reduction in incident count and Mean Time to Recovery (MTTR).
  • Improved system uptime, performance, and availability.
  • Efficiency of CI/CD pipelines and automation processes.
  • Adoption and effectiveness of monitoring and alerting systems.
  • Contribution to knowledge sharing and team mentoring.
#J-18808-Ljbffr

  • København, Denmark e-conomic Danmark Full time

    Apply here Are you a driven Site Reliability Engineer with a passion for Infrastructure as Code and Automation? Would you like to contribute to the development of a modern infrastructure on Google Cloud? If so, we might have the right opportunity for you!The Scope of your role As our new Site Reliability Engineer, you’ll join the SRE team and work...


  • København, Denmark Dalux Full time

    Are you interested and experienced in taking observability capabilities to the next level? Then become a part of our exciting journey of building a new area from the ground up in our scale-up environment. Your new role We are looking for an experienced Site Reliability Engineer to take our observability capabilities to the next level. In this role, you...


  • København, Denmark Dixa Full time

    Permanent employee, Full-time · Copenhagen Your mission Do you want to work with a team that’s building a SaaS product and platform at the center of a major new movement towards conversational customer engagement and customer friendships? And do you want to be one of the first to come in and instil the right practices, processes and tools that will ensure...


  • København, Denmark The HubDanske Bank Full time

    Do you want to help the world EV better? We are on the lookout for a talented Senior Site Reliability Engineer. Is this you? As a Senior SRE at Monta, you will help build and manage the Infrastructure at Monta, mainly built on Kubernetes on AWS. You will also be helping and assisting people around the organization with Infrastructure and performance-related...

  • Test Site Technician

    2 months ago


    København, Denmark Orbex Space Full time

    At Orbex we are looking for a Test Site Technician to join our growing team in Copenhagen. With us, you will have the opportunity to support rocket engine and turbopump testing campaigns, production for propulsion systems and most of all, you will directly contribute to the maiden launch of Orbex Prime! It would be desirable that the applicant have a...


  • København, Denmark Inpay Full time

    The Role: We are seeking a highly skilled DevSecOps Senior Engineer to join our dynamic team. The ideal candidate will have strong experience in: Administration and configuration of Azure cloud services via IaaC Administrating Linux systems Database administration & optimisation (both SQL and noSQL) Monitoring, event and incident management Setting up...


  • København, Denmark AGC Inc Full time

    Our purpose is to bring hope to life by enabling life-changing therapies for patients around the globe, creating a healthier and happier tomorrow. Our mission is to work side by side with our customers in order to improve patients’ lives by bringing new biopharmaceuticals to market. We are looking for a Manager for our Manufacturing Equipment Maintenance...


  • København, Denmark Corti Full time

    There is no quality healthcare without a quality dialogue. Today, that dialogue is broken; we need you to help us fix it. Doctors and nurses across the world are facing unprecedented challenges. When we meet them, they're dealing with heavy workloads, extensive paperwork, and the pressure of performing well, which in healthcare, can have dire consequences....


  • København, Denmark Alstom Transport Danmark AS Full time

    Location: Copenhagen, DK Company: Alstom Req ID: 464121 At Alstom, we understand transport networks and what moves people. From high-speed trains, metros, monorails, and trams, to turnkey systems, services, infrastructure, signalling and digital mobility, we offer our diverse customers the broadest portfolio in the industry. Every day, more than 80,000...


  • København, Denmark The European Spallation Source (ESS) Full time

    Thank you for your interest in ESS. We hope that you will find a position matching your qualifications and interests. If you don't find what you are looking for this time, please come back. There will be several openings, in different areas, in the future. Get involved, you are needed. Together we will advance science so future generations can thrive!...


  • København, Denmark Radiometer Danmark Danaher Full time

    In our line of work, life isn’t a given - it’s the ultimate goal. When life takes an unexpected turn, our technology and solutions enable caregivers to make informed diagnostic decisions to improve patient care. This is our shared purpose at Radiometer and what unites all +4000 of us - no matter our roles or where in the world we’re located. Creating...


  • København, Denmark IBM Full time

    Introduction Technology sales at IBM is evolving its way of working to break beyond boundaries with innovative approaches. Preferring to ‘show’ vs. ‘tell’, Client Engineering co-creates with clients, in real-time, on solutions to solve their hardest business challenges.As a Platform Engineer within Client Engineering, you’ll be a key player in a...


  • København, Denmark IBM Client Innovation Center Full time

    Introduction Technology sales at IBM is evolving its way of working to break beyond boundaries with innovative approaches. Preferring to ‘show’ vs. ‘tell’, Client Engineering co-creates with clients, in real-time, on solutions to solve their hardest business challenges. As a Platform Engineer within Client Engineering, you’ll be a key player in a...

  • Lead Platform Engineer

    2 months ago


    København, Denmark Corti Full time

    There is no quality healthcare without a quality dialogue. Today, that dialogue is broken; we need you to help us fix it. Doctors and nurses across the world are facing unprecedented challenges. When we meet them, they're dealing with heavy workloads, extensive paperwork, and the pressure of performing well, which in healthcare, can have dire consequences....


  • København, Denmark LEGO Gruppe Full time

    Job Description Join us in building an extraordinary observability platform and help us continue to encourage and develop the builders of tomorrow. We seek an innovative software engineer to join our outstanding Observability Platform (OP) team of highly skilled engineers – Brave engineers with a mission to improve developer efficiency in our digital...


  • København, Denmark Alfa Laval Mid Europe GmbH Full time

    time left to apply End Date: February 2, 2025 (30+ days left to apply) job requisition id JR0034048 About the Job We are seeking a dedicated and detail-oriented Cost Estimation Engineer to join our team. In this role, you will be responsible for developing comprehensive cost estimates for our projects, ranging from small orders to large orders, within the...

  • MLOps Engineer

    2 days ago


    København, Denmark Pandora AS Full time

    Pandora is the World’s largest jewellery brand, known for affordable luxury, innovative design and our high-quality craftsmanship. Founded in 1982 in Copenhagen, Denmark, by Per Enevoldsen and his wife Winnie, Pandora started as a small jewelry shop and has since grown into the globally recognized brand it is today. At Pandora, we are transforming the...

  • Lead Engineer

    2 months ago


    København, Denmark Pandora AS Full time

    Lead the design, development and implementation of IBM OMS, IBM Web Store, IBM Web Call center applications. Architect, design and develop IBM OMS, WebStore and Web Callcenter solutions aligning with business needs. Serve as the subject matter expert in IBM Sterling applications, offering solutions for complex business scenarios and guiding engineers....


  • København, Denmark Ramboll Group Full time

    Hannemanns Allé 53, 2300 København, Denmark Full-time Job Description Do you want to be part of a global Offshore Wind Engineering department, joining a team of high performing, talented and engaged engineers at the forefront of innovative and complex design techniques? Do you aspire to taking responsibility for your own design deliverables, leading the...

  • RAM Engineer

    2 weeks ago


    København, Denmark ALSTOM Gruppe Full time

    Company: Alstom At Alstom, we understand transport networks and what moves people. From high-speed trains, metros, monorails, and trams, to turnkey systems, services, infrastructure, signalling and digital mobility, we offer our diverse customers the broadest portfolio in the industry. Every day, more than 80,000 colleagues lead the way to greener and...