Site Reliability Engineer

5 days ago


Copenhagen, Copenhagen, Denmark SPEEDECK Full time
Site Reliability Engineer (SRE) (Contract) Description Job Title : Production Environment Manager / Site Reliability Engineer (SRE) Type: Hybrid Term: Daily Rate Contract Long Term - Starts with 6 months Multiple number of positions Role Overview

The Production Environment Manager will be responsible for overseeing and improving the stability, performance, and reliability of production systems. This role focuses on ensuring efficient operations, resolving incidents, automating processes, and enhancing monitoring strategies to optimize platform performance. The candidate will be a key player in managing CI/CD pipelines, automating infrastructure processes, and mentoring junior resources.

This position requires a blend of hands-on technical expertise, problem-solving skills, and the ability to collaborate with global teams to drive platform resilience and reliability.

Key Responsibilities Production Management
  • Plan, manage, and oversee all aspects of the Production Environment to ensure system stability, availability, and performance.
  • Define and implement strategies for Application Performance Monitoring (APM), optimization, and proactive performance improvements.
  • Respond to production incidents, conduct root cause analysis, and implement fixes to reduce incident recurrence.
  • Measure and document incident reduction trends over time while enhancing system reliability.
Monitoring & Optimization
  • Design, develop, and standardize monitoring and alerting mechanisms to provide end-to-end visibility for production applications.
  • Take a holistic approach to problem-solving during production incidents, diagnosing issues across the entire technology stack to minimize recovery time.
  • Continuously analyze platform performance, identify operational gaps, and recommend improvements.
DevOps & CI/CD Support
  • Support the deployment of code across multiple environments (dev, staging, production).
  • Maintain and optimize CI/CD pipelines using tools like Jenkins and scripting languages (Groovy, YAML, Shell).
  • Ensure seamless software promotion into higher environments with operational gating and validations.
  • Lead automation initiatives across infrastructure, deployment, and monitoring to improve speed and efficiency.
System Reliability & Scaling
  • Improve system scalability through automation and sustainable system evolution.
  • Proactively measure and monitor availability, latency, and system health, ensuring high standards of performance.
  • Engage in end-to-end lifecycle management of services—from inception and design to deployment, operation, and optimization.
  • Participate in system design consulting, capacity planning , and launch readiness reviews.
  • Collaborate with globally distributed teams across multiple time zones and tech hubs.
  • Share knowledge with team members, mentor junior engineers, and foster a culture of learning and collaboration.
  • Conduct training sessions and workshops as needed to improve team understanding of processes, tools, and systems.
On-Call & Off-Hours Support
  • Perform on-call duties on a rotational basis, ensuring swift incident response and resolution.
  • Willingness to work off-hours for urgent incidents, deployments, or planned maintenance activities.
Requirements Must-Have Skills
  • Production Support Experience :
    • Proven experience in supporting cloud-based applications (AWS, Azure, GCP, etc.) in a production environment.
  • Automation & Configuration Management :
    • Expertise with Ansible or Chef for automating infrastructure and application processes.
  • Proficiency in managing CI/CD pipelines using tools like Jenkins .
  • Experience writing and troubleshooting Groovy scripting and YAML configurations.
  • Strong knowledge of Linux operating systems, including system troubleshooting, performance tuning, and shell scripting.
  • Scripting & Automation :
    • Proficiency in Shell scripting for automating workflows and resolving incidents.
  • Monitoring & Troubleshooting :
    • Hands-on experience designing monitoring solutions and resolving complex system issues across distributed systems.
  • Experience in responding to incidents, performing root cause analysis, and driving incident resolution processes.
Good-to-Have Skills
  • Experience working with observability platforms (e.g., Prometheus, Grafana, Splunk, Datadog).
  • Knowledge of Infrastructure-as-Code (IaC) tools like Terraform.
  • Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Exposure to Agile methodologies and DevOps best practices.
  • Experience with capacity planning and system design consulting.
Soft Skills
  • Strong problem-solving and analytical skills, with the ability to diagnose issues across the technology stack.
  • Excellent verbal and written communication skills to collaborate with global teams.
  • Ability to mentor and train junior resources effectively.
  • Team player with a proactive mindset and passion for automating repetitive tasks.
  • Flexibility to work in a dynamic, fast-paced environment with occasional off-hours support.
Qualifications
  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience in Production Support, DevOps, or Site Reliability Engineering roles.
  • Relevant certifications (e.g., AWS/GCP/Azure, Ansible, Jenkins) are a plus.
Key Performance Indicators (KPIs)
  • Reduction in incident count and Mean Time to Recovery (MTTR).
  • Improved system uptime, performance, and availability.
  • Efficiency of CI/CD pipelines and automation processes.
  • Adoption and effectiveness of monitoring and alerting systems.
  • Contribution to knowledge sharing and team mentoring.
#J-18808-Ljbffr

  • Copenhagen, Copenhagen, Denmark Usemover Full time

    Are you the go-to person for diagnosing complex systems with a calm, methodical approach? Do you thrive on solving high-impact challenges, ensuring systems are robust, reliable, observable and always available? Then you might be our next Site Reliability Engineer About Mover Mover is a high-tech Danish company revolutionizing the logistics industry. Founded...


  • Copenhagen, Copenhagen, Denmark e-conomic Danmark Full time

    Apply here Are you a driven Site Reliability Engineer with a passion for Infrastructure as Code and Automation? Would you like to contribute to the development of a modern infrastructure on Google Cloud? If so, we might have the right opportunity for youThe Scope of your role As our new Site Reliability Engineer, you'll join the SRE team and work closely...


  • Copenhagen, Copenhagen, Denmark Dalux Full time

    Are you interested and experienced in taking observability capabilities to the next level? Then become a part of our exciting journey of building a new area from the ground up in our scale-up environment. Your new role We are looking for an experienced Site Reliability Engineer to take our observability capabilities to the next level. In this role, you...


  • Copenhagen, Copenhagen, Denmark The HubDanske Bank Full time

    We're on the hunt for a seasoned Senior Site Reliability Engineer to join our team at Monta. As a key member of our Engineering Team, you will play a pivotal role in building and managing our infrastructure.Company OverviewMonta's powerful SaaS platform connects the dots in the entire EV charging ecosystem. With an engineering-driven culture and a team...


  • Copenhagen, Copenhagen, Denmark Dalux Full time

    Job DescriptionWe are looking for an experienced Site Reliability Engineer to take our observability capabilities to the next level. In this role, you will be responsible for enhancing the observability of our SaaS applications in AWS.About the RoleImplement monitoring and alarms in our AWS production setup, as well as fine-tune the current setup.Collaborate...


  • Copenhagen, Copenhagen, Denmark Dixa Full time

    Our MissionWe're on a mission to eliminate bad customer service and create a world where all people are welcomed by their favorite brands with the warm familiarity of a friend. Our Conversational Customer Service Platform combines powerful AI with a human touch to deliver personalized-service experiences that scale.About the Role:We're seeking a skilled Site...


  • Copenhagen, Copenhagen, Denmark The HubDanske Bank Full time

    Do you want to help the world EV better? We are on the lookout for a talented Senior Site Reliability Engineer. Is this you? As a Senior SRE at Monta, you will help build and manage the Infrastructure at Monta, mainly built on Kubernetes on AWS. You will also be helping and assisting people around the organization with Infrastructure and performance-related...


  • Copenhagen, Copenhagen, Denmark Dixa Full time

    Permanent employee, Full-time · Copenhagen Your mission Do you want to work with a team that's building a SaaS product and platform at the center of a major new movement towards conversational customer engagement and customer friendships? And do you want to be one of the first to come in and instil the right practices, processes and tools that will ensure...


  • Copenhagen, Copenhagen, Denmark SPEEDECK Full time

    About the TeamSPEEDECK's DevOps and Reliability team is responsible for ensuring the smooth operation of our cloud-based systems. We are seeking an experienced DevOps and Reliability Expert to join our team.Responsibilities:Collaborate with cross-functional teams to identify and prioritize infrastructure requirements.Develop and maintain automated deployment...


  • Copenhagen, Copenhagen, Denmark e-conomic Danmark Full time

    About the JobWe are seeking a highly skilled Reliability Engineering Lead to join our team at e-conomic Danmark. As a key member of our Infrastructure Team, you will lead the design and implementation of reliable and scalable infrastructure solutions, ensuring high availability and minimal downtime for our cloud-based accounting product.The successful...


  • Copenhagen, Copenhagen, Denmark Usemover Full time

    About UsMover is a high-tech Danish company revolutionizing the logistics industry. Our mission is to change logistics for good: creating smart solutions that reduce waste and make people's lives better.We are a diverse, passionate team of over 85 talents who enjoy challenging the status quo and delivering real-world impact.The JobAs a Site Reliability...


  • Copenhagen, Copenhagen, Denmark Weibel Scientific AS Full time

    Join us at Weibel Scientific A/S, where we develop innovative solutions for our customers. As a Reliability and Maintainability Engineer, you will be part of our Aftermarket team, contributing to the establishment of Integrated Logistics Support (ILS) deliveries.Main TasksConduct reliability and maintainability analyses for complex projects.Develop and...


  • Copenhagen, Copenhagen, Denmark Dalux Full time

    Job SummaryWe are seeking an experienced Site Reliability Engineer to join our team at Dalux. In this role, you will be responsible for taking our observability capabilities to the next level by enhancing the observability of our SaaS applications in AWS.About the TeamOur team is passionate about delivering high-quality software and ensuring that our systems...


  • Copenhagen, Copenhagen, Denmark Weibel Scientific AS Full time

    Do you want to join Weibel and make a difference for colleagues and customers as our new ILS Engineer? Weibel develops and manufactures the world´s most advanced Doppler radar systems for costumers across most of the world. We can offer an exciting opportunity where you get to use your technical skills in an innovative environment where our goal is to push...


  • Copenhagen, Copenhagen, Denmark Usemover Full time

    About MoverWe are a high-tech Danish company with a mission to change logistics for good. Our team of over 85 talents is passionate about challenging the status quo and delivering real-world impact.The RoleWe're looking for a Site Reliability Engineer to join our Platform Engineering team. As a Site Reliability Engineer, you'll be responsible for designing...


  • Copenhagen, Copenhagen, Denmark Radiometer Danmark Danaher Full time

    About the RoleWe are seeking a highly skilled Test Engineer to join our Projects, NPI and Manufacturing Technology department. As a key member of the team, you will play a critical role in NPI projects and ongoing production of blood gas and immunoassay analyzers. Your primary focus will be on ensuring the functionality and reliability of testing procedures,...


  • Copenhagen, Copenhagen, Denmark Inpay Full time

    The Role: We are seeking a highly skilled DevSecOps Senior Engineer to join our dynamic team. The ideal candidate will have strong experience in: Administration and configuration of Azure cloud services via IaaC Administrating Linux systems Database administration & optimisation (both SQL and noSQL) Monitoring, event and incident management Setting up...


  • Copenhagen, Copenhagen, Denmark Inpay Full time

    Job DescriptionInpay is a cross-border payments company connecting businesses and communities to a global banking network.We are seeking a highly skilled DevSecOps Senior Engineer to join our dynamic team. This role combines the principles of DevOps, security (DevSecOps), and Site Reliability Engineering (SRE) to ensure the reliability, security, and...


  • Copenhagen, Copenhagen, Denmark IPS-Integrated Project Services Full time

    Are you a skilled professional looking for a challenging role in Construction Management? Do you have a passion for delivering high-quality projects and exceeding client expectations?About IPS:IPS-Integrated Project Services is a global leader in providing consultancy services, architecture, engineering, project controls, construction management, and...

  • Site HSE Manager

    4 days ago


    Copenhagen, Copenhagen, Denmark Orion Group Full time

    Direct message the job poster from Orion Group Orion has a unique opportunity for a health & safety professional to join a global player in the power generation, transmission & distribution sector. Position: Site HSE Manager Summary In this role, you will support a project site in the Nordic cluster, supported by the local HSE manager, and focus on the HSE...