My client, a multinational organisation operating within a complex enterprise technology environment, is seeking an experienced Systems Reliability Engineer to improve the resilience of critical applications and services. The role focuses on assessing application designs against established resilience patterns and standards, identifying gaps, and driving improvements across architecture, processes, and tooling.
You will collaborate with application owners, cloud and infrastructure teams, and service continuity stakeholders to enhance system resilience, disaster recovery readiness, and post-incident learning through structured analysis and governance.
Key Responsibilities
As a Systems Reliability Engineer, you will support the delivery of resilient, highly available systems by:
Evaluating applications and services against defined resilience architecture patterns and requirements
Communicating resilience standards and requirements to application owners and technical teams
Identifying, recommending, and implementing improvements to application design patterns and resilience templates
Driving simplification and automation initiatives to improve resilience processes
Administering disaster recovery applications, tools, and methodologies
Contributing to the development and maintenance of service continuity documentation, including business impact analyses, resilience and DR plans, and executive-level status and risk reports
Tracking, reviewing, and reporting resilience metrics and risk indicators
Partnering with service continuity leadership to assess resilience gaps, including priority, impact, and risk
Participating in post-incident reviews and Root Cause Analysis to ensure findings are fully documented, tracked, and resolved
Determining remediation strategies for identified resilience gaps and reporting outcomes to business and technology leaders
Developing training materials and delivering training to application owners on resilience assessment processes
Implementing, maintaining, and supporting DR tooling and resilience assessment frameworks
Additional Responsibilities
Work effectively in ambiguous or unstructured situations
Anticipate stakeholder needs and proactively propose solutions
Contribute to a collaborative environment where people and technology succeed together
Act in accordance with organisational policies, ethics, and standards
Your Profile
Basic Qualifications
Bachelor’s degree in Information Technology, Systems Engineering, Cloud Architecture, Cybersecurity, Emergency Management, or a related discipline
Strong experience delivering technical analysis and system reviews
Solid background in cloud engineering and modern infrastructure environments
Preferred Skills and Experience
Experience with ServiceNow, including Business Continuity Management functionality
Understanding of large-scale or global infrastructure environments
Knowledge of information security, resilience, and disaster recovery practices
Experience designing and supporting highly available cloud-based architectures
Awareness of emerging infrastructure and resilience technologies
Ability to align technology resilience initiatives with broader business objectives
The role offers a competitive salary structure, including bonus, pension, and health care cover, along with life insurance, a laptop, a phone, and access to extensive training resources. Company discounts, on-site parking, and additional employee benefits are also available.
This is a permanent role based in Letterkenny, Co. Donegal. It follows a hybrid working model: three days per week on-site, with the remainder remote if desired.
Candidates must be eligible to work in Ireland or the EU.
For further details or to express interest, please contact David Coyle at 01 6351748 or via email at david@methodius.com.