AI Operations Engineer (R-19381)
Dun & Bradstreet · Drogheda, IE
Job description
Shape the Future with Dun & Bradstreet
At Dun & Bradstreet, we believe data has the power to create a better tomorrow. As a global leader in business decisioning data and analytics, we help companies worldwide grow, manage risk, and innovate. For over 180 years, businesses have trusted us to turn uncertainty into opportunity. We’re a diverse, global team that values creativity, collaboration, and bold ideas. Are you ready to make an impact and help shape what’s next? Join us! Explore opportunities at dnb.com/careers.
The AI Operations Engineer is responsible for supporting the reliability and operational intelligence of cloud-hosted and AI-enabled services. This role focuses on observability, CI/CD-integrated reliability, alerting, and automated remediation to reduce noise, detect issues early, and improve incident response.
What’s on Offer at D&B Ireland
- 25 days annual leave (plus 2 paid volunteer days & 1 paid un-sick day)
- Holiday buy & sell (the option to buy or sell up to 5 additional days per year)
- Flexible working - hybrid model
- Employee Health Insurance
- Mental Health Support program
- Pension Contribution
- Family Friendly Leave (Maternity, Paternity, Parental, Marriage and Bereavement)
- Life Assurance
- Educational Assistance Program
- Life-Style Account (D&B will match your contributions up to €40 per month; the account can be used to claim for a range of health-related, leisure, or lifestyle activities)
At Dun & Bradstreet, we are 6,000 friendly colleagues around the world waiting to meet you and give you the opportunity to grow your career.
Responsibilities:
- Build and maintain observability standards (logs, metrics, traces, events) and dashboards using Splunk Observability.
- Configure and tune alerts and SLOs to reduce noise and improve signal quality.
- Embed observability and reliability checks into CI/CD pipelines.
- Analyze telemetry to detect anomalies and support faster incident triage and root-cause analysis.
- Implement automated runbooks and remediation workflows using scripts and tooling.
- Operate and optimize telemetry pipelines with a focus on data quality, scale, and cost efficiency.
- Support monitoring of AI/LLM services for latency, errors, and cost anomalies where applicable.
- Own and evolve observability platforms, standards, dashboards, alerting strategies, and SLOs.
Essential skills and/or Certifications:
- Bachelor’s degree in Computer Science, Artificial Intelligence, or a related field.
- Hands-on experience with observability and monitoring tools (e.g., Splunk).
- Experience working in cloud-native environments (GCP preferred).
- Experience with CI/CD pipelines and automation (Python, Bash, or similar).
- Solid understanding of production incidents and operational workflows.
- Deep experience with automation, event correlation, and auto-remediation.
- Proficiency in the Microsoft Office suite.
- Show an ownership mindset in everything you do: be a curious, proactive problem solver who takes action and seeks ways to collaborate and connect with people and teams to drive success.
- Maintain a continuous growth mindset: keep learning through social experiences and relationships with stakeholders, experts, colleagues, and mentors, and broaden your competencies through structured courses and programs.
- Where applicable, fluency in English and languages relevant to the working market.
All employees and contractors working in D&B should be aware that they have responsibilities in relation to the Company’s Business Management System. This relates to information and its security, quality, environment and health and safety both during and post-employment with D&B.
Dun & Bradstreet is an Equal Opportunity Employer
All Dun & Bradstreet job postings can be found at https://jobs.lever.co/dnb. Official communication from Dun & Bradstreet will come from an email address ending in @dnb.com.
Notice to Applicants: Please be advised that this job posting page is hosted and powered by Lever, a subsidiary of Employ Inc. Your use of this page is subject to Employ's Privacy Notice and Cookie Policy, which governs the processing of visitor data on this platform.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please visit https://bit.ly/3LMn4CQ.