Senior Hybrid Evaluation / Test Environment Lead (CXR VLM/LLM)
FalconSmartIT · Remote · Tilburg
Job description
Job Title: Senior Hybrid Evaluation / Test Environment Lead (CXR VLM/LLM)
Job Location: Eindhoven, Netherlands (High Tech Campus 52, 5656 AG Eindhoven)
Job Type: Contract 6-12 months
Is it Onsite/Remote/Hybrid: Hybrid 3 days a week at Philips office (address mentioned)
JDs:
These roles are intentionally scoped as senior, accountable positions to ensure the ChestIQ programme can move fast without compromising safety, evidence quality, or regulatory credibility.
We expect:
- At least 10 years of experience in relevant or adjacent domains
- Excellent communication skills
- Working autonomously/independently
- Thought leadership
Job Description:
This role focuses on independent evaluation and controlled experimentation, separate from core model development, to support evidence generation and safe iteration.
Purpose of the role:
Design, build, and operate a hybrid evaluation and test environment for CXR VLM/LLM models, enabling systematic testing of model functionality, edge cases, and performance across findings without interfering with the main development pipeline.
Key capabilities
- Experience setting up model evaluation sandboxes or test harnesses for AI/ML systems, ideally in medical imaging.
- Ability to test and compare multiple CXR VLM/LLM variants across findings (e.g. pneumothorax, cardiomegaly, fractures) using consistent protocols.
- AWS experience and familiarity with cloud based dev environment.
- Familiarity with report-level evaluation, discrepancy analysis, and structured comparison between AI-generated outputs and clinician-validated references.
- Comfortable working at the intersection of engineering, clinical logic, and governance, feeding findings into monitoring, change management, and validation processes.
- Senior judgement to distinguish exploratory testing from evidence that is eligible for regulated use.
Regards,
Rachana
ML/AI Work links you to the employer's original posting — always verify the details there before applying.