Structured evaluation of data use and model performance.
Structured evaluation of data use and model performance.
AI in radiology is transforming diagnostic workflows, but without independent validation, even high-performing models can produce biased or unsafe results. Poor dataset diversity or non-transparent testing can delay regulatory approval and erode clinician trust.
As AI tools become increasingly prevalent, it is essential to demonstrate that these tools have been evaluated using relevant data and tested with appropriate methods. Our Independent Algorithm Assessment and Dataset Testing Service provides objective, evidence-based analysis of AI model performance, using either vendor-supplied or independently sourced data.
Find out how we can support your radiology tools Contact Us
This service supports organisations that require external assurance on radiology AI tools, including a qualitative assessment of the dataset used for evaluation as well as transparent reporting of the algorithm performance metrics. All data used in the assessment process is stored and managed securely in accordance with relevant standards. Where appropriate, relevant information, including underlying data, can be securely shared with regulatory bodies to support conformity assessments and oversight.
Proven expertise: TÜV SÜD is a trusted leader in testing, inspection, and certification, with decades of experience assuring safety and performance in regulated sectors. Our experts bring deep technical and regulatory knowledge to the evaluation of AI systems, ensuring assessments are robust, transparent, and aligned with emerging international standards.
Independent and impartial: As a third-party conformity assessment body, TÜV SÜD provides unbiased evaluation of AI tools. Our assessments are designed to build confidence among regulators, clinicians, and end users - helping vendors demonstrate the reliability and generalisability of their AI solutions.
Radiology and beyond: Our service is designed for AI in radiology developers and can be extended across other healthcare domains where performance, fairness, and data quality are critical. Evaluations can use vendor-supplied data or independently curated NHS datasets, ensuring realistic and reproducible performance evidence.
Customised assessment approach: We tailor each evaluation to the maturity and intended use of the AI tool, applying consistent, standards-aligned methodologies for dataset auditing, performance benchmarking, and reproducibility testing.
Dedicated coordination and support: You will benefit from a single point of contact throughout your project, ensuring clarity of scope, efficient delivery, and secure data handling in line with ISO/IEC 27001 and relevant data governance frameworks.
We offer independent evaluation of radiology AI tools through:
Our service provides version-controlled AI model testing, enabling comparative assessments of model updates or changes in data acquisition processes. This allows AI developers to evaluate performance stability over time and determine when recertification may be required.
Designed with input from regulatory consultation, the platform ensures traceability, reproducibility, and alignment with assurance best practices. It can support regulatory sandboxes, NHS AI testbeds, or third-party independent evaluation frameworks, offering a robust infrastructure for transparent and credible assessment of healthcare AI tools.
For breast imaging AI tools, we offer a unique capability to perform independent testing using high-quality, curated datasets from NHS sources. This includes:
This enables AI developers to demonstrate algorithm performance on real-world and clinically relevant data that is completely independent from training data, enhancing transparency and confidence in their results. There is work underway to expand data collection activities to other data types - please enquire for further details.
Credible evidence of AI performance
Provides independent, verifiable evidence that AI systems perform as intended, enabling vendors to demonstrate reliability and effectiveness to any audience.
Sales support and customer confidence
Equips vendors with validated results to demonstrate performance to prospective clients, accelerating customer decision-making by reducing uncertainty around AI capabilities.
Regulatory and compliance readiness
Generates essential evidence supporting regulatory submissions or compliance requirements while mitigating risk and simplifying interactions with regulators and auditors.
Competitive advantage
Leverages validated AI performance metrics to establish clear differentiation from competitors, reinforcing messaging focused on quality, transparency, and reliability.
Flexible use of evidence application
Delivers a versatile asset supporting multiple business objectives, allowing vendors to enhance their tools, demonstrate value to customers, or satisfy regulatory requirements.
Yes. TÜV SÜD offers independent testing for breast radiology AI tools using curated NHS imaging datasets with expert clinical annotations. This allows developers to demonstrate algorithm performance on real-world, clinically validated data.
Yes. The structured evaluation report can be used as supporting evidence for regulatory conformity assessments under MHRA, EU MDR, or FDA frameworks. It includes dataset details, test methods, and quantitative performance results.
All data is managed according to ISO/IEC 27001 and relevant data-protection laws. TÜV SÜD uses secure, controlled-access environments for storage and analysis, ensuring that client data and algorithm information remain strictly confidential.
AI models should be reassessed whenever algorithms, datasets, or intended uses change. TÜV SÜD’s version-controlled testing framework enables developers to compare model iterations and determine when recertification is required.
TÜV SÜD combines deep medical device certification expertise with independent AI evaluation capabilities. Our regulator-aligned approach, secure infrastructure, and access to curated NHS datasets deliver transparent and credible evidence of performance.
Following completion of the assessment, we provide a structured report that includes:
This report may be used to support internal assurance, documentation for buyers, or as part of evidence generation for external conformity assessments.
Independent verification provides the credible evidence vendors need to demonstrate that their AI radiology tools perform reliably across diverse clinical scenarios. Our expert assessments test AI models against a representative dataset and compare outputs to established ground truth, delivering validated performance results that build customer confidence, support regulatory compliance, and help your product stand out with clear evidence of accuracy and reliability.
Contact us to discuss your requirements or to request an initial scope assessment.
Opportunities and challenges in applying the EU AI Act 2024/1689 to AI-driven medical technologies
Learn More
Site Selector
Global
Americas
Asia
Europe
Middle East and Africa