Uniting data providers
and AI-innovators for
healthcare breakthroughs

Why our network?
Building a network based
on trust and relevance
Why our platform?
It is a match!
Use case categories
AI for Cancer Detection
Data / Service Request
Data Modality
MRI for cancer detection and corresponding EHR / EMR data with DICOM and EHR textual/structured data
Medical Area
Oncology
Data Diversity requirements
1,000 – 10,000 sample size
At least 1 cycle of <confidential> therapy
5 countries located within the European Union excluding the DACH region
Solution
The request coming from a leading organisation within the DACH region of Europe was aimed at addressing the company’s objective to train an advanced AI algorithm capable of generating synthetic versions of the liver MR hepatobiliary phase.
Understanding the precise data requirements and the intended impact, Hypherdata supported the company in a few ways:
– Leveraged our existing network of hospitals pan-EU to provide a diverse and extensive dataset of liver MRI scans with comprehensive coverage of the hepatobiliary phase. The dataset encompassed a broad range of patient demographics, liver conditions, and imaging protocols, ensuring robust training of the AI algorithm.
– Ensured available data is annotated, de-identified and cleaned towards a compliant data exchange
– Shortened the search by identifying diverse hospitals from our network, addressing more complex inclusion and exclusion criteria questions and ensuring the first introduction is a useful one to take significant strides ahead.
The hepatobiliary phase is particularly valuable for assessing liver lesions, such as hepatocellular carcinoma (the most common form of liver cancer) and metastases. A successful outcome could revolutionize liver imaging by significantly shortening the examination process while leveraging artificial intelligence to generate the later phases of the MRI and improving overall efficiency.
Building AI solution
Data / Service Request
Data Modality
Echocardiogram (ECG) for 45 diagnoses
Medical Area
Cardiology
Data Diversity requirements
10,000 – 100,000 sample size
Minimum 3 geographic locations spanning 3 continents
Solution
At Hypherdata, we have addressed a leading company’s request for data to build their AI-driven Cardiology Solution. Our network of global data providers, both hospitals and clinicians, have a diverse and extensive dataset of electrocardiograms (ECGs).
With global healthcare data coverage, we can meet the needs of data diversity geographically, and with several other inclusion and exclusion criteria. This diverse data pool with a robust sample size ensured a well-rounded representation of cardiac conditions, considering variations in demographics, lifestyles, and healthcare practices worldwide.
Through our network of service providers for annotation and data cleaning, we ensure that the dataset provided is always of exceptional quality. Accurate and well-annotated data minimizes bias, enhances model interpretability, and boosts the overall reliability of their AI solution.
Once a match has been made, companies are invited to our deal room where companies efficiently navigate the contractual aspects of accessing the data, annotations, and cleaning services. Our pre-established frameworks, templates, and structures facilitate smooth negotiations, allowing them to focus on their core AI development efforts.
Cardiology ECG
Data / Service Request
Data Modality
Echocardiogram (ECG) for 45 diagnoses
Medical Area
Cardiology
Data Diversity requirements
10,000 – 100,000 sample size
Minimum 3 geographic locations spanning 3 continents
Solution
At Hypherdata, we have addressed a leading company’s request for data to build their AI-driven Cardiology Solution. Our network of global data providers, both hospitals and clinicians, have a diverse and extensive dataset of electrocardiograms (ECGs).
With global healthcare data coverage, we can meet the needs of data diversity geographically, and with several other inclusion and exclusion criteria. This diverse data pool with a robust sample size ensured a well-rounded representation of cardiac conditions, considering variations in demographics, lifestyles, and healthcare practices worldwide.
Through our network of service providers for annotation and data cleaning, we ensure that the dataset provided is always of exceptional quality. Accurate and well-annotated data minimizes bias, enhances model interpretability, and boosts the overall reliability of their AI solution.
Once a match has been made, companies are invited to our deal room where companies efficiently navigate the contractual aspects of accessing the data, annotations, and cleaning services. Our pre-established frameworks, templates, and structures facilitate smooth negotiations, allowing them to focus on their core AI development efforts.
Data Aggregation & Integration 1
Data / Service Request
Data Modality
Multimodal neuroimaging data for biomarker identification / validation with demographic information
Cleaning and Standardising available data from multiple projects in UK, US and Cuba
Anonymized EHR information for demographic, unaggregated
Medical Area
Neurology
Data Diversity requirements
100,000+
Wide age distribution of the dataset
Globally representative sample
Solution
Aimed at aiding research around biomarkers from day 115 to age 100 with the potential to support AI solutions in this area, a company had reached out to Hypherdata with a large set of requirements as phase 1 of a multi-phase project.
Understanding the precise data requirements and the intended impact, Hypherdata supported the company in a few ways
– Leveraged our existing network of hospitals and partners to provide a diverse and extensive multimodal imaging dataset including but not limited to MRIs, PETs, Clinical trial information and EHR / EMR data
– Ensured the available data is de-identified, annotated, and cleaned towards a compliant data exchange
– Shortened the search by identifying diverse healthcare providers from our network especially those with digital capabilities and intent to partner research, addressing more complex inclusion and exclusion criteria questions and ensuring the first introduction is a useful one to take significant strides ahead.
Data Aggregation & Integration 2
Data / Service Request
Data Modality
Comprehensive Datasets including participant characteristics, screening exam results, diagnostic procedures, lung cancer, and mortality
CT scans and / or Pathology images
Cleaned and annotated for quicker usage
Medical Area
Otorhinolaryngology / Lung Cancer
Data Diversity requirements
North America representing multiple healthcare providers
Solution
Hypherdata works closely with AI solutions who have delivered solutions and / or products to specific markets, but continuously work to improve the efficacy of their algorithm through a continuous flow of newer and more diverse data. Diversity at different cycles of market expansion imply different modalities which either ensure improvement of the product, or greater innovation in the route to, or type of, solution.
A typical agreement with a hospital can take over 2 years to materialize, with an additional 10-12 months typically required by a developer to clean and annotate the data to align with existing standards.
With Hypherdata’s network, companies who are delivering AI solutions and improving upon such solutions simultaneously benefit from quick introductions to healthcare providers and data aggregators, allowing for accelerated search and outcome. Hypherdata’s network comprises companies offering Data Cleaning and Data Annotation for AI services, and a quick round of introductions later, the process became defined, purpose-oriented and quick, saving time and resources. The team and platform’s competence includes contract management and a secure deal room enabling a more streamlined approach.
Data Annotation
Data / Service Request
Data Modality
Comprehensive Datasets including participant characteristics, screening exam results, diagnostic procedures, lung cancer, and mortality
CT scans and / or Pathology images
Cleaned and annotated for quicker usage
Medical Area
Otorhinolaryngology / Lung Cancer
Data Diversity requirements
North America representing multiple healthcare providers
Solution
Hypherdata works closely with AI solutions who have delivered solutions and / or products to specific markets, but continuously work to improve the efficacy of their algorithm through a continuous flow of newer and more diverse data. Diversity at different cycles of market expansion imply different modalities which either ensure improvement of the product, or greater innovation in the route to, or type of, solution.
A typical agreement with a hospital can take over 2 years to materialize, with an additional 10-12 months typically required by a developer to clean and annotate the data to align with existing standards.
With Hypherdata’s network, companies who are delivering AI solutions and improving upon such solutions simultaneously benefit from quick introductions to healthcare providers and data aggregators, allowing for accelerated search and outcome. Hypherdata’s network comprises companies offering Data Cleaning and Data Annotation for AI services, and a quick round of introductions later, the process became defined, purpose-oriented and quick, saving time and resources. The team and platform’s competence includes contract management and a secure deal room enabling a more streamlined approach.
Data Cleaning
Data / Service Request
Data Modality
Comprehensive Datasets including participant characteristics, screening exam results, diagnostic procedures, lung cancer, and mortality
CT scans and / or Pathology images
Cleaned and annotated for quicker usage
Medical Area
Otorhinolaryngology / Lung Cancer
Data Diversity requirements
North America representing multiple healthcare providers
Solution
Hypherdata works closely with AI solutions who have delivered solutions and / or products to specific markets, but continuously work to improve the efficacy of their algorithm through a continuous flow of newer and more diverse data. Diversity at different cycles of market expansion imply different modalities which either ensure improvement of the product, or greater innovation in the route to, or type of, solution.
A typical agreement with a hospital can take over 2 years to materialize, with an additional 10-12 months typically required by a developer to clean and annotate the data to align with existing standards.
With Hypherdata’s network, companies who are delivering AI solutions and improving upon such solutions simultaneously benefit from quick introductions to healthcare providers and data aggregators, allowing for accelerated search and outcome. Hypherdata’s network comprises companies offering Data Cleaning and Data Annotation for AI services, and a quick round of introductions later, the process became defined, purpose-oriented and quick, saving time and resources. The team and platform’s competence includes contract management and a secure deal room enabling a more streamlined approach.
Data Security & Privacy
Data / Service Request
A mid-sized hospital is entering into a collaboration with a technology company specialized in providing medical recommendation systems in cancer research. Using AI, a new system is being developed to help the oncology department better/faster predict and diagnose breast cancer cases. The existing collection of medical data, collected and maintained by the hospital IT department, will be used to train this new AI system.
The Hospital’s management and IT department are looking for a sound strategy and roadmap to identify potential risks surrounding the data’s usage by the AI company, map out the technical project from contracts to continuous transfer of data for AI refinement and input of AI services for predictive outcomes. Both organisations also wish to address compliance with national and international data privacy and security regulations.
Solution
At Hypherdata, we have access to in-depth expertise on the constraints and conditions of using medical data when applying AI and Machine Learning within healthcare systems. Fuelled by the broad range of solution partners we have, our clients get access to new alternative solutions that can minimize the risks and costs when solving new challenges.
For instance, in the case of data privacy, our customers not only find the correct answer but also get access and insight into other better solutions such as:
– Applying Synthetic data to eliminate the risk of data privacy violations
– New techniques to more efficiently apply de-identification of data and data augmentation
– Applying innovative data architectures where AI algorithms/models can move to the source of data instead of having to transfer data or duplicate data sourcesBuilding the right data privacy framework is not only about complying with industry and national regulations but a necessary step to build trust, protect reputation, and develop new revenue streams in healthcare.
Imaging Data
Data / Service Request
Data Modality
MRI for cancer detection and corresponding EHR / EMR data with DICOM and EHR textual/structured data
Medical Area
Oncology
Data Diversity requirements
1,000 – 10,000 sample size
At least 1 cycle of <confidential> therapy
5 countries located within the European Union excluding the DACH region
Solution
The request coming from a leading organisation within the DACH region of Europe was aimed at addressing the company’s objective to train an advanced AI algorithm capable of generating synthetic versions of the liver MR hepatobiliary phase.
Understanding the precise data requirements and the intended impact, Hypherdata supported the company in a few ways:
– Leveraged our existing network of hospitals pan-EU to provide a diverse and extensive dataset of liver MRI scans with comprehensive coverage of the hepatobiliary phase. The dataset encompassed a broad range of patient demographics, liver conditions, and imaging protocols, ensuring robust training of the AI algorithm.
– Ensured available data is annotated, de-identified and cleaned towards a compliant data exchange
– Shortened the search by identifying diverse hospitals from our network, addressing more complex inclusion and exclusion criteria questions and ensuring the first introduction is a useful one to take significant strides ahead.
The hepatobiliary phase is particularly valuable for assessing liver lesions, such as hepatocellular carcinoma (the most common form of liver cancer) and metastases. A successful outcome could revolutionize liver imaging by significantly shortening the examination process while leveraging artificial intelligence to generate the later phases of the MRI and improving overall efficiency.
Improve AI Solution
Data / Service Request
Data Modality
Comprehensive Datasets including participant characteristics, screening exam results, diagnostic procedures, lung cancer, and mortality
CT scans and / or Pathology images
Cleaned and annotated for quicker usage
Medical Area
Otorhinolaryngology / Lung Cancer
Data Diversity requirements
North America representing multiple healthcare providers
Solution
Hypherdata works closely with AI solutions who have delivered solutions and / or products to specific markets, but continuously work to improve the efficacy of their algorithm through a continuous flow of newer and more diverse data. Diversity at different cycles of market expansion imply different modalities which either ensure improvement of the product, or greater innovation in the route to, or type of, solution.
A typical agreement with a hospital can take over 2 years to materialize, with an additional 10-12 months typically required by a developer to clean and annotate the data to align with existing standards.
With Hypherdata’s network, companies who are delivering AI solutions and improving upon such solutions simultaneously benefit from quick introductions to healthcare providers and data aggregators, allowing for accelerated search and outcome. Hypherdata’s network comprises companies offering Data Cleaning and Data Annotation for AI services, and a quick round of introductions later, the process became defined, purpose-oriented and quick, saving time and resources. The team and platform’s competence includes contract management and a secure deal room enabling a more streamlined approach.
Lung Cancer
Data / Service Request
Data Modality
Comprehensive Datasets including participant characteristics, screening exam results, diagnostic procedures, lung cancer, and mortality
CT scans and / or Pathology images
Cleaned and annotated for quicker usage
Medical Area
Otorhinolaryngology / Lung Cancer
Data Diversity requirements
North America representing multiple healthcare providers
Solution
Hypherdata works closely with AI solutions who have delivered solutions and / or products to specific markets, but continuously work to improve the efficacy of their algorithm through a continuous flow of newer and more diverse data. Diversity at different cycles of market expansion imply different modalities which either ensure improvement of the product, or greater innovation in the route to, or type of, solution.
A typical agreement with a hospital can take over 2 years to materialize, with an additional 10-12 months typically required by a developer to clean and annotate the data to align with existing standards.
With Hypherdata’s network, companies who are delivering AI solutions and improving upon such solutions simultaneously benefit from quick introductions to healthcare providers and data aggregators, allowing for accelerated search and outcome. Hypherdata’s network comprises companies offering Data Cleaning and Data Annotation for AI services, and a quick round of introductions later, the process became defined, purpose-oriented and quick, saving time and resources. The team and platform’s competence includes contract management and a secure deal room enabling a more streamlined approach.
Medical recordings
Data / Service Request
Data Modality
Surgery recordings
Medical Area
Spine Injuries
Diversity requirements
1,000 recordings
Global sample set, excluding US and EU since recordings from these regions are already available
Solution
For a client building robotic guidance systems for spine surgery, designed to help surgeons with navigation and precision during complex spinal procedures, Hypherdata was appointed to identify global sources of medical recordings. Since the first version of the guidance system had been built using data from the U.S. and EU, Hypherdata sub-licensed the client with data originating from 4 countries keeping in mind their exclusion criteria.
Hypherdata’s global network built on willingness and digital preparedness for data and image exchange helps identify the right sources for specific requirements, and enables the simplest and most compliant transfer ecosystem.
Microbiology
Data / Service Request
Data modality
EHR and Laboratory reports
Medical area
Microbiology
Data Diversity requirements
Minimum of 4 laboratories spanning 2 continents
Solution
Working with U.S-based AI companies building algorithms for early detection of Urinary Tract Infections (UTIs), Hypherdata supported their diverse dataset requirement intended to be submitted for FDA approval.
Their AI-powered systems can analyze urine samples to potentially identify the presence of bacteria or abnormal cells indicative of UTIs.
Hypherdata connected them with reliable, global laboratories to obtain de-identified datasets that include relevant microbiology data points, such as the type of specimen, organism identification, antimicrobial susceptibility results, and date of culture. By ensuring the right laboratories are introduced, Hypherdata can shorten the back-and-forth clarification process with early alignment on data modalities and expectations, and allow companies like this one deliver the intended outcome in a shortened, more structured span of time.
The company also received de-identified metadata, such as patient age, gender, and any specific medical history that would contribute to the context of their research.
MRI Data
Data / Service Request
Data Modality
Echocardiogram (ECG) for 45 diagnoses
Medical Area
Cardiology
Data Diversity requirements
10,000 – 100,000 sample size
Minimum 3 geographic locations spanning 3 continents
Solution
At Hypherdata, we have addressed a leading company’s request for data to build their AI-driven Cardiology Solution. Our network of global data providers, both hospitals and clinicians, have a diverse and extensive dataset of electrocardiograms (ECGs).
With global healthcare data coverage, we can meet the needs of data diversity geographically, and with several other inclusion and exclusion criteria. This diverse data pool with a robust sample size ensured a well-rounded representation of cardiac conditions, considering variations in demographics, lifestyles, and healthcare practices worldwide.
Through our network of service providers for annotation and data cleaning, we ensure that the dataset provided is always of exceptional quality. Accurate and well-annotated data minimizes bias, enhances model interpretability, and boosts the overall reliability of their AI solution.
Once a match has been made, companies are invited to our deal room where companies efficiently navigate the contractual aspects of accessing the data, annotations, and cleaning services. Our pre-established frameworks, templates, and structures facilitate smooth negotiations, allowing them to focus on their core AI development efforts.
Neurology
Data / Service Request
Data Modality
Multimodal neuroimaging data for biomarker identification / validation with demographic information
Cleaning and Standardising available data from multiple projects in UK, US and Cuba
Anonymized EHR information for demographic, unaggregated
Medical Area
Neurology
Data Diversity requirements
100,000+
Wide age distribution of the dataset
Globally representative sample
Solution
Aimed at aiding research around biomarkers from day 115 to age 100 with the potential to support AI solutions in this area, a company had reached out to Hypherdata with a large set of requirements as phase 1 of a multi-phase project.
Understanding the precise data requirements and the intended impact, Hypherdata supported the company in a few ways
– Leveraged our existing network of hospitals and partners to provide a diverse and extensive multimodal imaging dataset including but not limited to MRIs, PETs, Clinical trial information and EHR / EMR data
– Ensured the available data is de-identified, annotated, and cleaned towards a compliant data exchange
– Shortened the search by identifying diverse healthcare providers from our network especially those with digital capabilities and intent to partner research, addressing more complex inclusion and exclusion criteria questions and ensuring the first introduction is a useful one to take significant strides ahead.
Oncology
Data / Service Request
Data modality
Whole slide images
Medical area
Pathology
Data Diversity requirements
24 diagnoses
200 images per diagnosis
Geographies: Europe, US, Asia
Solution
We partnered with a U.S. -based company developing AI-powered algorithms for histopathologic assessment of 24 diagnoses to provide objective quantification of disease-markers.
In these cases, data is sourced directly from manufacturers of the microscopes, giving data requestors unprecedented access to longitudinal data from various locations.
Oncology with MRI data
Data / Service Request
Data Modality
MRI for cancer detection and corresponding EHR / EMR data with DICOM and EHR textual/structured data
Medical Area
Oncology
Data Diversity requirements
1,000 – 10,000 sample size
At least 1 cycle of <confidential> therapy
5 countries located within the European Union excluding the DACH region
Solution
The request coming from a leading organisation within the DACH region of Europe was aimed at addressing the company’s objective to train an advanced AI algorithm capable of generating synthetic versions of the liver MR hepatobiliary phase.
Understanding the precise data requirements and the intended impact, Hypherdata supported the company in a few ways:
– Leveraged our existing network of hospitals pan-EU to provide a diverse and extensive dataset of liver MRI scans with comprehensive coverage of the hepatobiliary phase. The dataset encompassed a broad range of patient demographics, liver conditions, and imaging protocols, ensuring robust training of the AI algorithm.
– Ensured available data is annotated, de-identified and cleaned towards a compliant data exchange
– Shortened the search by identifying diverse hospitals from our network, addressing more complex inclusion and exclusion criteria questions and ensuring the first introduction is a useful one to take significant strides ahead.
The hepatobiliary phase is particularly valuable for assessing liver lesions, such as hepatocellular carcinoma (the most common form of liver cancer) and metastases. A successful outcome could revolutionize liver imaging by significantly shortening the examination process while leveraging artificial intelligence to generate the later phases of the MRI and improving overall efficiency.
Pathology
Data / Service Request
Data modality
Whole slide images
Medical area
Pathology
Data Diversity requirements
24 diagnoses
200 images per diagnosis
Geographies: Europe, US, Asia
Solution
We partnered with a U.S. -based company developing AI-powered algorithms for histopathologic assessment of 24 diagnoses to provide objective quantification of disease-markers.
In these cases, data is sourced directly from manufacturers of the microscopes, giving data requestors unprecedented access to longitudinal data from various locations.
Standardization / FAIR
Data / Service Request
A pharmaceutical company is setting up a new Machine Learning pipeline for one of its drug discovery projects. Like any typical ML project, the first step is to collect the correct datasets and structure, clean them, and do proper feature mapping.
The data science team has decided to experiment with this project by using two sources of data:
- In-house datasets
- public datasets
The goal is to validate the usefulness and trustworthiness of data from public sources. Furthermore, confirm the ability to combine these data sources in the pipeline during the first phase of the project, e.g., data aggregation and structuring.
They need the fastest and safest way to achieve the above, following the standards defined within the organization and simultaneously complying with all that goes with public datasets.
Solution
At Hypherdata, we recognize the unique challenges faced by pharmaceutical companies in setting up robust Machine Learning (ML) pipelines for drug discovery projects. We tailored a comprehensive solution to address their specific requirements. Our network of experts worked closely with the organization to identify methods and processes to clean, de-identify and annotate both sources of data.
Cleaning implied ensuring data quality, and removing inconsistencies, errors, and redundancies from the healthcare datasets, especially the public one. Annotation involved correctly and efficiently labeling and categorizing medical data to create a high-quality dataset for training their ML model.
Through this support, we ensured the organisation could use the expertly curated In-house datasets, with both in-house and public sets seamlessly integrated. A professional strategy based on the right knowledge will help the organization streamline research, heighten data confidence, and accelerate drug discovery programmes.