Muteki Group
Sep 22, 2023
No image
Completed
AI solution in Screening Mammography Breast Cancer Detection
$75,000+
7-12 months
Canada
2-5
Service categories
Service Lines
Artificial Intelligence
Big Data
Domain focus
Healthcare
Other
Programming language
Python
Frameworks
TensorFlow
Torch/PyTorch
Challenge
According to the World Health Organization, breast cancer is the most common cancer worldwide, with 2.3 million new diagnoses and 685,000 deaths in 2020 alone. However, breast cancer mortality in high-income countries has decreased by 40% since the 1980s due to regular mammography screening. Early detection and treatment are crucial in reducing cancer fatalities, and machine learning skills can help streamline the process of evaluating screening mammograms used by radiologists. For Machine learning the task was challenging from the start due to the low number of positive class samples.
According to the World Health Organization, breast cancer is the most common cancer worldwide, with 2.3 million new diagnoses and 685,000 deaths in 2020 alone. However, breast cancer mortality in high-income countries has decreased by 40% since the 1980s due to regular mammography screening. Early detection and treatment are crucial in reducing cancer fatalities, and machine learning skills can help streamline the process of evaluating screening mammograms used by radiologists. For Machine learning the task was challenging from the start due to the low number of positive class samples.
Solution
Despite the encountered difficulties we were able to get reasonably good results after implementing a good training pipeline that included positive class balance, scaling, model selection, and post-processing.
The final solution was based on the voting strategy, and then the average score based on votes. The four steps of the solution were straightforward, including processing the DICOM files into PNG, inferring the three posterior models from TTA, averaging the ensemble probabilities or voting, and thresholding.
Despite the encountered difficulties we were able to get reasonably good results after implementing a good training pipeline that included positive class balance, scaling, model selection, and post-processing.
The final solution was based on the voting strategy, and then the average score based on votes. The four steps of the solution were straightforward, including processing the DICOM files into PNG, inferring the three posterior models from TTA, averaging the ensemble probabilities or voting, and thresholding.
Results
We've successfully implemented a training pipeline, with
- 85% accuracy
- processing the DICOM files into PNG
- inferring the three posterior models from TTA
- averaging the ensemble probabilities or voting, and thresholding
We've successfully implemented a training pipeline, with
- 85% accuracy
- processing the DICOM files into PNG
- inferring the three posterior models from TTA
- averaging the ensemble probabilities or voting, and thresholding