Enhancing Community Vision Screening -- AI Driven Retinal Photography for Early Disease Detection and Patient Trust
Xiaofeng Lei, Yih-Chung Tham, Jocelyn Hui Lin Goh, Yangqin Feng, Yang Bai, Zhi Da Soh, Rick Siow Mong Goh, Xinxing Xu, Yong Liu, Ching-Yu Cheng·October 27, 2024
Summary
ECVS, an AI-driven retinal photography solution, enhances community vision screening by detecting pathology-based visual impairment. It offers a one-stop workflow for large-scale settings, reducing traditional screening tests from five to two, chair time from 40 minutes to 5, and referral time from 2-4 weeks to 10-20 minutes. The system uses deep learning models like ResNet, EffcientNet, ViT, Swin Transformer, and RetFound for classification and U-Net for segmentation. It includes a photo quality assessment and a pathology visual impairment classification model. The ECVS framework comprises various models for detecting vision impairment, diagnosing eye diseases, and segmenting lesion regions. The RETQA model, using Swin V2, EfficientNet b7, and ResNet50, achieved high AUC scores on both internal and external datasets. The PVI models achieved high AUC scores on both internal and external datasets, enabling context-specific outputs with high sensitivity or specificity. The multi-label eye disease classification model, akin to a junior ophthalmologist's diagnosis, provided acceptable results during screening. The segmentation model's ROI heatmap, more aligned with human perception, showed promise with a DICE score of 0.482. The ECVS, comprising four deep learning models, was cost-efficient, with manageable computational overhead.
Introduction
Background
Overview of community vision screening challenges
Importance of early detection and intervention in retinal diseases
Objective
Enhancing efficiency and accuracy in community vision screening
Reducing time and costs associated with traditional screening methods
Method
Data Collection
Types of data used for training and testing models
Data sources for retinal images
Data Preprocessing
Techniques for image quality assessment
Methods for preparing data for model training
Classification Models
RETQA Model
Architecture using Swin V2, EfficientNet b7, and ResNet50
Performance metrics on internal and external datasets
PVI Models
High AUC scores achieved on internal and external datasets
Context-specific outputs with high sensitivity or specificity
Segmentation Model
Multi-label Eye Disease Classification
Model's role in diagnosing eye diseases
Comparison with junior ophthalmologists' diagnoses
Segmentation Model
ROI heatmap alignment with human perception
DICE score of 0.482 indicating model performance
Framework and Efficiency
Workflow Reduction
From five traditional screening tests to two
Chair time reduced from 40 minutes to 5 minutes
Referral time shortened from 2-4 weeks to 10-20 minutes
Computational Efficiency
Cost-effective implementation with manageable computational overhead
Conclusion
Summary of ECVS benefits
Future directions and potential improvements
Basic info
papers
image and video processing
computer vision and pattern recognition
artificial intelligence
Advanced features