Structured data collection and deep learning for retinal OCT image-to-text translation: a comprehensive framework
Abstract
This paper presents a comprehensive framework for structured data collection and deep learning (DL)-based translation of retinal optical coherence tomography (OCT) images into diagnostic text. The suggested approach guarantees high-quality OCT data for model training through the use of sophisticated image processing methods like edge detection, noise suppression, and contrast improvement. The study utilizes 84,484 retinal images from the OCT dataset available on Kaggle. The research utilizes various preprocessing techniques, such as median and Gaussian filtering, along with data augmentation strategies like translation, rotation, and scaling, to mitigate class imbalances and improve model performance. The system automatically identifies and categorizes retinal diseases such as drusen, diabetic macular edema (DME), and choroidal neovascularization (CNV) by integrating feature extraction and selection with DL techniques. The research highlights the importance of effective data handling and model scalability to address the increasing need for automated diagnostic tools in ophthalmology. This framework aims to support ophthalmologists in managing the increasing incidence of diabetic retinopathy (DR) and other retinal conditions by enhancing the efficiency of retinal image analysis, thereby improving patient results through early detection and treatment.
Keywords
Automated diagnosis; Deep learning; Diabetic macular edema; Image preprocessing; Retinal OCT imaging
Full Text:
PDFDOI: http://doi.org/10.11591/ijai.v15.i2.pp1050-1061
Refbacks
- There are currently no refbacks.
Copyright (c) 2026 Uday Mande, Shafi Pathan, Pankaj Chandre , Sharvari Mande

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).