Search results for: DenseNet121

2 Dots to Dialogue: Enhancing Accessibility through Braille Image-to-Speech Conversion

Authors: Shwetha B. S., Sirisha M., Vachana U., Aditya Kadlimatti, Manjushree N. S.

Abstract:

Braille script holds significant importance in bridging the communication gap for visually impaired individuals. However, the challenge of interpreting Braille for non-experts creates barriers in education and day-to-day interactions. This paper aims to develop a system that translates Braille text into multilingual speech using advanced Convolutional Neural Networks (CNNs) and Google Text-to-Speech (GTTS) technology. The proposed system employs image recognition techniques powered by CNNs to accurately identify and decode Braille characters from captured images. The deep learning model undergoes training on a diverse dataset of Braille symbols to ensure high accuracy and robustness. Among the models evaluated, AlexNet demonstrated the highest accuracy in decoding Braille characters. Once recognized, the decoded text is converted into speech in the user’s preferred language using the GTTS API. This system possesses the ability to greatly improve inclusivity by enabling real-time Braille interpretation for visually impaired individuals, educators, and caregivers.

Keywords: convolutional neural networks, Braille image, image-to-speech, GTTS, AlexNet, VGG16, DenseNet121, ResNet50

Procedia PDF Downloads 4

1 Brain Tumor Detection and Classification Using Pre-Trained Deep Learning Models

Authors: Aditya Karade, Sharada Falane, Dhananjay Deshmukh, Vijaykumar Mantri

Abstract:

Brain tumors pose a significant challenge in healthcare due to their complex nature and impact on patient outcomes. The application of deep learning (DL) algorithms in medical imaging have shown promise in accurate and efficient brain tumour detection. This paper explores the performance of various pre-trained DL models ResNet50, Xception, InceptionV3, EfficientNetB0, DenseNet121, NASNetMobile, VGG19, VGG16, and MobileNet on a brain tumour dataset sourced from Figshare. The dataset consists of MRI scans categorizing different types of brain tumours, including meningioma, pituitary, glioma, and no tumour. The study involves a comprehensive evaluation of these models’ accuracy and effectiveness in classifying brain tumour images. Data preprocessing, augmentation, and finetuning techniques are employed to optimize model performance. Among the evaluated deep learning models for brain tumour detection, ResNet50 emerges as the top performer with an accuracy of 98.86%. Following closely is Xception, exhibiting a strong accuracy of 97.33%. These models showcase robust capabilities in accurately classifying brain tumour images. On the other end of the spectrum, VGG16 trails with the lowest accuracy at 89.02%.

Keywords: brain tumour, MRI image, detecting and classifying tumour, pre-trained models, transfer learning, image segmentation, data augmentation

Procedia PDF Downloads 81