Commenced in January 2007

Frequency: Monthly

Edition: International

Paper Count: 9402

Search results for: cataract image learning

9402 Thick Data Analytics for Learning Cataract Severity: A Triplet Loss Siamese Neural Network Model

Authors: Jinan Fiaidhi, Sabah Mohammed

Abstract:

Diagnosing cataract severity is an important factor in deciding to undertake surgery. It is usually conducted by an ophthalmologist or through taking a variety of fundus photography that needs to be examined by the ophthalmologist. This paper carries out an investigation using a Siamese neural net that can be trained with small anchor samples to score cataract severity. The model used in this paper is based on a triplet loss function that takes the ophthalmologist best experience in rating positive and negative anchors to a specific cataract scaling system. This approach that takes the heuristics of the ophthalmologist is generally called the thick data approach, which is a kind of machine learning approach that learn from a few shots. Clinical Relevance: The lens of the eye is mostly made up of water and proteins. A cataract occurs when these proteins at the eye lens start to clump together and block lights causing impair vision. This research aims at employing thick data machine learning techniques to rate the severity of the cataract using Siamese neural network.

Keywords: thick data analytics, siamese neural network, triplet-loss model, few shot learning

Procedia PDF Downloads 104

9401 Validity of Simlified Javal’s Rule in 147 Pre-Operation Cataract Eyes

Authors: Mohammad Ghandehari Motlagh

Abstract:

Purpose: To evaluate validity of simplified Javal’s rule (Total Ast=Corneal Ast-0.50@9) in 147 pre-op cataract eyes. Methods: Due to change in lens tissue and structure in a cataract crystalline lens, we conceive the simplified javal’s rule may not be valid in cataract cases.In this cross-sectional study,147 pre-op cataract eyes without oblique astigmatism were enrolled in this study. Ocular biometry (with IOL master 500)and keratometry and refraction findings were recorded. Results: Mean age of our patients was 64.95 yrs/old (SD+_9.86) that confirms on senile cataract. Mean Axial length and average keratometry were respectively 23.86 and 44.62.Prevalence of systemic diseases diabet and high blood pressure were respectively 43 (29.25%) and 44 (29.93%)and shows importance of these diseases. The Corneal astigmatism axis is correlated with refractive astigmatism in cataract eyes (R=0.493). Simplified Javal’s rule is valid in cataract eyes (P<0.001). Conclusion: Simplified Javal’s rule is a valid formula in pre-op cataract eyes and can be used for keratometry results confirmation.

Keywords: javals rule, cataract, keratometry, ocular axial length

Procedia PDF Downloads 421

9400 Early Recognition and Grading of Cataract Using a Combined Log Gabor/Discrete Wavelet Transform with ANN and SVM

Authors: Hadeer R. M. Tawfik, Rania A. K. Birry, Amani A. Saad

Abstract:

Eyes are considered to be the most sensitive and important organ for human being. Thus, any eye disorder will affect the patient in all aspects of life. Cataract is one of those eye disorders that lead to blindness if not treated correctly and quickly. This paper demonstrates a model for automatic detection, classification, and grading of cataracts based on image processing techniques and artificial intelligence. The proposed system is developed to ease the cataract diagnosis process for both ophthalmologists and patients. The wavelet transform combined with 2D Log Gabor Wavelet transform was used as feature extraction techniques for a dataset of 120 eye images followed by a classification process that classified the image set into three classes; normal, early, and advanced stage. A comparison between the two used classifiers, the support vector machine SVM and the artificial neural network ANN were done for the same dataset of 120 eye images. It was concluded that SVM gave better results than ANN. SVM success rate result was 96.8% accuracy where ANN success rate result was 92.3% accuracy.

Keywords: cataract, classification, detection, feature extraction, grading, log-gabor, neural networks, support vector machines, wavelet

Procedia PDF Downloads 325

9399 Metareasoning Image Optimization Q-Learning

Authors: Mahasa Zahirnia

Abstract:

The purpose of this paper is to explore new and effective ways of optimizing satellite images using artificial intelligence, and the process of implementing reinforcement learning to enhance the quality of data captured within the image. In our implementation of Bellman's Reinforcement Learning equations, associated state diagrams, and multi-stage image processing, we were able to enhance image quality, detect and define objects. Reinforcement learning is the differentiator in the area of artificial intelligence, and Q-Learning relies on trial and error to achieve its goals. The reward system that is embedded in Q-Learning allows the agent to self-evaluate its performance and decide on the best possible course of action based on the current and future environment. Results show that within a simulated environment, built on the images that are commercially available, the rate of detection was 40-90%. Reinforcement learning through Q-Learning algorithm is not just desired but required design criteria for image optimization and enhancements. The proposed methods presented are a cost effective method of resolving uncertainty of the data because reinforcement learning finds ideal policies to manage the process using a smaller sample of images.

Keywords: Q-learning, image optimization, reinforcement learning, Markov decision process

Procedia PDF Downloads 209

9398 Preventive and Attenuative Effect of Vitamin E on Selenite-induced Cataract in Rat

Authors: Seyedeh Zeinab Peighambarzadeh, Mehdi Tavana

Abstract:

Cataract is the most common cause of blindness worldwide and its incidence will increase as the World’s population ages. Even in modern ophthalmology, there is no effective medical treatment for cataract except surgery. Development of a drug which could prevent or delay the onset of cataract will lessen this burden and reduce the number of blind patients waiting for cataract surgery. This study was undertaken to evaluate the protective effect of vitamin E on Selenite-induced Cataract in Sprague-dawely rats. Cataracts were induced in rats by administration of sodium selenite. On postpartum day ten, in group I, saline was injected subcutaneously. Group II rat pups received subcutaneous injection of vitamin E (60mg/kg B.W.) at day 8 postpartum and every other day thereafter. Group III and IV rat pups received a subcutaneous injection of sodium selenite (13mg/kg B.W.) at day 10 postpartum. Group IV also received subcutaneous injection of vitamin E (60mg/kg B.W.) at day 8 postpartum and every other day thereafter. The development of cataract in rats was assessed clinically by slit-lamp biomicroscope from day 14 up to postpartum day 28. After sacriﬁce, extricated pup lenses were analyzed for total and soluble protein concentrations and eletrophoretic pattern (SDS-PAGE). There was no opaciﬁcation of lens in Group I and II. There was mature cataract in 95% of Group III. In group IV, 55% of rats developed sub capsular or cortical cataract. Cataractous and biochemical changes of the crystalline lens proteins due to selenite can be retard or prevented by vitamin E.

Keywords: preventive effect, selenite-induced cataract, vitamin E, rat

Procedia PDF Downloads 370

9397 High-Capacity Image Steganography using Wavelet-based Fusion on Deep Convolutional Neural Networks

Authors: Amal Khalifa, Nicolas Vana Santos

Abstract:

Steganography has been known for centuries as an efficient approach for covert communication. Due to its popularity and ease of access, image steganography has attracted researchers to find secure techniques for hiding information within an innocent looking cover image. In this research, we propose a novel deep-learning approach to digital image steganography. The proposed method, DeepWaveletFusion, uses convolutional neural networks (CNN) to hide a secret image into a cover image of the same size. Two CNNs are trained back-to-back to merge the Discrete Wavelet Transform (DWT) of both colored images and eventually be able to blindly extract the hidden image. Based on two different image similarity metrics, a weighted gain function is used to guide the learning process and maximize the quality of the retrieved secret image and yet maintaining acceptable imperceptibility. Experimental results verified the high recoverability of DeepWaveletFusion which outperformed similar deep-learning-based methods.

Keywords: deep learning, steganography, image, discrete wavelet transform, fusion

Procedia PDF Downloads 79

9396 Post-Processing Method for Performance Improvement of Aerial Image Parcel Segmentation

Authors: Donghee Noh, Seonhyeong Kim, Junhwan Choi, Heegon Kim, Sooho Jung, Keunho Park

Abstract:

In this paper, we describe an image post-processing method to enhance the performance of the parcel segmentation method using deep learning-based aerial images conducted in previous studies. The study results were evaluated using a confusion matrix, IoU, Precision, Recall, and F1-Score. In the case of the confusion matrix, it was observed that the false positive value, which is the result of misclassification, was greatly reduced as a result of image post-processing. The average IoU was 0.9688 in the image post-processing, which is higher than the deep learning result of 0.8362, and the F1-Score was also 0.9822 in the image post-processing, which was higher than the deep learning result of 0.8850. As a result of the experiment, it was found that the proposed technique positively complements the deep learning results in segmenting the parcel of interest.

Keywords: aerial image, image process, machine vision, open field smart farm, segmentation

Procedia PDF Downloads 76

9395 Ocular Biometry: Common Etiologies of Difference More Than 0.33mm between Axial Lengths of the 2 Eyes

Authors: Ghandehari Motlagh, Mohammad

Abstract:

Purpose: We tried to find the most common etiologies for anisometropia in pre-op cataract cases: axial or refractive. Methods: In this cross-sectional study ,41 pre-op cataract eyes with more than 0.33 difference between axial lengths of 2 eyes were enrolled.Considered for each 1mm difference between axial lengths in long eyes( AXL more than 25):1.75-2.00 D of anisometropia, for normal eyes(AXL: 22- 25):2.50D and for short eyes (AXL less than 22):3.50-3.75 D as axial anisometropia. If there are more or lesser anisometropia, we recorded as refractive anisometropia. Results: Average of anisometropia :4.24 D, prevalence of PK or LK :1 (2.38%), kc:1(2.38%), glaucoma surgery: 1(2.38%), and pseudophakic status of the opposite eye 8(19.04%). Prevalence of axial anisometropia:21 (52.4%) and refractive anisometropia 20(47.6%).Then on basis of this study we can rely on the patient’s refraction exactly before phaco for evaluation of axial length differences between the 2 eyes, because most of the anisometropias are axial. Conclusion: In most cases, cataract does not induce significant change in refractive error (secondary myopia) and AXL difference between the 2 eyes are correlated with anisometropia.so it can be used for cataract patient’s ocular biometry evaluation. Pre-cataract refraction is a valuable variable should be measured and recorded in routin eye examination.

Keywords: ocular axial length, anisometropia, cataract, ophthalmology and optometry

Procedia PDF Downloads 376

9394 In-Context Meta Learning for Automatic Designing Pretext Tasks for Self-Supervised Image Analysis

Authors: Toktam Khatibi

Abstract:

Self-supervised learning (SSL) includes machine learning models that are trained on one aspect and/or one part of the input to learn other aspects and/or part of it. SSL models are divided into two different categories, including pre-text task-based models and contrastive learning ones. Pre-text tasks are some auxiliary tasks learning pseudo-labels, and the trained models are further fine-tuned for downstream tasks. However, one important disadvantage of SSL using pre-text task solving is defining an appropriate pre-text task for each image dataset with a variety of image modalities. Therefore, it is required to design an appropriate pretext task automatically for each dataset and each downstream task. To the best of our knowledge, the automatic designing of pretext tasks for image analysis has not been considered yet. In this paper, we present a framework based on In-context learning that describes each task based on its input and output data using a pre-trained image transformer. Our proposed method combines the input image and its learned description for optimizing the pre-text task design and its hyper-parameters using Meta-learning models. The representations learned from the pre-text tasks are fine-tuned for solving the downstream tasks. We demonstrate that our proposed framework outperforms the compared ones on unseen tasks and image modalities in addition to its superior performance for previously known tasks and datasets.

Keywords: in-context learning (ICL), meta learning, self-supervised learning (SSL), vision-language domain, transformers

Procedia PDF Downloads 76

9393 Neural Style Transfer Using Deep Learning

Authors: Shaik Jilani Basha, Inavolu Avinash, Alla Venu Sai Reddy, Bitragunta Taraka Ramu

Abstract:

We can use the neural style transfer technique to build a picture with the same "content" as the beginning image but the "style" of the picture we've chosen. Neural style transfer is a technique for merging the style of one image into another while retaining its original information. The only change is how the image is formatted to give it an additional artistic sense. The content image depicts the plan or drawing, as well as the colors of the drawing or paintings used to portray the style. It is a computer vision programme that learns and processes images through deep convolutional neural networks. To implement software, we used to train deep learning models with the train data, and whenever a user takes an image and a styled image, the output will be as the style gets transferred to the original image, and it will be shown as the output.

Keywords: neural networks, computer vision, deep learning, convolutional neural networks

Procedia PDF Downloads 85

9392 Improved Performance in Content-Based Image Retrieval Using Machine Learning Approach

Authors: B. Ramesh Naik, T. Venugopal

Abstract:

This paper presents a novel approach which improves the high-level semantics of images based on machine learning approach. The contemporary approaches for image retrieval and object recognition includes Fourier transforms, Wavelets, SIFT and HoG. Though these descriptors helpful in a wide range of applications, they exploit zero order statistics, and this lacks high descriptiveness of image features. These descriptors usually take benefit of primitive visual features such as shape, color, texture and spatial locations to describe images. These features do not adequate to describe high-level semantics of the images. This leads to a gap in semantic content caused to unacceptable performance in image retrieval system. A novel method has been proposed referred as discriminative learning which is derived from machine learning approach that efficiently discriminates image features. The analysis and results of proposed approach were validated thoroughly on WANG and Caltech-101 Databases. The results proved that this approach is very competitive in content-based image retrieval.

Keywords: CBIR, discriminative learning, region weight learning, scale invariant feature transforms

Procedia PDF Downloads 176

9391 Curcumin-Loaded Phenethyl Isothiocyanate Nano-Spheres: Preparation, Stability Study, and Its Implication for Cataract Prevention

Authors: Pankaj Dinesh Baviskar

Abstract:

This study examines the impact of curcumin-loaded nano-spheres in the form of emulsions on fish eye cataracts. Curcumin nanoemulsions were prepared by using phenethyl isothiocyanate. Nanoemulsions were synthesized by ultrasound-assisted method at 150 Watt. A zeta potential measurement for curcumin-loaded nanoemulsions was found to be -30.7eV, -13.4eV, and -9.55eV, and particle size was found to be 149.3 nm, 245.3 and nm 403.5 nm using particle size analyzer respectively for different conditions. The surface morphology of nano-spheres was examined by FE-SEM analysis. The zeta potential measured indicates its stability for corresponding nano-spheres. The anti-cataract application was studied by using isolated fish eye lenses. The cataract was induced using high glucose concentrated solution. The biochemical parameters in the form of reduced glutathione were measured to interpret the anti-cataract ability of curcumin-loaded nanoemulsions.

Keywords: curcumin, nano, cataract, nanoemulsion

Procedia PDF Downloads 109

9390 The Relationship between Vitamin D and Vitamin B12 Concentrations in Cataract Patients (Senile vs Diabetic)

Authors: Ali Showail Ali Alasmari

Abstract:

Introduction: Cataract is the loss of transparency of the lens inside the eye. It is the most common cause of visual loss and blindness worldwide. This study provides a systemic review of the recent findings on the association of vitamin D, and vitamin B12, and their possible role in preventing cataracts in senile (S) and diabetic mellitus (DM) patient groups. Objective: This study was intended to establish and investigate if there is any role between vitamin D and vitamin B12? Secondly, the connection between serum level of vitamin D and vitamin B12 in cataract incidence senile (s) vs. diabetic mellitus (DM) cataract patient groups. Furthermore, to evaluate and analyze cataract occurrence regarding vitamin D and vitamin B12 levels with other risk factors. Finally, to evaluate lens opacities pre and post treatment with vitamin D and vitaminB12 linked to age and visual acuity loss in both senile(S) and diabetic mellitus (DM) cataract patients’ groups. Methods: This study conducted at the ophthalmology clinic at Muhyail General Hospital. Select a prospective case-control to study the effect of vitamin D and Vit B12 on senile(S) cataracts that caused by age and diabetic mellitus (DM)cataract patients; then we compare these two groups. This study prospectively enrolled a total of 50 samples, 25 with senile cataract and 25 with diabetic cataract, from ophthalmology clinic at Muhyail General Hospital. Measuring 25-hydroxy vitamin D and vitamin B12 level concentrations in the assigned samples. Analyses were performed using SAS (statistical analysis software) program. Results: The most important finding in this study was that the senile(s) cataract patients’ group greatly benefited by the combination therapy of vitamin D, and Vitamin B12 reached (28.5±1.50 and 521.1±21.10) respectively; on the contrary, the diabetic cataract patient group hardly shows any significant improvement (21.5 ± 1.00 and 197.2 ± 7.20) respectively. This is because of the Metformin, the first line drug for treating diabetes, has been reported to potentially decrease vitamin B-12 status. This epigenetic modification was correlated with the diabetic mellitus (DM) cataract patients’ group not responding. Vitamin B12 deficiency also leads to an impairment of the conversion of methylmalonyl-CoA to succinyl-CoA, which has been associated with insulin resistance. There was no significant difference between the age, body mass index (BMI), the mean of Vit-D pre-treatments, and the mean values of Hemoglobin A1C of both senile (S) and diabetic mellitus (DM) cataract patient groups. On other hand, there was a highly significant difference between the mean values of glucose levels in both senile (S) and diabetic mellitus (DM) cataract patient groups. Conclusion: Here we conclude that diabetic mellitus (DM) cataract patient group hardly benefited from this combination therapy vitamin D and vitamin B12; on the other hand senile patient group (s) benefited a lot from the therapy.

Keywords: cataract patients, senile, diabetes mellitus, vitamin B12, vitamin D, Muhyail General Hospital, Saudi Arabia

Procedia PDF Downloads 98

9389 Image Classification with Localization Using Convolutional Neural Networks

Authors: Bhuyain Mobarok Hossain

Abstract:

Image classification and localization research is currently an important strategy in the field of computer vision. The evolution and advancement of deep learning and convolutional neural networks (CNN) have greatly improved the capabilities of object detection and image-based classification. Target detection is important to research in the field of computer vision, especially in video surveillance systems. To solve this problem, we will be applying a convolutional neural network of multiple scales at multiple locations in the image in one sliding window. Most translation networks move away from the bounding box around the area of interest. In contrast to this architecture, we consider the problem to be a classification problem where each pixel of the image is a separate section. Image classification is the method of predicting an individual category or specifying by a shoal of data points. Image classification is a part of the classification problem, including any labels throughout the image. The image can be classified as a day or night shot. Or, likewise, images of cars and motorbikes will be automatically placed in their collection. The deep learning of image classification generally includes convolutional layers; the invention of it is referred to as a convolutional neural network (CNN).

Keywords: image classification, object detection, localization, particle filter

Procedia PDF Downloads 296

9388 Improved Super-Resolution Using Deep Denoising Convolutional Neural Network

Authors: Pawan Kumar Mishra, Ganesh Singh Bisht

Abstract:

Super-resolution is the technique that is being used in computer vision to construct high-resolution images from a single low-resolution image. It is used to increase the frequency component, recover the lost details and removing the down sampling and noises that caused by camera during image acquisition process. High-resolution images or videos are desired part of all image processing tasks and its analysis in most of digital imaging application. The target behind super-resolution is to combine non-repetition information inside single or multiple low-resolution frames to generate a high-resolution image. Many methods have been proposed where multiple images are used as low-resolution images of same scene with different variation in transformation. This is called multi-image super resolution. And another family of methods is single image super-resolution that tries to learn redundancy that presents in image and reconstruction the lost information from a single low-resolution image. Use of deep learning is one of state of art method at present for solving reconstruction high-resolution image. In this research, we proposed Deep Denoising Super Resolution (DDSR) that is a deep neural network for effectively reconstruct the high-resolution image from low-resolution image.

Keywords: resolution, deep-learning, neural network, de-blurring

Procedia PDF Downloads 510

9387 Evaluating Classification with Efficacy Metrics

Authors: Guofan Shao, Lina Tang, Hao Zhang

Abstract:

The values of image classification accuracy are affected by class size distributions and classification schemes, making it difficult to compare the performance of classification algorithms across different remote sensing data sources and classification systems. Based on the term efficacy from medicine and pharmacology, we have developed the metrics of image classification efficacy at the map and class levels. The novelty of this approach is that a baseline classification is involved in computing image classification efficacies so that the effects of class statistics are reduced. Furthermore, the image classification efficacies are interpretable and comparable, and thus, strengthen the assessment of image data classification methods. We use real-world and hypothetical examples to explain the use of image classification efficacies. The metrics of image classification efficacy meet the critical need to rectify the strategy for the assessment of image classification performance as image classification methods are becoming more diversified.

Keywords: accuracy assessment, efficacy, image classification, machine learning, uncertainty

Procedia PDF Downloads 203

9386 An Image Segmentation Algorithm for Gradient Target Based on Mean-Shift and Dictionary Learning

Authors: Yanwen Li, Shuguo Xie

Abstract:

In electromagnetic imaging, because of the diffraction limited system, the pixel values could change slowly near the edge of the image targets and they also change with the location in the same target. Using traditional digital image segmentation methods to segment electromagnetic gradient images could result in lots of errors because of this change in pixel values. To address this issue, this paper proposes a novel image segmentation and extraction algorithm based on Mean-Shift and dictionary learning. Firstly, the preliminary segmentation results from adaptive bandwidth Mean-Shift algorithm are expanded, merged and extracted. Then the overlap rate of the extracted image block is detected before determining a segmentation region with a single complete target. Last, the gradient edge of the extracted targets is recovered and reconstructed by using a dictionary-learning algorithm, while the final segmentation results are obtained which are very close to the gradient target in the original image. Both the experimental results and the simulated results show that the segmentation results are very accurate. The Dice coefficients are improved by 70% to 80% compared with the Mean-Shift only method.

Keywords: gradient image, segmentation and extract, mean-shift algorithm, dictionary iearning

Procedia PDF Downloads 258

9385 Classification of Computer Generated Images from Photographic Images Using Convolutional Neural Networks

Authors: Chaitanya Chawla, Divya Panwar, Gurneesh Singh Anand, M. P. S Bhatia

Abstract:

This paper presents a deep-learning mechanism for classifying computer generated images and photographic images. The proposed method accounts for a convolutional layer capable of automatically learning correlation between neighbouring pixels. In the current form, Convolutional Neural Network (CNN) will learn features based on an image's content instead of the structural features of the image. The layer is particularly designed to subdue an image's content and robustly learn the sensor pattern noise features (usually inherited from image processing in a camera) as well as the statistical properties of images. The paper was assessed on latest natural and computer generated images, and it was concluded that it performs better than the current state of the art methods.

Keywords: image forensics, computer graphics, classification, deep learning, convolutional neural networks

Procedia PDF Downloads 331

9384 Non-Targeted Adversarial Image Classification Attack-Region Modification Methods

Authors: Bandar Alahmadi, Lethia Jackson

Abstract:

Machine Learning model is used today in many real-life applications. The safety and security of such model is important, so the results of the model are as accurate as possible. One challenge of machine learning model security is the adversarial examples attack. Adversarial examples are designed by the attacker to cause the machine learning model to misclassify the input. We propose a method to generate adversarial examples to attack image classifiers. We are modifying the successfully classified images, so a classifier misclassifies them after the modification. In our method, we do not update the whole image, but instead we detect the important region, modify it, place it back to the original image, and then run it through a classifier. The algorithm modifies the detected region using two methods. First, it will add abstract image matrix on back of the detected image matrix. Then, it will perform a rotation attack to rotate the detected region around its axes, and embed the trace of image in image background. Finally, the attacked region is placed in its original position, from where it was removed, and a smoothing filter is applied to smooth the background with foreground. We test our method in cascade classifier, and the algorithm is efficient, the classifier confident has dropped to almost zero. We also try it in CNN (Convolutional neural network) with higher setting and the algorithm was successfully worked.

Keywords: adversarial examples, attack, computer vision, image processing

Procedia PDF Downloads 333

9383 A Comprehensive Study of Camouflaged Object Detection Using Deep Learning

Authors: Khalak Bin Khair, Saqib Jahir, Mohammed Ibrahim, Fahad Bin, Debajyoti Karmaker

Abstract:

Object detection is a computer technology that deals with searching through digital images and videos for occurrences of semantic elements of a particular class. It is associated with image processing and computer vision. On top of object detection, we detect camouflage objects within an image using Deep Learning techniques. Deep learning may be a subset of machine learning that's essentially a three-layer neural network Over 6500 images that possess camouflage properties are gathered from various internet sources and divided into 4 categories to compare the result. Those images are labeled and then trained and tested using vgg16 architecture on the jupyter notebook using the TensorFlow platform. The architecture is further customized using Transfer Learning. Methods for transferring information from one or more of these source tasks to increase learning in a related target task are created through transfer learning. The purpose of this transfer of learning methodologies is to aid in the evolution of machine learning to the point where it is as efficient as human learning.

Keywords: deep learning, transfer learning, TensorFlow, camouflage, object detection, architecture, accuracy, model, VGG16

Procedia PDF Downloads 138

9382 Infographics to Identify, Diagnose, and Review Medically Important Microbes and Microbial Diseases: A Tool to Ignite Minds of Undergraduate Medical Students

Authors: Mohan Bilikallahalli Sannathimmappa, Vinod Nambiar, Rajeev Aravindakshan

Abstract:

Background: Image-based teaching-learning module is innovative student-centered andragogy. The objective of our study was to explore medical students’ perception of effectiveness of image-based learning strategy in promoting their lifelong learning skills and evaluate its impact on improving students’ exam grades. Methods: A prospective single-cohort study was conducted on undergraduate medical students of the academic year 2021-22. The image-based teaching-learning module was assessed through pretest, posttest, and exam grades. Students’ feedback was collected through a predesigned questionnaire on a 3-point Likert Scale. The reliability of the questionnaire was assessed using Cronbach’s alpha coefficient test. In-Course Exam-4 results were compared with In-Course Exams 1, 2, and 3. Correlation coefficients were worked out wherever relevant to find the impact of the exercise on grades. Data were collected, entered into Microsoft Excel, and statistically analyzed using SPSS version 22. Results: In total, 127 students were included in the study. The posttest scores of the students were significantly high (24.75±) as compared to pretest scores (8.25±). Students’ opinion towards the effectiveness of image-based learning in promoting their lifelong learning skills was overwhelmingly positive (Cronbach’s alpha for all items was 0.756). More than 80% of the students indicated image-based learning was interesting, encouraged peer discussion, and helped them to identify, explore, and revise key information and knowledge improvement. Nearly 70% expressed image-based learning enhanced their critical thinking and problem-solving skills. Nine out of ten students recommended image-based learning module for future topics. Conclusion: Overall, Image-based learning was found to be effective in achieving undergraduate medical students learning outcomes. The results of the study are in favor of the implementation of Image-based learning in Microbiology courses. However, multicentric studies are required to authenticate our study findings.

Keywords: active learning, knowledge, medical education, microbes, problem solving

Procedia PDF Downloads 68

9381 NANCY: Combining Adversarial Networks with Cycle-Consistency for Robust Multi-Modal Image Registration

Authors: Mirjana Ruppel, Rajendra Persad, Amit Bahl, Sanja Dogramadzi, Chris Melhuish, Lyndon Smith

Abstract:

Multimodal image registration is a profoundly complex task which is why deep learning has been used widely to address it in recent years. However, two main challenges remain: Firstly, the lack of ground truth data calls for an unsupervised learning approach, which leads to the second challenge of defining a feasible loss function that can compare two images of different modalities to judge their level of alignment. To avoid this issue altogether we implement a generative adversarial network consisting of two registration networks GAB, GBA and two discrimination networks DA, DB connected by spatial transformation layers. GAB learns to generate a deformation field which registers an image of the modality B to an image of the modality A. To do that, it uses the feedback of the discriminator DB which is learning to judge the quality of alignment of the registered image B. GBA and DA learn a mapping from modality A to modality B. Additionally, a cycle-consistency loss is implemented. For this, both registration networks are employed twice, therefore resulting in images ˆA, ˆB which were registered to ˜B, ˜A which were registered to the initial image pair A, B. Thus the resulting and initial images of the same modality can be easily compared. A dataset of liver CT and MRI was used to evaluate the quality of our approach and to compare it against learning and non-learning based registration algorithms. Our approach leads to dice scores of up to 0.80 ± 0.01 and is therefore comparable to and slightly more successful than algorithms like SimpleElastix and VoxelMorph.

Keywords: cycle consistency, deformable multimodal image registration, deep learning, GAN

Procedia PDF Downloads 124

9380 Traffic Light Detection Using Image Segmentation

Authors: Vaishnavi Shivde, Shrishti Sinha, Trapti Mishra

Abstract:

Traffic light detection from a moving vehicle is an important technology both for driver safety assistance functions as well as for autonomous driving in the city. This paper proposed a deep-learning-based trafﬁc light recognition method that consists of a pixel-wise image segmentation technique and a fully convolutional network i.e., UNET architecture. This paper has used a method for detecting the position and recognizing the state of the traffic lights in video sequences is presented and evaluated using Traffic Light Dataset which contains masked traffic light image data. The first stage is the detection, which is accomplished through image processing (image segmentation) techniques such as image cropping, color transformation, segmentation of possible traffic lights. The second stage is the recognition, which means identifying the color of the traffic light or knowing the state of traffic light which is achieved by using a Convolutional Neural Network (UNET architecture).

Keywords: traffic light detection, image segmentation, machine learning, classification, convolutional neural networks

Procedia PDF Downloads 167

9379 Correlation between Diabetic Cataract, HBA1C and Gurakhu, a Clinical Study in Chhattisgarh State

Authors: A. Bhattacharya, Sanjay Gupta, S. H. Bodakhe

Abstract:

HbA1c is form of the haemoglobin that is used to measure the average plasma glucose concentration over prolonged periods of time. It is formed in a non-enzymatic glycation pathway by hemoglobin's exposure to plasma glucose. In diabetes mellitus, higher amounts of glycated hemoglobin, indicating poorer control of blood glucose levels, have been associated with cardiovascular disease, nephropathy, and retinopathy. Guraku’s basic components are nicotine and jaggery, jaggery is made up of sugarcane so can have a diabetogenic potential which is exacerbated in presence of nicotine. This work had done with the aim to find correlation between Diabetic cataract, HbA1c and Guraku. Subjects were enrolled according to the inclusion and exclusion criteria. In this study total 75 subjects were included. In the study it was found that people consuming Guraku had a high level of HbA1c thus are more prone to the development of diabetic cataract. Male subjects are the more than female subjects. Most of the subjects belong to the lower socioeconomical class and not very educated. It could be concluded that this type of study could be useful in indentifying number of subjects suffering from diabetic cataract whose condition get worse by use of nicotine product like Guraku and preventive measure to be taken in prevention of this type of diabetic complication.

Keywords: diabetic cataract, HbA1c, Guraku, diabetogenic potential

Procedia PDF Downloads 395

9378 Tongue Image Retrieval Based Using Machine Learning

Authors: Ahmad FAROOQ, Xinfeng Zhang, Fahad Sabah, Raheem Sarwar

Abstract:

In Traditional Chinese Medicine, tongue diagnosis is a vital inspection tool (TCM). In this study, we explore the potential of machine learning in tongue diagnosis. It begins with the cataloguing of the various classifications and characteristics of the human tongue. We infer 24 kinds of tongues from the material and coating of the tongue, and we identify 21 attributes of the tongue. The next step is to apply machine learning methods to the tongue dataset. We use the Weka machine learning platform to conduct the experiment for performance analysis. The 457 instances of the tongue dataset are used to test the performance of five different machine learning methods, including SVM, Random Forests, Decision Trees, and Naive Bayes. Based on accuracy and Area under the ROC Curve, the Support Vector Machine algorithm was shown to be the most effective for tongue diagnosis (AUC).

Keywords: medical imaging, image retrieval, machine learning, tongue

Procedia PDF Downloads 69

9377 Medical Imaging Fusion: A Teaching-Learning Simulation Environment

Authors: Cristina Maria Ribeiro Martins Pereira Caridade, Ana Rita Ferreira Morais

Abstract:

The use of computational tools has become essential in the context of interactive learning, especially in engineering education. In the medical industry, teaching medical image processing techniques is a crucial part of training biomedical engineers, as it has integrated applications with healthcare facilities and hospitals. The aim of this article is to present a teaching-learning simulation tool developed in MATLAB using a graphical user interface for medical image fusion that explores different image fusion methodologies and processes in combination with image pre-processing techniques. The application uses different algorithms and medical fusion techniques in real time, allowing you to view original images and fusion images, compare processed and original images, adjust parameters, and save images. The tool proposed in an innovative teaching and learning environment consists of a dynamic and motivating teaching simulation for biomedical engineering students to acquire knowledge about medical image fusion techniques and necessary skills for the training of biomedical engineers. In conclusion, the developed simulation tool provides real-time visualization of the original and fusion images and the possibility to test, evaluate and progress the student’s knowledge about the fusion of medical images. It also facilitates the exploration of medical imaging applications, specifically image fusion, which is critical in the medical industry. Teachers and students can make adjustments and/or create new functions, making the simulation environment adaptable to new techniques and methodologies.

Keywords: image fusion, image processing, teaching-learning simulation tool, biomedical engineering education

Procedia PDF Downloads 122

9376 Glucose Monitoring System Using Machine Learning Algorithms

Authors: Sangeeta Palekar, Neeraj Rangwani, Akash Poddar, Jayu Kalambe

Abstract:

The bio-medical analysis is an indispensable procedure for identifying health-related diseases like diabetes. Monitoring the glucose level in our body regularly helps us identify hyperglycemia and hypoglycemia, which can cause severe medical problems like nerve damage or kidney diseases. This paper presents a method for predicting the glucose concentration in blood samples using image processing and machine learning algorithms. The glucose solution is prepared by the glucose oxidase (GOD) and peroxidase (POD) method. An experimental database is generated based on the colorimetric technique. The image of the glucose solution is captured by the raspberry pi camera and analyzed using image processing by extracting the RGB, HSV, LUX color space values. Regression algorithms like multiple linear regression, decision tree, RandomForest, and XGBoost were used to predict the unknown glucose concentration. The multiple linear regression algorithm predicts the results with 97% accuracy. The image processing and machine learning-based approach reduce the hardware complexities of existing platforms.

Keywords: artificial intelligence glucose detection, glucose oxidase, peroxidase, image processing, machine learning

Procedia PDF Downloads 196

9375 Deepnic, A Method to Transform Each Variable into Image for Deep Learning

Authors: Nguyen J. M., Lucas G., Brunner M., Ruan S., Antonioli D.

Abstract:

Deep learning based on convolutional neural networks (CNN) is a very powerful technique for classifying information from an image. We propose a new method, DeepNic, to transform each variable of a tabular dataset into an image where each pixel represents a set of conditions that allow the variable to make an error-free prediction. The contrast of each pixel is proportional to its prediction performance and the color of each pixel corresponds to a sub-family of NICs. NICs are probabilities that depend on the number of inputs to each neuron and the range of coefficients of the inputs. Each variable can therefore be expressed as a function of a matrix of 2 vectors corresponding to an image whose pixels express predictive capabilities. Our objective is to transform each variable of tabular data into images into an image that can be analysed by CNNs, unlike other methods which use all the variables to construct an image. We analyse the NIC information of each variable and express it as a function of the number of neurons and the range of coefficients used. The predictive value and the category of the NIC are expressed by the contrast and the color of the pixel. We have developed a pipeline to implement this technology and have successfully applied it to genomic expressions on an Affymetrix chip.

Keywords: tabular data, deep learning, perfect trees, NICS

Procedia PDF Downloads 84

9374 A Deep Learning Based Approach for Dynamically Selecting Pre-processing Technique for Images

Authors: Revoti Prasad Bora, Nikita Katyal, Saurabh Yadav

Abstract:

Pre-processing plays an important role in various image processing applications. Most of the time due to the similar nature of images, a particular pre-processing or a set of pre-processing steps are sufficient to produce the desired results. However, in the education domain, there is a wide variety of images in various aspects like images with line-based diagrams, chemical formulas, mathematical equations, etc. Hence a single pre-processing or a set of pre-processing steps may not yield good results. Therefore, a Deep Learning based approach for dynamically selecting a relevant pre-processing technique for each image is proposed. The proposed method works as a classifier to detect hidden patterns in the images and predicts the relevant pre-processing technique needed for the image. This approach experimented for an image similarity matching problem but it can be adapted to other use cases too. Experimental results showed significant improvement in average similarity ranking with the proposed method as opposed to static pre-processing techniques.

Keywords: deep-learning, classification, pre-processing, computer vision, image processing, educational data mining

Procedia PDF Downloads 153

9373 Progress in Combining Image Captioning and Visual Question Answering Tasks

Authors: Prathiksha Kamath, Pratibha Jamkhandi, Prateek Ghanti, Priyanshu Gupta, M. Lakshmi Neelima

Abstract:

Combining Image Captioning and Visual Question Answering (VQA) tasks have emerged as a new and exciting research area. The image captioning task involves generating a textual description that summarizes the content of the image. VQA aims to answer a natural language question about the image. Both these tasks include computer vision and natural language processing (NLP) and require a deep understanding of the content of the image and semantic relationship within the image and the ability to generate a response in natural language. There has been remarkable growth in both these tasks with rapid advancement in deep learning. In this paper, we present a comprehensive review of recent progress in combining image captioning and visual question-answering (VQA) tasks. We first discuss both image captioning and VQA tasks individually and then the various ways in which both these tasks can be integrated. We also analyze the challenges associated with these tasks and ways to overcome them. We finally discuss the various datasets and evaluation metrics used in these tasks. This paper concludes with the need for generating captions based on the context and captions that are able to answer the most likely asked questions about the image so as to aid the VQA task. Overall, this review highlights the significant progress made in combining image captioning and VQA, as well as the ongoing challenges and opportunities for further research in this exciting and rapidly evolving field, which has the potential to improve the performance of real-world applications such as autonomous vehicles, robotics, and image search.

Keywords: image captioning, visual question answering, deep learning, natural language processing

Procedia PDF Downloads 67