Search results for: optical musical recognition
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 3594

Search results for: optical musical recognition

3534 DBN-Based Face Recognition System Using Light Field

Authors: Bing Gu

Abstract:

Abstract—Most of Conventional facial recognition systems are based on image features, such as LBP, SIFT. Recently some DBN-based 2D facial recognition systems have been proposed. However, we find there are few DBN-based 3D facial recognition system and relative researches. 3D facial images include all the individual biometric information. We can use these information to build more accurate features, So we present our DBN-based face recognition system using Light Field. We can see Light Field as another presentation of 3D image, and Light Field Camera show us a way to receive a Light Field. We use the commercially available Light Field Camera to act as the collector of our face recognition system, and the system receive a state-of-art performance as convenient as conventional 2D face recognition system.

Keywords: DBN, face recognition, light field, Lytro

Procedia PDF Downloads 464
3533 Creative Practice and Consciousness in Juju Music: A Nigerian Musical and Cultural Perspective

Authors: Olupemi E. Oludare

Abstract:

This paper investigates the creative practice engaged in Juju music, a Nigerian Neo-traditional genre of the Yoruba, and its influence on the consciousness of societal praxis. It takes a musical and cultural perspective, as representational indices of how the people’s religious, social, educational, and political consciousness is expressed in their music. The study adopts the historical cum descriptive design in its methodology, tracing the historical development of Juju music, the appropriation of musical and cultural materials in its creative process, and a descriptive analysis of its musical practice, in order to substantiate the role and function of Juju music and its musicians in the political, philosophical, and social consciousness of Nigeria’s pre- and post-independence epoch. Data were collected through oral interviews of selected Juju practitioners, stakeholders, and enthusiasts. It also employed the use of discography of Juju musicians. This paper discusses musical factors such as form, melodic and rhythmic patterns, and thematic materials, while highlighting cultural factors such as linguistic elements, with textual analysis, as a conscious avenue of expression. The study revealed that Juju musicians composed their music by engaging both indigenous and foreign musical materials, as a means of creative practice for musical entertainment, while expressing the people’s consciousness of their beliefs, values, and socio-political issues, hence the music functioning as a vehicle for social commentaries. The popularization and commercialization of Juju music brought the musicians national and international accolades, subsequently attracting contributions from contemporary musicians, which led to innovations of new brands, such as ‘Afro-Juju’, ‘Gospel-Juju’, ‘Hip-Hop-Juju’, etc., albeit retaining the basic musical elements of its progenitor, as a conscious music for socio-cultural functions. This study concludes that Juju music and its musicians remain germane in the musical scene of the nation’s social, educational, and political terrain, especially in the current Nigerian democratic climate. This paper recommends the promotion and patronage of the Juju music in its original form, to prevent its decline in current times, since it serves as an enrichment of national identity both in Nigeria, and Internationally.

Keywords: appropriation, consciousness, creative practice, national identity, neo-traditional

Procedia PDF Downloads 426
3532 Prime Ministers of Malaysia Musicals: Political Performances Trend in Istana Budaya (2007-2012)

Authors: Abdul Walid Ali

Abstract:

The trend of publishing political musicals in Istana Budaya has been popular since 2007 when Malaysia celebrated its 50th anniversary of independence. Istana Budaya has at least one musical of any politician since then. Normally, the politicians are former Prime Ministers and renowned ministers prior to Malaysia's independence in 1957. The first performance in Istana Budaya which represented a politician as a theme was Muzikal Cheng Lock (2007) followed by Muzikal Tun Abdul Razak (2009), Muzikal Tun Mahathir (2010), and Muzikal Tun Mahathir 2 (2011). In 2012, Lawak Ke Der has changed the trend with comic performance and put an end to politician musical. Tun Siti Hasmah the Musical (2012) is not listed in the research because she did not hold any position as a minister. This qualitative research focuses on musicals of political figures as a theme. Some factors of making this type of performance are analyzed based on Istana Budaya’s decisions during that time in Malaysia between 2007 and 2011. This research aims to document these musical themed performances in Istana Budaya for further research in the future. Political performances are listed and analyzed from 2007 to 2012 based on reports and previous research. The declination of audiences in 2012 and a new theme in theatre performances in Istana Budaya are important factors for the downfall of the political theatres' theme.

Keywords: musical, politician, Istana Budaya, theatre

Procedia PDF Downloads 191
3531 Theoretical Investigations on Optical Properties of GaFeMnN Quaternary Compound

Authors: H. A. Bentounes, A. Abbad, W. Benstaali

Abstract:

Using first principles calculations based on the density functional theory and local spin density approximation, we investigate optical properties of GaFeMnN quaternary compound. Results show that optical properties confirm that GaFeMnN can be a good candidate in the design of thin film solar cells in the visible and ultraviolet parts of the spectrum, and a good sensor in the infrared

Keywords: GaN, optical absorption, semi-metallic, dielectric function

Procedia PDF Downloads 368
3530 Correlating Musical Subject and Dialectical Subject to Develop a Critical Approach to Ideology in Musical Analysis and Composition

Authors: James Waide

Abstract:

In music, subject typically denotes the initial idea to which the entire composition refers—a concept congruous with Aristotle's notion of subject as primary substance, in the sense of an irreducible this particular. Gioseffo Zarlino, who established subject (soggetto) as a musical term, insisted the composer as “rediscovering” the subject within their music in order to “[bring] it to perfection” and, furthermore, that if the composer had not rediscovered the subject already, then one would simply take the first part of the composition to be the subject. Meanwhile, Žižek reads the Hegelian subject (as negativity set against positive object) through Lacanian Psychoanalysis (in which the subject is a kind of fictive entity of the clinic: a mere appearance which sits atop the objects of analysis) in the concept of Absolute Recoil. For Žižek, subject exists retroactively in Absolute Recoil from object, meaning subject is a void which only has meaning because of the object it is seen through. Following the work of theorists such as Adorno and Althusser, one can understand the ideological construction of such a subject. It may be argued that in Zarlino, musical subject can be similarly read as retroactively constructed, either by the composer or the listener. Furthermore, in recent work, Samuel Wilson identifies different kinds of subjects in music which can be psychoanalytically examined, including the fictive subject: a purely musical entity raised to the level of psychoanalytic subject. On which basis if, as Adorno insisted, 'authentic' music constitutes 'cognition without concepts', where and what is this subject without concept?.

Keywords: absolute recoil, critical theory, ideology, music analysis, psychoanalysis, retroactivity, subject

Procedia PDF Downloads 71
3529 From Sound to Music: The Trajectory of Musical Semiotics in a Selected Soundscape Environment in South-Western Nigeria

Authors: Olatunbosun Samuel Adekogbe

Abstract:

This paper addresses the question of musical signification, revolving around nature and its natural divides; the paper tends to examine the roles of the dispositional apparatus of listeners to react to sounding environments through music as coordinated sound that focuses on the powerful strain between vibrational occurrences of sound and potentials of being structured. This paper sets out to examine music as a simple conventional design that does not allude to something beyond music and sound as a vehicle to communicate through production, perception, translation, and reaction with regard to melodic and semiotic functions of sounds. This paper adopts the application of questionnaire and evolutionary approach methods to probe musical adaptation, reproduction, and natural selection as the basis for explaining specific human behavioural responses to musical sense-making beyond the above-sketched dichotomies, with a major focus on the transition from acoustic-emotional sensibilities to musical meaning in the selected soundscapes. It was observed that music has emancipated itself from the level of mere acoustic processing of sounds to a functional description in terms of allowing music users to share experiences and interact with the soundscaping environment. The paper, therefore, concludes that the audience as music participants and listeners in the selected soundscapes have been conceived as adaptive devices in the paradigm shift, which can build up new semiotic linkages with the sounding environments in southwestern Nigeria.

Keywords: semiotics, sound, music, soundscape, environment

Procedia PDF Downloads 65
3528 Connected Objects with Optical Rectenna for Wireless Information Systems

Authors: Chayma Bahar, Chokri Baccouch, Hedi Sakli, Nizar Sakli

Abstract:

Harvesting and transport of optical and radiofrequency signals are a topical subject with multiple challenges. In this paper, we present a Optical RECTENNA system. We propose here a hybrid system solar cell antenna for 5G mobile communications networks. Thus, we propose rectifying circuit. A parametric study is done to follow the influence of load resistance and input power on Optical RECTENNA system performance. Thus, we propose a solar cell antenna structure in the frequency band of future 5G standard in 2.45 GHz bands.

Keywords: antenna, IoT, optical rectenna, solar cell

Procedia PDF Downloads 178
3527 Myers-Briggs Type Index Personality Type Classification Based on an Individual’s Spotify Playlists

Authors: Sefik Can Karakaya, Ibrahim Demir

Abstract:

In this study, the relationship between musical preferences and personality traits has been investigated in terms of Spotify audio analysis features. The aim of this paper is to build such a classifier capable of segmenting people into their Myers-Briggs Type Index (MBTI) personality type based on their Spotify playlists. Music takes an important place in the lives of people all over the world and online music streaming platforms make it easier to reach musical contents. In this context, the motivation to build such a classifier is allowing people to gain access to their MBTI personality type and perhaps for more reliably and more quickly. For this purpose, logistic regression and deep neural networks have been selected for classifier and their performances are compared. In conclusion, it has been found that musical preferences differ statistically between personality traits, and evaluated models are able to distinguish personality types based on given musical data structure with over %60 accuracy rate.

Keywords: myers-briggs type indicator, music psychology, Spotify, behavioural user profiling, deep neural networks, logistic regression

Procedia PDF Downloads 144
3526 Image Processing techniques for Surveillance in Outdoor Environment

Authors: Jayanth C., Anirudh Sai Yetikuri, Kavitha S. N.

Abstract:

This paper explores the development and application of computer vision and machine learning techniques for real-time pose detection, facial recognition, and number plate extraction. Utilizing MediaPipe for pose estimation, the research presents methods for detecting hand raises and ducking postures through real-time video analysis. Complementarily, facial recognition is employed to compare and verify individual identities using the face recognition library. Additionally, the paper demonstrates a robust approach for extracting and storing vehicle number plates from images, integrating Optical Character Recognition (OCR) with a database management system. The study highlights the effectiveness and versatility of these technologies in practical scenarios, including security and surveillance applications. The findings underscore the potential of combining computer vision techniques to address diverse challenges and enhance automated systems for both individual and vehicular identification. This research contributes to the fields of computer vision and machine learning by providing scalable solutions and demonstrating their applicability in real-world contexts.

Keywords: computer vision, pose detection, facial recognition, number plate extraction, machine learning, real-time analysis, OCR, database management

Procedia PDF Downloads 26
3525 Face Tracking and Recognition Using Deep Learning Approach

Authors: Degale Desta, Cheng Jian

Abstract:

The most important factor in identifying a person is their face. Even identical twins have their own distinct faces. As a result, identification and face recognition are needed to tell one person from another. A face recognition system is a verification tool used to establish a person's identity using biometrics. Nowadays, face recognition is a common technique used in a variety of applications, including home security systems, criminal identification, and phone unlock systems. This system is more secure because it only requires a facial image instead of other dependencies like a key or card. Face detection and face identification are the two phases that typically make up a human recognition system.The idea behind designing and creating a face recognition system using deep learning with Azure ML Python's OpenCV is explained in this paper. Face recognition is a task that can be accomplished using deep learning, and given the accuracy of this method, it appears to be a suitable approach. To show how accurate the suggested face recognition system is, experimental results are given in 98.46% accuracy using Fast-RCNN Performance of algorithms under different training conditions.

Keywords: deep learning, face recognition, identification, fast-RCNN

Procedia PDF Downloads 140
3524 Soliton Interaction in Multi-Core Optical Fiber: Application to WDM System

Authors: S. Arun Prakash, V. Malathi, M. S. Mani Rajan

Abstract:

The analytical bright two soliton solution of the 3-coupled nonlinear Schrödinger equations with variable coefficients in birefringent optical fiber is obtained by Darboux transformation method. To the design of ultra-speed optical devices, Soliton interaction and control in birefringence fiber is investigated. Lax pair is constructed for N coupled NLS system through AKNS method. Using two soliton solution, we demonstrate different interaction behaviors of solitons in birefringent fiber depending on the choice of control parameters. Our results shows that interactions of optical solitons have some specific applications such as construction of logic gates, optical computing, soliton switching, and soliton amplification in wavelength division multiplexing (WDM) system.

Keywords: optical soliton, soliton interaction, soliton switching, WDM

Procedia PDF Downloads 505
3523 Explaining Irregularity in Music by Entropy and Information Content

Authors: Lorena Mihelac, Janez Povh

Abstract:

In 2017, we conducted a research study using data consisting of 160 musical excerpts from different musical styles, to analyze the impact of entropy of the harmony on the acceptability of music. In measuring the entropy of harmony, we were interested in unigrams (individual chords in the harmonic progression) and bigrams (the connection of two adjacent chords). In this study, it has been found that 53 musical excerpts out from 160 were evaluated by participants as very complex, although the entropy of the harmonic progression (unigrams and bigrams) was calculated as low. We have explained this by particularities of chord progression, which impact the listener's feeling of complexity and acceptability. We have evaluated the same data twice with new participants in 2018 and with the same participants for the third time in 2019. These three evaluations have shown that the same 53 musical excerpts, found to be difficult and complex in the study conducted in 2017, are exhibiting a high feeling of complexity again. It was proposed that the content of these musical excerpts, defined as “irregular,” is not meeting the listener's expectancy and the basic perceptual principles, creating a higher feeling of difficulty and complexity. As the “irregularities” in these 53 musical excerpts seem to be perceived by the participants without being aware of it, affecting the pleasantness and the feeling of complexity, they have been defined as “subliminal irregularities” and the 53 musical excerpts as “irregular.” In our recent study (2019) of the same data (used in previous research works), we have proposed a new measure of the complexity of harmony, “regularity,” based on the irregularities in the harmonic progression and other plausible particularities in the musical structure found in previous studies. We have in this study also proposed a list of 10 different particularities for which we were assuming that they are impacting the participant’s perception of complexity in harmony. These ten particularities have been tested in this paper, by extending the analysis in our 53 irregular musical excerpts from harmony to melody. In the examining of melody, we have used the computational model “Information Dynamics of Music” (IDyOM) and two information-theoretic measures: entropy - the uncertainty of the prediction before the next event is heard, and information content - the unexpectedness of an event in a sequence. In order to describe the features of melody in these musical examples, we have used four different viewpoints: pitch, interval, duration, scale degree. The results have shown that the texture of melody (e.g., multiple voices, homorhythmic structure) and structure of melody (e.g., huge interval leaps, syncopated rhythm, implied harmony in compound melodies) in these musical excerpts are impacting the participant’s perception of complexity. High information content values were found in compound melodies in which implied harmonies seem to have suggested additional harmonies, affecting the participant’s perception of the chord progression in harmony by creating a sense of an ambiguous musical structure.

Keywords: entropy and information content, harmony, subliminal (ir)regularity, IDyOM

Procedia PDF Downloads 131
3522 The Next Game Changer: 3-D Printed Musical Instruments

Authors: Leonardo Ko

Abstract:

In an era marked by rapid technological innovation, the classical instrument industry nonetheless has not seen significant change. Is this a matter of stubborn traditionalism, or do old, conventional instruments really sound better? Because of the widespread use of 3-D printing, it seems feasible to produce modern, 3-D printed instruments that adhere to the basic conventions of standard construction. This study aimed to design and create a practical, effective 3-D printed acoustic violin. A cost-benefit analysis of materials and design is presented in addition to a report on sound tests in which a pool of professional musicians compared the traditional violin to its synthetic counterpart with regard to acoustic properties. With a low-cost yet functional instrument, musicians of all levels would be able to afford instruments with much greater ease; the present study thus hopes to contribute to efforts to increase the accessibility of classical music education.

Keywords: acoustic musical instrument, classical musical education, low-cost, 3-D printing

Procedia PDF Downloads 229
3521 Gene Names Identity Recognition Using Siamese Network for Biomedical Publications

Authors: Micheal Olaolu Arowolo, Muhammad Azam, Fei He, Mihail Popescu, Dong Xu

Abstract:

As the quantity of biological articles rises, so does the number of biological route figures. Each route figure shows gene names and relationships. Annotating pathway diagrams manually is time-consuming. Advanced image understanding models could speed up curation, but they must be more precise. There is rich information in biological pathway figures. The first step to performing image understanding of these figures is to recognize gene names automatically. Classical optical character recognition methods have been employed for gene name recognition, but they are not optimized for literature mining data. This study devised a method to recognize an image bounding box of gene name as a photo using deep Siamese neural network models to outperform the existing methods using ResNet, DenseNet and Inception architectures, the results obtained about 84% accuracy.

Keywords: biological pathway, gene identification, object detection, Siamese network

Procedia PDF Downloads 291
3520 Tonal Pitch Structure as a Tool of Social Consolidation

Authors: Piotr Podlipniak

Abstract:

Social consolidation has often been indicated as an adaptive function of music which led to the evolution of music faculty. According to many scholars this function is possible thanks to musical rhythm that enables sensorimotor synchronization to a musical beat. The ability to synchronize to music allows performing music collectively which enhances social cohesion. However, the collective performance of music consists also in spectral synchronization that depends on musical pitch structure. Similarly to rhythmic synchronization, spectral synchronization is a result of ‘brain states alignment’ between people who collectively listen to or perform music. In order to successfully synchronize pitches performers have to adequately expect the pitch structure. The most common form of music which predominates among all human societies is tonal music. In fact tonality understood in the broadest sense as such an organization of musical pitches in which some pitch is more important than others is the only kind of musical pitch structure that has been observed in all currently known musical cultures. The perception of such a musical pitch structure elicits specific emotional reactions which are often described as tensions and relaxations. These facts provoke some important questions. What is the evolutionary reason that people use pitch structure as a form of vocal communication? Why different pitch structures elicit different emotional states independent of extra-musical context? It is proposed in the current presentation that in the course of evolution pitch structure became a human specific tool of communication the function of which is to induce emotional states such as uncertainty and cohesion. By the means of eliciting these emotions during collective music performance people are able to unconsciously give cues concerning social acceptance. This is probably one of the reasons why in all cultures people collectively perform tonal music. It is also suggested that tonal pitch structure had been invented socially before it became an evolutionary innovation of Homo sapiens. It means that a predisposition to tonally organize pitches evolved by the means of ‘Baldwin effect’ – a process in which natural selection transforms the learned response of an organism into the instinctive response. The hypothetical evolutionary scenario of the emergence of tonal pitch structure will be proposed. In this scenario social forces such as a need for closer cooperation play the crucial role.

Keywords: emotion, evolution, tonality, social consolidation

Procedia PDF Downloads 323
3519 Research and Innovations in Music Teacher Training Programme in Hungary

Authors: Monika Benedek

Abstract:

Improvisation is an integral part of music education programmes worldwide since teachers recognize that improvisation helps to broaden stylistic knowledge, develops creativity and various musical skills, in particular, aural skills, and also motivates to learn music theory. In Hungary, where Kodály concept is a core element of music teacher education, improvisation has been relatively neglected subject in both primary school and classical music school curricula. Therefore, improvisation was an important theme of a one-year-long research project carried out at the Liszt Academy of Music in Budapest. The project aimed to develop the music teacher training programme, and among others, focused on testing how improvisation could be used as a teaching tool to improve students’ musical reading and writing skills and creative musical skills. Teacher-researchers first tested various teaching approaches of improvisation with numerous teaching modules in music lessons at public schools and music schools. Data were collected from videos of lessons and from teachers’ reflective notes. After analysing data and developing teaching modules, all modules were tested again in a pilot course in 30 contact lessons for music teachers. Teachers gave written feedback of the pilot programme, tested two modules by their choice in their own teaching and wrote reflecting comments about their experiences in applying teaching modules of improvisation. The overall results indicated that improvisation could be an innovative approach to teaching various musical subjects, in particular, solfege, music theory, and instrument, either in individual or in group instruction. Improvisation, especially with the application of relative solmisation and singing, appeared to have been a beneficial tool to develop various musicianship skills of students and teachers, in particular, the aural, musical reading and writing skills, and creative musical skills. Furthermore, improvisation seemed to have been a motivating teaching tool to learn music theory by creating a bridge between various musical styles. This paper reports on the results of the research project.

Keywords: improvisation, Kodály concept, music school, public school, teacher training

Procedia PDF Downloads 144
3518 The Effect of Aluminum Oxide Nanoparticles on the Optical Properties of (PVP-PEG) Blend

Authors: Hussein Hakim, Zainab Al-Ramadhan, Ahmed Hashim

Abstract:

Polymer nano composites of polyvinylpyrrolidone and poly-ethylene glycol with different concentrations of aluminum oxide (Al2O3) nano particles have been prepared by solution cast method. The optical characterizations have been done by analyzing the absorption (A) spectra in the 300–800 nm spectral region. It was found that the optical energy gap decreases with the increasing of Al2O3 nano particles content. The optical constants (refractive index, extinction coefficient, real and imaginary parts of the dielectric constant) are changing with increasing aluminum oxide nano particle concentrations.

Keywords: nanocomposites, polyvinylpyrrolidone, optical constants, polymers, blend

Procedia PDF Downloads 424
3517 Semiconductor Device of Tapered Waveguide for Broadband Optical Communications

Authors: Keita Iwai, Isao Tomita

Abstract:

To expand the optical spectrum for use in broadband optical communications, we study the properties of a semiconductor waveguide device with a tapered structure including its third-order optical nonlinearity. Spectral-broadened output by the tapered structure has the potential to create a compact, built-in device for optical communications. Here we deal with a compound semiconductor waveguide, the material of which is the same as that of laser diodes used in the communication systems, i.e., InₓGa₁₋ₓAsᵧP₁₋ᵧ, which has large optical nonlinearity. We confirm that our structure widens the output spectrum sufficiently by controlling its taper form factor while utilizing the large nonlinear refraction of InₓGa₁₋ₓAsᵧP₁₋ᵧ. We also examine the taper effect for nonlinear optical loss.

Keywords: InₓGa₁₋ₓAsᵧP₁₋ᵧ, waveguide, nonlinear refraction, spectral spreading, taper device

Procedia PDF Downloads 151
3516 Comparing Emotion Recognition from Voice and Facial Data Using Time Invariant Features

Authors: Vesna Kirandziska, Nevena Ackovska, Ana Madevska Bogdanova

Abstract:

The problem of emotion recognition is a challenging problem. It is still an open problem from the aspect of both intelligent systems and psychology. In this paper, both voice features and facial features are used for building an emotion recognition system. A Support Vector Machine classifiers are built by using raw data from video recordings. In this paper, the results obtained for the emotion recognition are given, and a discussion about the validity and the expressiveness of different emotions is presented. A comparison between the classifiers build from facial data only, voice data only and from the combination of both data is made here. The need for a better combination of the information from facial expression and voice data is argued.

Keywords: emotion recognition, facial recognition, signal processing, machine learning

Procedia PDF Downloads 315
3515 Optical and Dielectric Properties of Self-Assembled 0D Hybrid Organic-Inorganic Insulator

Authors: S. Kassou, R. El Mrabet, A. Belaaraj, P. Guionneau, N. Hadi, T. Lamcharfi

Abstract:

The organic–inorganic hybrid perovskite-like [C6H5C2H4NH3]2ZnCl4 (PEA-ZnCl4) was synthesized by saturated solutions method. X-ray powder diffraction, Raman spectroscopy, UV-visible transmittance, and capacitance meter measurements have been used to characterize the structure, the functional groups, the optical parameters, and the dielectric constants of the material. The material has a layered structure. The optical transmittance (T %) was recorded and applied to deduce the absorption coefficient (α) and optical band gap (Eg). The hybrid shows an insulator character with a direct band gap about 4.46 eV, and presents high dielectric constants up to a frequency of about 105 Hz, which suggests a ferroelectric behavior. The reported optical and dielectric properties can help to understand the fundamental properties of perovskite materials and also to be used for optimizing or designing new devices.

Keywords: dielectric constants, optical band gap (eg), optical parameters, Raman spectroscopy, self-assembly organic inorganic hybrid

Procedia PDF Downloads 402
3514 Possibilities, Challenges and the State of the Art of Automatic Speech Recognition in Air Traffic Control

Authors: Van Nhan Nguyen, Harald Holone

Abstract:

Over the past few years, a lot of research has been conducted to bring Automatic Speech Recognition (ASR) into various areas of Air Traffic Control (ATC), such as air traffic control simulation and training, monitoring live operators for with the aim of safety improvements, air traffic controller workload measurement and conducting analysis on large quantities controller-pilot speech. Due to the high accuracy requirements of the ATC context and its unique challenges, automatic speech recognition has not been widely adopted in this field. With the aim of providing a good starting point for researchers who are interested bringing automatic speech recognition into ATC, this paper gives an overview of possibilities and challenges of applying automatic speech recognition in air traffic control. To provide this overview, we present an updated literature review of speech recognition technologies in general, as well as specific approaches relevant to the ATC context. Based on this literature review, criteria for selecting speech recognition approaches for the ATC domain are presented, and remaining challenges and possible solutions are discussed.

Keywords: automatic speech recognition, asr, air traffic control, atc

Procedia PDF Downloads 399
3513 A Contribution to Human Activities Recognition Using Expert System Techniques

Authors: Malika Yaici, Soraya Aloui, Sara Semchaoui

Abstract:

This paper deals with human activity recognition from sensor data. It is an active research area, and the main objective is to obtain a high recognition rate. In this work, a recognition system based on expert systems is proposed; the recognition is performed using the objects, object states, and gestures and taking into account the context (the location of the objects and of the person performing the activity, the duration of the elementary actions and the activity). The system recognizes complex activities after decomposing them into simple, easy-to-recognize activities. The proposed method can be applied to any type of activity. The simulation results show the robustness of our system and its speed of decision.

Keywords: human activity recognition, ubiquitous computing, context-awareness, expert system

Procedia PDF Downloads 118
3512 Switching to the Latin Alphabet in Kazakhstan: A Brief Overview of Character Recognition Methods

Authors: Ainagul Yermekova, Liudmila Goncharenko, Ali Baghirzade, Sergey Sybachin

Abstract:

In this article, we address the problem of Kazakhstan's transition to the Latin alphabet. The transition process started in 2017 and is scheduled to be completed in 2025. In connection with these events, the problem of recognizing the characters of the new alphabet is raised. Well-known character recognition programs such as ABBYY FineReader, FormReader, MyScript Stylus did not recognize specific Kazakh letters that were used in Cyrillic. The author tries to give an assessment of the well-known method of character recognition that could be in demand as part of the country's transition to the Latin alphabet. Three methods of character recognition: template, structured, and feature-based, are considered through the algorithms of operation. At the end of the article, a general conclusion is made about the possibility of applying a certain method to a particular recognition process: for example, in the process of population census, recognition of typographic text in Latin, or recognition of photos of car numbers, store signs, etc.

Keywords: text detection, template method, recognition algorithm, structured method, feature method

Procedia PDF Downloads 186
3511 Recognizing an Individual, Their Topic of Conversation and Cultural Background from 3D Body Movement

Authors: Gheida J. Shahrour, Martin J. Russell

Abstract:

The 3D body movement signals captured during human-human conversation include clues not only to the content of people’s communication but also to their culture and personality. This paper is concerned with automatic extraction of this information from body movement signals. For the purpose of this research, we collected a novel corpus from 27 subjects, arranged them into groups according to their culture. We arranged each group into pairs and each pair communicated with each other about different topics. A state-of-art recognition system is applied to the problems of person, culture, and topic recognition. We borrowed modeling, classification, and normalization techniques from speech recognition. We used Gaussian Mixture Modeling (GMM) as the main technique for building our three systems, obtaining 77.78%, 55.47%, and 39.06% from the person, culture, and topic recognition systems respectively. In addition, we combined the above GMM systems with Support Vector Machines (SVM) to obtain 85.42%, 62.50%, and 40.63% accuracy for person, culture, and topic recognition respectively. Although direct comparison among these three recognition systems is difficult, it seems that our person recognition system performs best for both GMM and GMM-SVM, suggesting that inter-subject differences (i.e. subject’s personality traits) are a major source of variation. When removing these traits from culture and topic recognition systems using the Nuisance Attribute Projection (NAP) and the Intersession Variability Compensation (ISVC) techniques, we obtained 73.44% and 46.09% accuracy from culture and topic recognition systems respectively.

Keywords: person recognition, topic recognition, culture recognition, 3D body movement signals, variability compensation

Procedia PDF Downloads 541
3510 Patented Free-Space Optical System for Auto Aligned Optical Beam Allowing to Compensate Mechanical Misalignments

Authors: Aurelien Boutin

Abstract:

In optical systems such as Variable Optical Delay Lines, where a collimated beam has to go back and forth, corner cubes are used in order to keep the reflected beam parallel to the incoming beam. However, the reflected beam can be laterally shifted, which will lead to losses. In this paper, we report on a patented optical design that allows keeping the reflected beam with the exact same position and direction whatever the displacement of the corner cube leading to zero losses. After explaining how the optical design works and theoretically allows to compensate for any defects in the translation of the corner cube, we will present the results of experimental comparisons between a standard layout (i.e., only corner cubes) and our optical layout. To compare both optical layouts, we used a fiber-to-fiber coupling setup. It consists of a couple of lights from one fiber to the other, thanks to two lenses. The ensemble [fiber+lense] is fixed and called a collimator so that the light is coupled from one collimator to another. Each collimator was precisely made in order to have a precise working distance. In the experiment, we measured and compared the Insertion Losses (IL) variations between both collimators with the distance between them (i.e., natural Gaussian beam coupling losses) and between both collimators in the different optical layouts tested, with the same optical length propagation. We will show that the IL variations of our setup are less than 0.05dB with respect to the IL variations of collimators alone.

Keywords: free-space optics, variable optical delay lines, optical cavity, auto-alignment

Procedia PDF Downloads 99
3509 Vibro-Tactile Equalizer for Musical Energy-Valence Categorization

Authors: Dhanya Nair, Nicholas Mirchandani

Abstract:

Musical haptic systems can enhance a listener’s musical experience while providing an alternative platform for the hearing impaired to experience music. Current music tactile technologies focus on representing tactile metronomes to synchronize performers or encoding musical notes into distinguishable (albeit distracting) tactile patterns. There is growing interest in the development of musical haptic systems to augment the auditory experience, although the haptic-music relationship is still not well understood. This paper represents a tactile music interface that provides vibrations to multiple fingertips in synchronicity with auditory music. Like an audio equalizer, different frequency bands are filtered out, and the power in each frequency band is computed and converted to a corresponding vibrational strength. These vibrations are felt on different fingertips, each corresponding to a different frequency band. Songs with music from different spectrums, as classified by their energy and valence, were used to test the effectiveness of the system and to understand the relationship between music and tactile sensations. Three participants were trained on one song categorized as sad (low energy and low valence score) and one song categorized as happy (high energy and high valence score). They were trained both with and without auditory feedback (listening to the song while experiencing the tactile music on their fingertips and then experiencing the vibrations alone without the music). The participants were then tested on three songs from both categories, without any auditory feedback, and were asked to classify the tactile vibrations they felt into either category. The participants were blinded to the songs being tested and were not provided any feedback on the accuracy of their classification. These participants were able to classify the music with 100% accuracy. Although the songs tested were on two opposite spectrums (sad/happy), the preliminary results show the potential of utilizing a vibrotactile equalizer, like the one presented, for augmenting musical experience while furthering the current understanding of music tactile relationship.

Keywords: haptic music relationship, tactile equalizer, tactile music, vibrations and mood

Procedia PDF Downloads 181
3508 Deflection Effect on Mirror for Space Applications

Authors: Maamar Fatouma

Abstract:

Mirror optical performance can experience varying levels of stress and tolerances, which can have a notable impact on optical parametric systems. to ensure proper optical figure and position of mirror mounting within design tolerances, it is crucial to have a robust support structure in place for optical systems. The optical figure tolerance determines the allowable deviation from the ideal form of the mirror and the position tolerance determines the location and orientations of the optical axis of the optical systems. A variety of factors influence the optical figure of the mirror. Included are self-weight (Deflection), excitation from temperature change, temperature gradients and dimensional instability. This study employs an analytical approach and finite element method to examine the effects of stress resulting from mirror mounting on the wavefront passing through the mirror. The combined effect of tolerance and deflection on mirror performance is represented by an error budget. Numerical mirror mounting is presented to illustrate the space application of performance techniques.

Keywords: opto-mechanical, bonded optic, tolerance, self-weight distortion, Rayleigh criteria

Procedia PDF Downloads 89
3507 Ideas for Musical Activities and Games in the Early Year (IMAGINE-Autism): A Case Study Approach

Authors: Tania Lisboa, Angela Voyajolu, Adam Ockelford

Abstract:

The positive impact of music on the development of children with autism is widely acknowledged: music offers a unique channel for communication, wellbeing and self-regulation, as well as access to culture and a means of creative engagement. Yet, no coherent program exists for parents, carers and teachers to follow with their children in the early years, when the need for interventions is often most acute. Hence, research and the development of resources is urgently required. Autism is a project with children on the autism spectrum. The project aims at promoting the participants’ engagement with music through involvement in specially-designed musical activities with parents and carers. The main goal of the research is to verify the effectiveness of newly designed resources and strategies, which are based on the Sounds of Intent in the Early Years (SoI-EY) framework of musical development. This is a pilot study, comprising case studies of five children with autism in the early years. The data comprises semi-structured interviews, observations of videos, and feedback from parents on resources. Interpretative Phenomenological Analysis was chosen to analyze the interviews. The video data was coded in relation to the SoI-EY framework. The feedback from parents was used to evaluate the resources (i.e. musical activity cards). The participants’ wider development was also assessed through selected elements of the Early Years Foundation Stage (EYFS), a national assessment framework used in England: specifically, communication, language and social-emotional development. Five families of children on the autism spectrum (aged between 4-8 years) participated in the pilot. The research team visited each family 4 times over a 3-month period, during which the children were observed, and musical activities were suggested based on the child’s assessed level of musical development. Parents then trialed the activities, providing feedback and gathering further video observations of their child’s musical engagement between visits. The results of one case study will be featured in this paper, in which the evidence suggests that specifically tailored musical activity may promote communication and social engagement for a child with language difficulties on the autism spectrum. The resources were appropriate for the children’s involvement in musical activities. Findings suggest that non-specialist musical engagement with family and carers can be a powerful means to foster communication. The case study featured in this paper illustrates this with a child of limited verbal ability. There is a need for further research and development of resources that can be made available to all those working with children on the autism spectrum.

Keywords: autism, development, music education, resources

Procedia PDF Downloads 103
3506 Multicasting Characteristics of All-Optical Triode Based on Negative Feedback Semiconductor Optical Amplifiers

Authors: S. Aisyah Azizan, M. Syafiq Azmi, Yuki Harada, Yoshinobu Maeda, Takaomi Matsutani

Abstract:

We introduced an all-optical multi-casting characteristics with wavelength conversion based on a novel all-optical triode using negative feedback semiconductor optical amplifier. This study was demonstrated with a transfer speed of 10 Gb/s to a non-return zero 231-1 pseudorandom bit sequence system. This multi-wavelength converter device can simultaneously provide three channels of output signal with the support of non-inverted and inverted conversion. We studied that an all-optical multi-casting and wavelength conversion accomplishing cross gain modulation is effective in a semiconductor optical amplifier which is effective to provide an inverted conversion thus negative feedback. The relationship of received power of back to back signal and output signals with wavelength 1535 nm, 1540 nm, 1545 nm, 1550 nm, and 1555 nm with bit error rate was investigated. It was reported that the output signal wavelengths were successfully converted and modulated with a power penalty of less than 8.7 dB, which the highest is 8.6 dB while the lowest is 4.4 dB. It was proved that all-optical multi-casting and wavelength conversion using an optical triode with a negative feedback by three channels at the same time at a speed of 10 Gb/s is a promising device for the new wavelength conversion technology.

Keywords: cross gain modulation, multicasting, negative feedback optical amplifier, semiconductor optical amplifier

Procedia PDF Downloads 684
3505 Many-Body Effect on Optical Gain of n+ Doping Tensile-Strained Ge/GeSiSn Quantum Wells

Authors: W. J. Fan, B. S. Ma

Abstract:

The many-body effect on band structure and optical gain of n+ doping tensile-strained Ge/GeSiSn quantum wells are investigated by using an 8-band k•p method. Phase diagram of Ge/GeSiSn quantum well is obtained. The E-k dispersion curves, band gap renormalization and optical gain spectra including many-body effect will be calculated and discussed. We find that the k.p method without many-body effect will overestimate the optical gain and transition energy.

Keywords: Si photonics, many-body effect, optical gain, Ge-on-Si, Quantum well

Procedia PDF Downloads 734