Search results for: audio fingerprinting

401 Getting Out of the Box: Tangible Music Production in the Age of Virtual Technological Abundance

Abstract:

This paper seeks to explore the different ways in which music producers choose to embrace various levels of technology based on musical values, objectives, affordability, access and workflow benefits. Current digital audio production workflow is questioned. Engineers and music producers of today are increasingly divorced from the tangibility of music production. Making music no longer requires you to reach over and turn a knob. Ideas of authenticity in music production are being redefined. Calculations from the mathematical algorithm with the pretty pictures are increasingly being chosen over hardware containing transformers and tubes. Are mouse clicks and movements equivalent or inferior to the master brush strokes we are seeking to conjure? We are making audio production decisions visually by constantly looking at a screen rather than listening. Have we compromised our music objectives and values by removing the ‘hands-on’ nature of music making? DAW interfaces are making our musical decisions for us not necessarily in our best interests. Technological innovation has presented opportunities as well as challenges for education. What do music production students actually need to learn in a formalised education environment, and to what extent do they need to know it? In this brave new world of omnipresent music creation tools, do we still need tangibility in music production? Interviews with prominent Australian music producers that work in a variety of fields will be featured in this paper, and will provide insight in answering these questions and move towards developing an understanding how tangibility can be rediscovered in the next generation of music production.

Keywords: analogue, digital, digital audio workstation, music production, plugins, tangibility, technology, workflow

Procedia PDF Downloads 271

400 Wolof Voice Response Recognition System: A Deep Learning Model for Wolof Audio Classification

Authors: Krishna Mohan Bathula, Fatou Bintou Loucoubar, FNU Kaleemunnisa, Christelle Scharff, Mark Anthony De Castro

Abstract:

Voice recognition algorithms such as automatic speech recognition and text-to-speech systems with African languages can play an important role in bridging the digital divide of Artificial Intelligence in Africa, contributing to the establishment of a fully inclusive information society. This paper proposes a Deep Learning model that can classify the user responses as inputs for an interactive voice response system. A dataset with Wolof language words ‘yes’ and ‘no’ is collected as audio recordings. A two stage Data Augmentation approach is adopted for enhancing the dataset size required by the deep neural network. Data preprocessing and feature engineering with Mel-Frequency Cepstral Coefficients are implemented. Convolutional Neural Networks (CNNs) have proven to be very powerful in image classification and are promising for audio processing when sounds are transformed into spectra. For performing voice response classification, the recordings are transformed into sound frequency feature spectra and then applied image classification methodology using a deep CNN model. The inference model of this trained and reusable Wolof voice response recognition system can be integrated with many applications associated with both web and mobile platforms.

Keywords: automatic speech recognition, interactive voice response, voice response recognition, wolof word classification

Procedia PDF Downloads 116

399 Improving Fingerprinting-Based Localization System Using Generative Artificial Intelligence

Authors: Getaneh Berie Tarekegn

Abstract:

A precise localization system is crucial for many artificial intelligence Internet of Things (AI-IoT) applications in the era of smart cities. Their applications include traffic monitoring, emergency alarming, environmental monitoring, location-based advertising, intelligent transportation, and smart health care. The most common method for providing continuous positioning services in outdoor environments is by using a global navigation satellite system (GNSS). Due to nonline-of-sight, multipath, and weather conditions, GNSS systems do not perform well in dense urban, urban, and suburban areas.This paper proposes a generative AI-based positioning scheme for large-scale wireless settings using fingerprinting techniques. In this article, we presented a novel semi-supervised deep convolutional generative adversarial network (S-DCGAN)-based radio map construction method for real-time device localization. We also employed a reliable signal fingerprint feature extraction method with t-distributed stochastic neighbor embedding (t-SNE), which extracts dominant features while eliminating noise from hybrid WLAN and long-term evolution (LTE) fingerprints. The proposed scheme reduced the workload of site surveying required to build the fingerprint database by up to 78.5% and significantly improved positioning accuracy. The results show that the average positioning error of GAILoc is less than 39 cm, and more than 90% of the errors are less than 82 cm. That is, numerical results proved that, in comparison to traditional methods, the proposed SRCLoc method can significantly improve positioning performance and reduce radio map construction costs.

Keywords: location-aware services, feature extraction technique, generative adversarial network, long short-term memory, support vector machine

Procedia PDF Downloads 71

398 GAILoc: Improving Fingerprinting-Based Localization System Using Generative Artificial Intelligence

Authors: Getaneh Berie Tarekegn

Abstract:

A precise localization system is crucial for many artificial intelligence Internet of Things (AI-IoT) applications in the era of smart cities. Their applications include traffic monitoring, emergency alarming, environmental monitoring, location-based advertising, intelligent transportation, and smart health care. The most common method for providing continuous positioning services in outdoor environments is by using a global navigation satellite system (GNSS). Due to nonline-of-sight, multipath, and weather conditions, GNSS systems do not perform well in dense urban, urban, and suburban areas.This paper proposes a generative AI-based positioning scheme for large-scale wireless settings using fingerprinting techniques. In this article, we presented a novel semi-supervised deep convolutional generative adversarial network (S-DCGAN)-based radio map construction method for real-time device localization. We also employed a reliable signal fingerprint feature extraction method with t-distributed stochastic neighbor embedding (t-SNE), which extracts dominant features while eliminating noise from hybrid WLAN and long-term evolution (LTE) fingerprints. The proposed scheme reduced the workload of site surveying required to build the fingerprint database by up to 78.5% and significantly improved positioning accuracy. The results show that the average positioning error of GAILoc is less than 39 cm, and more than 90% of the errors are less than 82 cm. That is, numerical results proved that, in comparison to traditional methods, the proposed SRCLoc method can significantly improve positioning performance and reduce radio map construction costs.

Keywords: location-aware services, feature extraction technique, generative adversarial network, long short-term memory, support vector machine

Procedia PDF Downloads 75

397 Examining the Design of a Scaled Audio Tactile Model for Enhancing Interpretation of Visually Impaired Visitors in Heritage Sites

Authors: A. Kavita Murugkar, B. Anurag Kashyap

Abstract:

With the Rights for Persons with Disabilities Act (RPWD Act) 2016, the Indian government has made it mandatory for all establishments, including Heritage Sites, to be accessible for People with Disabilities. However, recent access audit surveys done under the Accessible India Campaign by Ministry of Culture indicate that there are very few accessibility measures provided in the Heritage sites for people with disabilities. Though there are some measures for the mobility impaired, surveys brought out that there are almost no provisions for people with vision impairment (PwVI) in heritage sites thus depriving them of a reasonable physical & intellectual access that facilitates an enjoyable experience and enriching interpretation of the Heritage Site. There is a growing need to develop multisensory interpretative tools that can help the PwVI in perceiving heritage sites in the absence of vision. The purpose of this research was to examine the usability of an audio-tactile model as a haptic and sound-based strategy for augmenting the perception and experience of PwVI in a heritage site. The first phase of the project was a multi-stage phenomenological experimental study with visually impaired users to investigate the design parameters for developing an audio-tactile model for PwVI. The findings from this phase included user preferences related to the physical design of the model such as the size, scale, materials, details, etc., and the information that it will carry such as braille, audio output, tactile text, etc. This was followed by the second phase in which a working prototype of an audio-tactile model is designed and developed for a heritage site based on the findings from the first phase of the study. A nationally listed heritage site from the author’s city was selected for making the model. The model was lastly tested by visually impaired users for final refinements and validation. The prototype developed empowers People with Vision Impairment to navigate independently in heritage sites. Such a model if installed in every heritage site, can serve as a technological guide for the Person with Vision Impairment, giving information of the architecture, details, planning & scale of the buildings, the entrances, location of important features, lifts, staircases, and available, accessible facilities. The model was constructed using 3D modeling and digital printing technology. Though designed for the Indian context, this assistive technology for the blind can be explored for wider applications across the globe. Such an accessible solution can change the otherwise “incomplete’’ perception of the disabled visitor, in this case, a visually impaired visitor and augment the quality of their experience in heritage sites.

Keywords: accessibility, architectural perception, audio tactile model , inclusive heritage, multi-sensory perception, visual impairment, visitor experience

Procedia PDF Downloads 106

396 Crosssampler: A Digital Convolution Cross Synthesis Instrument

Authors: Jimmy Eadie

Abstract:

Convolutional Cross Synthesis (CCS) has emerged as a powerful technique for blending input signals to create hybrid sounds. It has significantly expanded the horizons of digital signal processing, enabling artists to explore audio effects. However, the conventional applications of CCS primarily revolve around reverberation and room simulation rather than being utilized as a creative synthesis method. In this paper, we present the design of a digital instrument called CrossSampler that harnesses a parametric approach to convolution cross-synthesis, which involves using adjustable parameters to control the blending of audio signals through convolution. These parameters allow for customization of the resulting sound, offering greater creative control and flexibility. It enables users to shape the output by manipulating factors such as duration, intensity, and spectral characteristics. This approach facilitates experimentation and exploration in sound design and opens new sonic possibilities.

Keywords: convolution, synthesis, sampling, virtual instrument

Procedia PDF Downloads 64

395 Milk Yield and Fingerprinting of Beta-Casein Precursor (CSN2) Gene in Some Saudi Camel Breeds

Authors: Amr A. El Hanafy, Yasser M. Saad, Saleh A. Alkarim, Hussein A. Almehdar, Elrashdy M. Redwan

Abstract:

Camels are substantial providers of transport, milk, sport, meat, shelter, fuel, security and capital in many countries, particularly Saudi Arabia. Identification of animal breeds has progressed rapidly during the last decade. Advanced molecular techniques are playing a significant role in breeding or strain protection laws. On the other hand, fingerprinting of some molecular markers related to some productive traits in farm animals represents most important studies to our knowledge, which aim to conserve these local genetic resources, and to the genetic improvement of such local breeds by selective programs depending on gene markers. Milk records were taken two days in each week from female camels of Majahem, Safara, Wathaha, and Hamara breeds, respectively from different private farms in northern Jeddah, Riyadh and Alwagh governorates and average weekly yields were calculated. DNA sequencing for CSN2 gene was used for evaluating the genetic variations and calculating the genetic distance values among four Saudi camel populations which are Hamra(R), Safra(Y), Wadha(W) and Majaheim(M). In addition, this marker was analyzed for reconstructing the Neighbor joining tree among evaluating camel breeds. In respect to milk yield during winter season, result indicated that average weekly milk yield of Safara camel breed (30.05 Kg/week) is significantly (p < 0.05) lower than the other 3 breeds which ranged from 39.68 for Hamara to 42.42 Kg/week for Majahem, while there are not significant differences between these three breeds. The Neighbor Joining analysis that re-constructed based on DNA variations showed that samples are clustered into two unique clades. The first clade includes Y (from Y4 to Y18) and M (from M1, to M9). On the other hand, the second cluster is including all R (from R1 to R6) and W (from W1 to W6). The genetic distance values were equal 0.0068 (between the groups M&Y and R&W) and equal 0 (within each group).

Keywords: milk yield, beta-casein precursor (CSN2), Saudi camel, molecular markers

Procedia PDF Downloads 214

394 A Measurement and Motor Control System for Free Throw Shots in Basketball Using Gyroscope Sensor

Authors: Niloofar Zebarjad

Abstract:

This research aims at finding a tool to provide basketball players with real-time audio feedback on their shooting form in free throw shots. Free throws played a pivotal role in taking the lead in fierce competitions. The major problem in performing an accurate free throw seems to be improper training. Since the arm movement during the free throw shot is complex, the coach or the athlete might miss the movement details during practice. Hence, there is a necessity to create a system that measures arm movements' critical characteristics and control for improper kinematics. The proposed setup in this study quantifies arm kinematics and provides real-time feedback as an audio signal consisting of a gyroscope sensor. Spatial shoulder angle data are transmitted in a mobile application in real-time and can be saved and processed for statistical and analysis purposes. The proposed system is easy to use, inexpensive, portable, and real-time applicable. Objectives: This research aims to modify and control the free throw using audio feedback and determine if and to what extent the new setup reduces errors in arm formations during throws and finally assesses the successful throw rate. Methods: One group of elite basketball athletes and two novice athletes (control and study group) participated in this study. Each group contains 5 participants being studied in three separate sessions over a week. Results: Empirical results showed enhancements in the free throw shooting style, shot pocket (SP), and locked position (LP). The mean values of shoulder angle were controlled on 25° and 45° for SP and LP, respectively, recommended by valid FIBA references. Conclusion: Throughout the experiments, the system helped correct and control the shoulder angles toward the targeted pattern of shot pocket (SP) and locked position (LP). According to the desired results for arm motion, adding another sensor to measure and control the elbow angle is recommended.

Keywords: audio-feedback, basketball, free-throw, locked-position, motor-control, shot-pocket

Procedia PDF Downloads 295

393 OPEN-EmoRec-II-A Multimodal Corpus of Human-Computer Interaction

Authors: Stefanie Rukavina, Sascha Gruss, Steffen Walter, Holger Hoffmann, Harald C. Traue

Abstract:

OPEN-EmoRecII is an open multimodal corpus with experimentally induced emotions. In the first half of the experiment, emotions were induced with standardized picture material and in the second half during a human-computer interaction (HCI), realized with a wizard-of-oz design. The induced emotions are based on the dimensional theory of emotions (valence, arousal and dominance). These emotional sequences - recorded with multimodal data (mimic reactions, speech, audio and physiological reactions) during a naturalistic-like HCI-environment one can improve classification methods on a multimodal level. This database is the result of an HCI-experiment, for which 30 subjects in total agreed to a publication of their data including the video material for research purposes. The now available open corpus contains sensory signal of: video, audio, physiology (SCL, respiration, BVP, EMG Corrugator supercilii, EMG Zygomaticus Major) and mimic annotations.

Keywords: open multimodal emotion corpus, annotated labels, intelligent interaction

Procedia PDF Downloads 416

392 Broadcast Routing in Vehicular Ad hoc Networks (VANETs)

Authors: Muazzam A. Khan, Muhammad Wasim

Abstract:

Vehicular adhoc network (VANET) Cars for network (VANET) allowing vehicles to talk to each other, which is committed to building a strong network of mobile vehicles is technical. In VANETs vehicles are equipped with special devices that can get and share info with the atmosphere and other vehicles in the network. Depending on this data security and safety of the vehicles can be enhanced. Broadcast routing is dispersion of any audio or visual medium of mass communication scattered audience distribute audio and video content, but usually using electromagnetic radiation (waves). The lack of server or fixed infrastructure media messages in VANETs plays an important role for every individual application. Broadcast Message VANETs still open research challenge and requires some effort to come to good solutions. This paper starts with a brief introduction of VANET, its applications, and the law of the message-trends in this network starts. This work provides an important and comprehensive study of reliable broadcast routing in VANET scenario.

Keywords: vehicular ad-hoc network , broadcasting, networking protocols, traffic pattern, low intensity conflict

Procedia PDF Downloads 532

391 Genotyping and Phylogeny of Phaeomoniella Genus Associated with Grapevine Trunk Diseases in Algeria

Authors: A. Berraf-Tebbal, Z. Bouznad, , A.J.L. Phillips

Abstract:

Phaeomoniella is a fungus genus in the mitosporic ascomycota which includes Phaeomoniella chlamydospora specie associated with two declining diseases on grapevine (Vitis vinifera) namely Petri disease and esca. Recent studies have shown that several Phaeomoniella species also cause disease on many other woody crops, such as forest trees and woody ornamentals. Two new species, Phaeomoniella zymoides and Phaeomoniella pinifoliorum H.B. Lee, J.Y. Park, R.C. Summerbell et H.S. Jung, were isolated from the needle surface of Pinus densiflora Sieb. et Zucc. in Korea. The identification of species in Phaeomoniella genus can be a difficult task if based solely on morphological and cultural characters. In this respect, the application of molecular methods, particularly PCR-based techniques, may provide an important contribution. MSP-PCR (microsatellite primed-PCR) fingerprinting has proven useful in the molecular typing of fungal strains. The high discriminatory potential of this method is particularly useful when dealing with closely related or cryptic species. In the present study, the application of PCR fingerprinting was performed using the micro satellite primer M13 for the purpose of species identification and strain typing of 84 Phaeomoniella -like isolates collected from grapevines with typical symptoms of dieback. The bands produced by MSP-PCR profiles divided the strains into 3 clusters and 5 singletons with a reproducibility level of 80%. Representative isolates from each group and, when possible, isolates from Eutypa dieback and esca symptoms were selected for sequencing of the ITS region. The ITS sequences for the 16 isolates selected from the MSP-PCR profiles were combined and aligned with sequences of 18 isolates retrieved from GenBank, representing a selection of all known Phaeomoniella species. DNA sequences were compared with those available in GenBank using Neighbor-joining (NJ) and Maximum-parsimony (MP) analyses. The phylogenetic trees of the ITS region revealed that the Phaeomoniella isolates clustered with Phaeomoniella chlamydospora reference sequences with a bootstrap support of 100 %. The complexity of the pathosystems vine-trunk diseases shows clearly the need to identify unambiguously the fungal component in order to allow a better understanding of the etiology of these diseases and justify the establishment of control strategies against these fungal agents.

Keywords: Genotyping, MSP-PCR, ITS, phylogeny, trunk diseases

Procedia PDF Downloads 480

390 The Effectiveness of Using MS SharePoint for the Curriculum Repository System

Authors: Misook Ahn

Abstract:

This study examines the Institutional Curriculum Repository (ICR) developed with MS SharePoint. The purpose of using MS SharePoint is to organize, share, and manage the curriculum data. The ICR aims to build a centralized curriculum infrastructure, preserve all curriculum materials, and provide academic service to users (faculty, students, or other agencies). The ICR collection includes core language curriculum materials developed by each language school—foreign language textbooks, language survival kits, and audio files currently in or not in use at the schools. All core curriculum materials with audio and video files have been coded, collected, and preserved at the ICR. All metadata for the collected curriculum materials have been input by language, code, year, book type, level, user, version, and current status (in use/not in use). The qualitative content analysis, including the survey data, is used to evaluate the effectiveness of using MS SharePoint for the repository system. This study explains how to manage and preserve curriculum materials with MS SharePoint, along with challenges and suggestions for further research. This study will be beneficial to other universities or organizations considering archiving or preserving educational materials.

Keywords: digital preservation, ms sharepoint, repository, curriculum materials

Procedia PDF Downloads 105

389 A Combined Feature Extraction and Thresholding Technique for Silence Removal in Percussive Sounds

Authors: B. Kishore Kumar, Pogula Rakesh, T. Kishore Kumar

Abstract:

The music analysis is a part of the audio content analysis used to analyze the music by using the different features of audio signal. In music analysis, the first step is to divide the music signal to different sections based on the feature profiles of the music signal. In this paper, we present a music segmentation technique that will effectively segmentize the signal and thresholding technique to remove silence from the percussive sounds produced by percussive instruments, which uses two features of music, namely signal energy and spectral centroid. The proposed method impose thresholds on both the features which will vary depends on the music signal. Depends on the threshold, silence part is removed and the segmentation is done. The effectiveness of the proposed method is analyzed using MATLAB.

Keywords: percussive sounds, spectral centroid, spectral energy, silence removal, feature extraction

Procedia PDF Downloads 594

388 Development of a Method to Prepare In-School Tactile Guide Maps for Visually Impaired School Children

Authors: K. Doi, T. Nishimura, M. Kawano, H. Fujimoto, Y. Tanaka, M. Sawada, S. Oouchi, T. Kaneko, K. Kanamori

Abstract:

As part of reasonable accommodation for people with disabilities in Japan, which has ratified the Convention on the Rights of Persons with Disabilities, tactile guide maps are necessary. Such maps can enable visually impaired children to attend schools of special needs education (visual impairments) to grasp the arrangement of classrooms on their school campuses. However, it takes many years to be able to use a tactile guide map without difficulty. Thus, information support, in which audio information is added in addition to tactile information, is required. In the present research, a method to prepare an in-school tactile guide map with an additional audio reading function was developed. This map can enable visually impaired school children attending schools of special needs education (visual impairments) to grasp the arrangement of classrooms on their school campuses.

Keywords: accessible design, visually impaired, braille, tactile map, in-school tactile guide map

Procedia PDF Downloads 362

387 Gas Chromatography and Mass Spectrometry in Honey Fingerprinting: The Occurrence of 3,4-dihydro-3-oxoedulan and (E)-4-(r-1',t-2',c-4'-trihydroxy-3',6',6'-trimethylcyclohexyl)-but-3-en-2-one

Authors: Igor Jerkovic

Abstract:

Owing to the attractive sensory properties and low odour thresholds, norisoprenoids (degraded carotenoid-like structures with 3,5,5-trimethylcyclohex-2-enoic unit) have been identified as aroma contributors in a number of different matrices. C₁₃-Norisoprenoids have been found among volatile organic compounds of various honey types as well as C₉//C₁₀-norisoprenoids or C₁₄/C₁₅-norisoprenoids. Besides degradation of abscisic acid (which produces, e.g., dehydrovomifoliol, vomifoliol, others), the cleavage of the C(9)=C(10) bond of other carotenoid precursors directly generates nonspecific C₁₃-norisoprenoids such as trans-β-damascenone, 3-hydroxy-trans-β-damascone, 3-oxo-α-ionol, 3-oxo-α-ionone, β-ionone found in various honey types. β-Damascenone and β-ionone smelling like honey, exhibit the lowest odour threshold values of all C₁₃-norisoprenoids. The presentation is targeted on two uncommon C₁₃-norisoprenoids in the honey flavor that could be used as specific or nonspecific chemical markers of the botanical origin. Namely, after screening of different honey types, the focus was directed on Centaruea cyanus L. and Allium ursinum L. honey. The samples were extracted by headspace solid-phase microextraction (HS-SPME) and ultrasonic solvent extraction (USE) and the extracts were analysed by gas chromatography and mass spectrometry (GC-MS). SPME fiber with divinylbenzene/carboxen/polydimethylsiloxane (DVB/CAR/PDMS) coating was applied for the research of C. cyanus honey headspace and predominant identified compound was 3,4-dihydro-3-oxoedulan (2,5,5,8a-tetramethyl-2,3,5,6,8,8a-hexahydro-7H-chromen-7-one also known as 2,3,5,6,8,8a-hexahydro-2,5,5,8a-tetramethyl-7H-1-benzo-pyran-7-one). The oxoedulan structure contains epoxide and it is more volatile in comparison with its hydroxylated precursors. This compound has not been found in other honey types and can be considered specific for C. cyanus honey. The dichloromethane extract of A. ursinum honey contained abundant (E)-4-(r-1',t-2',c-4'-trihydroxy-3',6',6'-trimethylcyclohexyl)-but-3-en-2-one that was previously isolated as dominant substance from the ether extracts of New Zealand thyme honey. Although a wide variety of degraded carotenoid-like substances have been identified from different honey types, this appears to be rare situation where 3,4-dihydro-3-oxoedulan and (E)-4-(r-1',t-2',c-4'-trihydroxy-3',6',6'-trimethylcyclohexyl)-but-3-en-2-one have been found that is of great importance for chemical fingerprinting and identification of the chemical biomarkers that can complement the pollen analysis as the major method for the honey classification.

Keywords: 3, 4-dihydro-3-oxoedulan, (E)-4-(r-1', t-2', c-4'-trihydroxy-3', 6', 6'-trimethylcyclohexyl)-but-3-en-2-one, honey flavour, C₁₃-norisoprenoids

Procedia PDF Downloads 331

386 Error Analysis of the Pronunciation of English Consonants and Arabic Consonants by Egyptian Learners

Authors: Marwa A. Nasser

Abstract:

This is an empirical study that provides an investigation of the most significant errors of Egyptian learners in producing English consonants and Arabic consonants, and advice on how these can be remedied. The study adopts a descriptive approach and the analysis is based on audio recordings of two groups of people. The first group includes six volunteers of Egyptian learners belonging to the English Department at Faculty of Women who learn English as a foreign language. The other group includes six Egyptian learners who are studying Tajweed (how to recite Quran correctly). The audio recordings were examined, and sounds were analyzed in an attempt to highlight the most common error done by the learners while reading English or reading (or reciting) Quran. Results show that the two groups of learners have problems with certain phonemic contrasts. Both groups share common errors although both languages are different and not related (e.g. pre-aspiration of fortis stops, incorrect articulation of consonants and velarization of certain sounds).

Keywords: consonant articulations, Egyptian learners of English, Egyptian learners of Quran, empirical study, error analysis, pronunciation problems

Procedia PDF Downloads 269

385 FlameCens: Visualization of Expressive Deviations in Music Performance

Authors: Y. Trantafyllou, C. Alexandraki

Abstract:

Music interpretation accounts to the way musicians shape their performance by deliberately deviating from composers’ intentions, which are commonly communicated via some form of music transcription, such as a music score. For transcribed and non-improvised music, music expression is manifested by introducing subtle deviations in tempo, dynamics and articulation during the evolution of performance. This paper presents an application, named FlameCens, which, given two recordings of the same piece of music, presumably performed by different musicians, allow visualising deviations in tempo and dynamics during playback. The application may also compare a certain performance to the music score of that piece (i.e. MIDI file), which may be thought of as an expression-neutral representation of that piece, hence depicting the expressive queues employed by certain performers. FlameCens uses the Dynamic Time Warping algorithm to compare two audio sequences, based on CENS (Chroma Energy distribution Normalized Statistics) audio features. Expressive deviations are illustrated in a moving flame, which is generated by an animation of particles. The length of the flame is mapped to deviations in dynamics, while the slope of the flame is mapped to tempo deviations so that faster tempo changes the slope to the right and slower tempo changes the slope to the left. Constant slope signifies no tempo deviation. The detected deviations in tempo and dynamics can be additionally recorded in a text file, which allows for offline investigation. Moreover, in the case of monophonic music, the color of particles is used to convey the pitch of the notes during performance. FlameCens has been implemented in Python and it is openly available via GitHub. The application has been experimentally validated for different music genres including classical, contemporary, jazz and popular music. These experiments revealed that FlameCens can be a valuable tool for music specialists (i.e. musicians or musicologists) to investigate the expressive performance strategies employed by different musicians, as well as for music audience to enhance their listening experience.

Keywords: audio synchronization, computational music analysis, expressive music performance, information visualization

Procedia PDF Downloads 130

384 1D Convolutional Networks to Compute Mel-Spectrogram, Chromagram, and Cochleogram for Audio Networks

Authors: Elias Nemer, Greg Vines

Abstract:

Time-frequency transformation and spectral representations of audio signals are commonly used in various machine learning applications. Training networks on frequency features such as the Mel-Spectrogram or Cochleogram have been proven more effective and convenient than training on-time samples. In practical realizations, these features are created on a different processor and/or pre-computed and stored on disk, requiring additional efforts and making it difficult to experiment with different features. In this paper, we provide a PyTorch framework for creating various spectral features as well as time-frequency transformation and time-domain filter-banks using the built-in trainable conv1d() layer. This allows computing these features on the fly as part of a larger network and enabling easier experimentation with various combinations and parameters. Our work extends the work in the literature developed for that end: First, by adding more of these features and also by allowing the possibility of either starting from initialized kernels or training them from random values. The code is written as a template of classes and scripts that users may integrate into their own PyTorch classes or simply use as is and add more layers for various applications.

Keywords: neural networks Mel-Spectrogram, chromagram, cochleogram, discrete Fourrier transform, PyTorch conv1d()

Procedia PDF Downloads 233

383 New Methods to Acquire Grammatical Skills in A Foreign Language

Authors: Indu ray

Abstract:

In today’s digital world the internet is already flooded with information on how to master grammar in a foreign language. It is well known that one cannot master a language without grammar. Grammar is the backbone of any language. Without grammar there would be no structure to help you speak/write or listen/read. Successful communication is only possible if the form and function of linguistic utterances are firmly related to one another. Grammar has its own rules of use to formulate an easier-to-understand language. Like a tool, grammar formulates our thoughts and knowledge in a meaningful way. Every language has its own grammar. With grammar, we can quickly analyze whether there is any action in this text: (Present, past, future). Knowledge of grammar is an important prerequisite for mastering a foreign language. What’s most important is how teachers can make grammar lessons more interesting for students and thus promote grammar skills more successfully. Through this paper, we discuss a few important methods like (Interactive Grammar Exercises between students, Interactive Grammar Exercise between student to teacher, Grammar translation method, Audio -Visual Method, Deductive Method, Inductive Method). This paper is divided into two sections. In the first part, brief definitions and principles of these approaches will be provided. Then the possibility and the case of combination of this approach will be analyzed. In the last section of the paper, I would like to present a survey result conducted at my university on a few methods to quickly learn grammar in Foreign Language. We divided the Grammatical Skills in six Parts. 1.Grammatical Competence 2. Speaking Skills 3. Phonology 4. The syntax and the Semantics 5. Rule 6. Cognitive Function and conducted a survey among students. From our survey results, we can observe that phonology, speaking ability, syntax and semantics can be improved by inductive method, Audio-visual Method, and grammatical translation method, for grammar rules and cognitive functions we should choose IGE (teacher-student) method. and the IGE method (pupil-pupil). The study’s findings revealed, that the teacher delivery Methods should be blend or fusion based on the content of the Grammar.

Keywords: innovative method, grammatical skills, audio-visual, translation

Procedia PDF Downloads 77

382 A Peer-Produced Community of Learning: The Case of Second-Year Algerian Masters Students at a Distance

Authors: Nihad Alem

Abstract:

Nowadays, distance learning (DL) is widely perceived as a reformed type of education that takes advantage of technology to give more appealing opportunities especially for learners whose life conditions impede their attendance to regular classrooms however, creating interactional environment for students to expand their learning community and alleviate the feeling of loneliness and isolation should receive more attention when designing a distance learning course. This research aims to explore whether the audio/video peer learning can offer pedagogical add-ons to the Algerian distance learners and what are the pros and cons of its application as an educational experience in a synchronous environment mediated by Skype. Data were collected using video recordings of six sessions, reflective logs, and in-depth semi-structured interviews and will be analyzed by qualitatively identifying and measuring the three constitutional elements of the educational experience of peer learning namely the social presence, the cognitive presence, and the facilitation presence using a modified community of inquiry coding template. The findings from this study will provide recommendations for effective peer learning educational experience using the facilitation presence concept.

Keywords: audio/visual peer learning, community of inquiry, distance learning, facilitation presence

Procedia PDF Downloads 150

381 The Role of Smart Educational Aids in Learning Listening Among Pupils with Attention and Listening Problems

Authors: Sadeq Al Yaari, Muhammad Alkhunayn, Adham Al Yaari, Aayah Al Yaari, Montaha Al Yaari, Ayman Al Yaari, Sajedah Al Yaari, Fatehi Eissa

Abstract:

The recent rise of smart educational aids and the move away from traditional listening aids are leading to a fundamental shift in the way in which individuals with attention and listening problems (ALP) manipulate listening inputs and/or act appropriately to the spoken information presented to them. A total sample of twenty-six ALP pupils (m=20 and f=6) between 7-12 years old was selected from different strata based on gender, region and school. In the sample size, thirteen (10 males and 3 females) received the treatment in terms of smart classes provided with smart educational aids in a listening course that lasted for four months, while others did not (they studied the same course by the same instructor but in ordinary class). A pretest was administered to assess participants’ levels, and a posttest was given to evaluate their attention and listening comprehension performance, namely in phonetic and phonological tests with sociolinguistic themes that have been designed for this purpose. Test results were analyzed both psychoneurolinguistically and statistically. Results reveal a remarkable change in pupils’ behavioral listening where scores witnessed a significant difference in the performance of the experimental ALP group in the pretest compared to the posttest (Pupils performed better at the pretest-posttest on phonetics than at the two tests on phonology). It is concluded that smart educational aids designed for listening skills help not only increase the listening command of pupils with ALP to understand what they listen to but also develop their interactive listening capability and, at the same rate, are responsible for increasing concentrated and in-depth listening capacity. Plus, ALP pupils become able to grasp the audio content of text recordings, including educational audio recordings, news, oral stories and tales, views, spiritual/religious text and general knowledge. However, the pupils have not experienced individual smart audio-visual aids that connect listening to other language receptive and productive skills, which could be the future area of research.

Keywords: smart aids, attention, listening, problems

Procedia PDF Downloads 43

380 The Impact of Smart Educational Aids in Learning Listening Among Pupils with Attention and Listening Problems

Authors: Sadeq Al Yaari, Muhammad Alkhunayn, Adham Al Yaari, Ayah Al Yaari, Ayman Al Yaari, Montaha Al Yaari, Sajedah Al Yaari, Fatehi Eissa

Abstract:

The recent rise of smart educational aids and the move away from traditional listening aids are leading to a fundamental shift in the way in which individuals with attention and listening problems (ALP) manipulate listening inputs and/or act appropriately to the spoken information presented to them. A total sample of twenty-six ALP pupils (m=20 and f=6) between 7-12 years old was selected from different strata based on gender, region and school. In the sample size, thirteen (10 males and 3 females) received the treatment in terms of smart classes provided with smart educational aids in a listening course that lasted for four months, while others did not (they studied the same course by the same instructor but in ordinary class). A pretest was administered to assess participants’ levels, and a posttest was given to evaluate their attention and listening comprehension performance, namely in phonetic and phonological tests with sociolinguistic themes that have been designed for this purpose. Test results were analyzed both psychoneurolinguistically and statistically. Results reveal a remarkable change in pupils’ behavioral listening where scores witnessed a significant difference in the performance of the experimental ALP group in the pretest compared to the posttest (Pupils performed better at the pretest-posttest on phonetics than at the two tests on phonology). It is concluded that smart educational aids designed for listening skills help not only increase the listening command of pupils with ALP to understand what they listen to but also develop their interactive listening capability and, at the same rate, are responsible for increasing concentrated and in-depth listening capacity. Plus, ALP pupils become able to grasp the audio content of text recordings, including educational audio recordings, news, oral stories and tales, views, spiritual/religious text and general knowledge. However, the pupils have not experienced individual smart audio-visual aids that connect listening to other language receptive and productive skills, which could be the future area of research.

Keywords: smart educational aids, listening attention, pupils, problems

Procedia PDF Downloads 52

379 The Impact of Smart Educational Aids in Learning Listening Among Pupils with Attention and Listening Problems

Authors: Sadeq Al Yaari, Muhammad Alkhunayn, Aayah Al Yaari, Ayman Al Yaari, Montaha Al Yaari, Sajedah Al Yaari, Fatehi Eissa

Abstract:

The recent rise of smart educational aids and the move away from traditional listening aids are leading to a fundamental shift in the way in which individuals with attention and listening problems (ALP) manipulate listening inputs and/or act appropriately to the spoken information presented to them. A total sample of twenty-six ALP pupils (m=20 and f=6) between 7-12 years old was selected from different strata based on gender, region and school. In the sample size, thirteen (10 males and 3 females) received the treatment in terms of smart classes provided with smart educational aids in a listening course that lasted for four month-semester while others did not (they studied the same course by the same instructor but in ordinary class). A pretest was administered to assess participants’ levels, and a posttest was given to evaluate their attention and listening comprehension performance, namely in phonetic and phonological tests with sociolinguistic themes that have been designed for this purpose. Test results were analyzed both psychoneurolinguistically and statistically. Results reveal a remarkable change in pupils’ behavioral listening where scores witnessed a significant difference in the performance of the experimental ALP group in the pretest compared to the posttest (Pupils performed better at the pretest-posttest on phonetics than at the two tests on phonology). It is concluded that smart educational aids designed for listening skills help not only increase the listening command of pupils with ALP to understand what they listen to but also develop their interactive listening capability and, at the same rate, are responsible for increasing concentrated and in-depth listening capacity. Plus, ALP pupils become able to grasp the audio content of text recordings, including educational audio recordings, news, oral stories and tales, views, spiritual/religious text and general knowledge. However, the pupils have not experienced individual smart audio-visual aids that connect listening to other language receptive and productive skills, which could be the future area of research.

Keywords: language skills, implementing, listening skill, attention, smart aids

Procedia PDF Downloads 44

378 Heuristic Classification of Hydrophone Recordings

Authors: Daniel M. Wolff, Patricia Gray, Rafael de la Parra Venegas

Abstract:

An unsupervised machine listening system is constructed and applied to a dataset of 17,195 30-second marine hydrophone recordings. The system is then heuristically supplemented with anecdotal listening, contextual recording information, and supervised learning techniques to reduce the number of false positives. Features for classification are assembled by extracting the following data from each of the audio files: the spectral centroid, root-mean-squared values for each frequency band of a 10-octave filter bank, and mel-frequency cepstral coefficients in 5-second frames. In this way both time- and frequency-domain information are contained in the features to be passed to a clustering algorithm. Classification is performed using the k-means algorithm and then a k-nearest neighbors search. Different values of k are experimented with, in addition to different combinations of the available feature sets. Hypothesized class labels are 'primarily anthrophony' and 'primarily biophony', where the best class result conforming to the former label has 104 members after heuristic pruning. This demonstrates how a large audio dataset has been made more tractable with machine learning techniques, forming the foundation of a framework designed to acoustically monitor and gauge biological and anthropogenic activity in a marine environment.

Keywords: anthrophony, hydrophone, k-means, machine learning

Procedia PDF Downloads 170

377 Effects of Gym-Based and Audio-Visual Guided Home-Based Exercise Programmes on Some Anthropometric and Cardiovascular Parameters Among Overweight and Obese College Students

Authors: Abiodun Afolabi, Rufus Adesoji Adedoyin

Abstract:

This study investigated and compared the effects of gym-based exercise programme (GEBP) and audio-visual guided home-based exercise programme (AVGHBEP) on selected Anthropometric variables (Weight (W), Body Mass Index (BMI), Waist Circumference (WC), Hip Circumference (HC), Thigh Circumference (TC), Waist-Hip-Ratio (WHR), Waist-Height-Ratio (WHtR), Waist-Thigh-Ratio (WTR), Biceps Skinfold Thickness (BSFT), Triceps Skinfold Thickness (TSFT), Suprailliac Skinfold Thickness (SISFT), Subscapular Skinfold Thickness (SSSFT) and Percent Body Fat (PBF)); and Cardiovasular variables (Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP) and Heart Rate (HR)) of overweight and obese students of Federal College of Education (Special), Oyo, Oyo State, Nigeria, with a view to providing information and evidence for GBEP and AVGHBEP in reducing overweight and obesity for promoting cardiovascular fitness. Eighty overweight and obese students (BMI ≥ 25 Kg/m²) were involved in this pretest-posttest quasi experimental study. Participants were randomly assigned into GBEP (n = 40) and AVGBBEP (n = 40) groups. Anthropometric and cardiovascular variables were measured using a weighing scale, height meter, tape measure, skinfold caliper and electronic sphygmomanometer following standard protocols. GBEP and AVGHBEP were implemented following a circuit training (aerobic and resistance training) pattern with a duration of 40-60 minutes, thrice weekly for twelve weeks. GBEP consisted of gymnasium supervised exercise programme while AVGHBEP is a Visual Display guided exercise programme conducted at the home setting. Data were analyzed by Descriptive and Inferential Statistics. The mean ages of the participants were 22.55 ± 2.55 and 23.65 ± 2.89 years for the GBEP group and AVGHBEP group, respectively. Findings showed that in the GBEP group, there were significant reductions in anthropometric variables and adiposity measures of Weight, BMI, BSFT, TSFT, SISFT, SSSFT, WC, HC, TC, WHtR, and PBF at week 12 of the study. Similarly, in the AVGHBEP group, there were significant reductions in Weight, BMI, BSFT, TSFT, SISFT, SSSFT, WC, HC, TC, WHtR and PBF at the 12th week of intervention. Comparison of the effects of GEBP and AVGHBEP on anthropometric variables and measures of adiposity showed that there was no significant difference between the two groups in weight, BMI, BSFT, TSFT, SISFT, SSSFT, WC, HC, TC, WHR, WHtR, WTR and PBF between the two groups at week 12 of the study. Furthermore, findings on the effects of exercise on programmes on cardiovascular variables revealed that significant reductions occurred in SBP in GBEP group and AVGHBEP group respectively. Comparison of the effects of GBEP and AVGHBEP on cardiovascular variables showed that there was no significant difference in SBP, DBP and HR between the two groups at week 12 of the study. It was concluded that the Audio-Visual Guided Home-based Exercise Programme was as effective as the Gym-Based Exercise Programme in causing a significant reduction in anthropometric variables and body fat among college students who are overweight and obese over a period of twelve weeks. Both Gymnasium-Based Exercise Programme and Audio-Visual Guided Home-Based Exercise Programme led to significant reduction in Systolic Blood Pressure over a period of weeks. Audio-Visual Guided Home-Based Exercise Programme can, therefore, be used as an alternative therapy in the non-pharmacological management of people who are overweight and obese.

Keywords: gym-based exercises, audio-visual guided home-based exercises, anthropometric parameters, cardiovascular parameters, overweight students, obese students

Procedia PDF Downloads 37

376 Identification of Arglecins B and C and Actinofuranosin A from a Termite Gut-Associated Streptomyces Species

Authors: Christian A. Romero, Tanja Grkovic, John. R. J. French, D. İpek Kurtböke, Ronald J. Quinn

Abstract:

A high-throughput and automated 1H NMR metabolic fingerprinting dereplication approach was used to accelerate the discovery of unknown bioactive secondary metabolites. The applied dereplication strategy accelerated the discovery of natural products, provided rapid and competent identification and quantification of the known secondary metabolites and avoided time-consuming isolation procedures. The effectiveness of the technique was demonstrated by the isolation and elucidation of arglecins B (1), C (2) and actinofuranosin A (3) from a termite-gut associated Streptomyces sp. (USC 597) grown under solid state fermentation. The structures of these compounds were elucidated by extensive interpretation of 1H, 13C and 2D NMR spectroscopic data. These represent the first report of arglecin analogs isolated from a termite gut-associated Streptomyces species.

Keywords: actinomycetes, actinofuranosin, antibiotics, arglecins, NMR spectroscopy

Procedia PDF Downloads 61

375 Boundary Alert System for Powered Wheelchair in Confined Area Training

Authors: Tsoi Kim Ming, Yu King Pong

Abstract:

Background: With powered wheelchair, patients can travel more easily and conveniently. However, some patients suffer from other difficulties, such as visual impairment, cognitive disorder, or psychological issues, which make them unable to control powered wheelchair safely. Purpose: Therefore, those patients are required to complete a comprehensive driving training by therapists on confined area, which simulates narrow paths in daily live. During the training, therapists will give series of driving instruction to patients, which may be unaware of patients crossing out the boundary of area. To facilitate the training, it is needed to develop a device to provide warning to patients during training Method: We adopt LIDAR for distance sensing started from center of confined area. Then, we program the LIDAR with linear geometry to remember each side of the area. The LIDAR will sense the location of wheelchair continuously. Once the wheelchair is driven out of the boundary, audio alert will be given to patient. Result: Patients can pay their attention to the particular driving situation followed by audio alert during driving training, which can learn how to avoid out of boundary in similar situation next time. Conclusion: Instead of only instructed by therapist, the LIDAR can facilitate the powered wheelchair training by patients actively pay their attention to driving situation. After training, they are able to control the powered wheelchair safely when facing difficult and narrow path in real life.

Keywords: PWC, training, rehab, AT

Procedia PDF Downloads 105

374 The Role of Student Culture in Beginning Music Teachers’ Instruction in Urban School Settings

Authors: Kiana Williams

Abstract:

The purpose of this case study was to examine beginning music teachers’ perspectives of cultural relevance in relation to music instruction in urban school settings within a large Southwestern city. Research questions focused on the role of student culture in beginning music teachers’ instruction. Data were collected based on Seidman’s (2013) three interview series, consisting of audio recordings from two semi-structured individual interviews for each participant, a 15-20-minute video recording from each participant teaching in their classroom, and an audio recording of one focus group interview. Participants included three beginning music teachers currently employed in urban schools in a major metropolitan city in the Southern United States. In this study, a teacher was considered a beginning teacher if they had zero to three years of experience teaching music in urban school settings. The results revealed three broad themes related to connectivity and relatability, concerts, and differentiated instruction. Implications for current music educators as well as music teacher educators in higher education are included in this study. Further research should consider examining the effect of culturally relevant pedagogy on student retention in urban school music programs.

Keywords: culture, instruction, music, pedagogy, teacher, urban

Procedia PDF Downloads 139

373 Electrochemical Radiofrequency Scanning Tunneling Microscopy Measurements for Fingerprinting Single Electron Transfer Processes

Authors: Abhishek Kumar, Mohamed Awadein, Georg Gramse, Luyang Song, He Sun, Wolfgang Schofberger, Stefan Müllegger

Abstract:

Electron transfer is a crucial part of chemical reactions which drive everyday processes. With the help of an electro-chemical radio frequency scanning tunneling microscopy (EC-RF-STM) setup, we are observing single electron mediated oxidation-reduction processes in molecules like ferrocene and transition metal corroles. Combining the techniques of scanning microwave microscopy and cyclic voltammetry allows us to monitor such processes with attoampere sensitivity. A systematic study of such phenomena would be critical to understanding the nano-scale behavior of catalysts, molecular sensors, and batteries relevant to the development of novel material and energy applications.

Keywords: radiofrequency, STM, cyclic voltammetry, ferrocene

Procedia PDF Downloads 480

372 A Qualitative Study on Metacognitive Patterns among High and Low Performance Problem Based on Learning Groups

Authors: Zuhairah Abdul Hadi, Mohd Nazir bin Md. Zabit, Zuriadah Ismail

Abstract:

Metacognitive has been empirically evidenced to be one important element influencing learning outcomes. Expert learners engage in metacognition by monitoring and controlling their thinking, and listing, considering and selecting the best strategies to achieve desired goals. Studies also found that good critical thinkers engage in more metacognition and people tend to activate more metacognition when solving complex problems. This study extends past studies by performing a qualitative analysis to understand metacognitive patterns among two high and two low performing groups by carefully examining video and audio records taken during Problem-based learning activities. High performing groups are groups with majority members scored well in Watson Glaser II Critical Thinking Appraisal (WGCTA II) and academic achievement tests. Low performing groups are groups with majority members fail to perform in the two tests. Audio records are transcribed and analyzed using schemas adopted from past studies. Metacognitive statements are analyzed using three stages model and patterns of metacognitive are described by contexts, components, and levels for each high and low performing groups.

Keywords: academic achievement, critical thinking, metacognitive, problem-based learning

Procedia PDF Downloads 285