Search results for: optical musical recognition
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 3594

Search results for: optical musical recognition

3084 Investigation of Delivery of Triple Play Data in GE-PON Fiber to the Home Network

Authors: Ashima Anurag Sharma

Abstract:

Optical fiber based networks can deliver performance that can support the increasing demands for high speed connections. One of the new technologies that have emerged in recent years is Passive Optical Networks. This research paper is targeted to show the simultaneous delivery of triple play service (data, voice, and video). The comparison between various data rates is presented. It is demonstrated that as we increase the data rate, number of users to be decreases due to increase in bit error rate.

Keywords: BER, PON, TDMPON, GPON, CWDM, OLT, ONT

Procedia PDF Downloads 527
3083 Particle Dust Layer Density and the Optical Wavelength Absorption Relationship in Photovoltaic Module

Authors: M. Mesrouk, A. Hadj Arab

Abstract:

This work allows highlight the effect of dust on the absorption of the optical spectrum on the photovoltaic module, the effect of the particles dust presence on the photovoltaic modules have been a microscopic scale studied with COMSOL Multi-physic software simulation. In this paper, we have supposed the dust layer as a diffraction network repetitive optical structure characterized by the spacing between particle which represented by 'd' and the simulated structure (air-dust particle-glass). In this study we can observe the relationship between the wavelength and the particle spacing, the simulation shows us that the maximum wavelength transmission value corresponding, λ0 = 400nm, which represent the spacing value between the particles dust, d = 400 nm. In fact, we can observe that while increase dust layer density the wavelength transmission value decrease, there is a relationship between the density and wavelength value which can be absorbed in a dusty photovoltaic panel.

Keywords: dust effect, photovoltaic module, spectral absorption, wavelength transmission

Procedia PDF Downloads 463
3082 Evaluation of Features Extraction Algorithms for a Real-Time Isolated Word Recognition System

Authors: Tomyslav Sledevič, Artūras Serackis, Gintautas Tamulevičius, Dalius Navakauskas

Abstract:

This paper presents a comparative evaluation of features extraction algorithm for a real-time isolated word recognition system based on FPGA. The Mel-frequency cepstral, linear frequency cepstral, linear predictive and their cepstral coefficients were implemented in hardware/software design. The proposed system was investigated in the speaker-dependent mode for 100 different Lithuanian words. The robustness of features extraction algorithms was tested recognizing the speech records at different signals to noise rates. The experiments on clean records show highest accuracy for Mel-frequency cepstral and linear frequency cepstral coefficients. For records with 15 dB signal to noise rate the linear predictive cepstral coefficients give best result. The hard and soft part of the system is clocked on 50 MHz and 100 MHz accordingly. For the classification purpose, the pipelined dynamic time warping core was implemented. The proposed word recognition system satisfies the real-time requirements and is suitable for applications in embedded systems.

Keywords: isolated word recognition, features extraction, MFCC, LFCC, LPCC, LPC, FPGA, DTW

Procedia PDF Downloads 495
3081 Audio-Visual Recognition Based on Effective Model and Distillation

Authors: Heng Yang, Tao Luo, Yakun Zhang, Kai Wang, Wei Qin, Liang Xie, Ye Yan, Erwei Yin

Abstract:

Recent years have seen that audio-visual recognition has shown great potential in a strong noise environment. The existing method of audio-visual recognition has explored methods with ResNet and feature fusion. However, on the one hand, ResNet always occupies a large amount of memory resources, restricting the application in engineering. On the other hand, the feature merging also brings some interferences in a high noise environment. In order to solve the problems, we proposed an effective framework with bidirectional distillation. At first, in consideration of the good performance in extracting of features, we chose the light model, Efficientnet as our extractor of spatial features. Secondly, self-distillation was applied to learn more information from raw data. Finally, we proposed a bidirectional distillation in decision-level fusion. In more detail, our experimental results are based on a multi-model dataset from 24 volunteers. Eventually, the lipreading accuracy of our framework was increased by 2.3% compared with existing systems, and our framework made progress in audio-visual fusion in a high noise environment compared with the system of audio recognition without visual.

Keywords: lipreading, audio-visual, Efficientnet, distillation

Procedia PDF Downloads 134
3080 Surveillance of Super-Extended Objects: Bimodal Approach

Authors: Andrey V. Timofeev, Dmitry Egorov

Abstract:

This paper describes an effective solution to the task of a remote monitoring of super-extended objects (oil and gas pipeline, railways, national frontier). The suggested solution is based on the principle of simultaneously monitoring of seismoacoustic and optical/infrared physical fields. The principle of simultaneous monitoring of those fields is not new but in contrast to the known solutions the suggested approach allows to control super-extended objects with very limited operational costs. So-called C-OTDR (Coherent Optical Time Domain Reflectometer) systems are used to monitor the seismoacoustic field. Far-CCTV systems are used to monitor the optical/infrared field. A simultaneous data processing provided by both systems allows effectively detecting and classifying target activities, which appear in the monitored objects vicinity. The results of practical usage had shown high effectiveness of the suggested approach.

Keywords: C-OTDR monitoring system, bimodal processing, LPboost, SVM

Procedia PDF Downloads 469
3079 Characterization and Modification of the Optical Properties of Zirconia Ceramics for Aesthetic Dental Restorations

Authors: R. A. Shahmiri, O. Standard, J. Hart, C. C. Sorrell

Abstract:

Yttrium stabilized tetragonal zirconium polycrystalline (Y-TZP) has been used as a dental biomaterial. The strength and toughness of zirconia can be accounted for by its toughening mechanisms, such as crack deflection, zone shielding, contact shielding, and crack bridging. Prevention of crack propagation is of critical importance in high-fatigue situations, such as those encountered in mastication and para-function. However, the poor translucency of Y-TZP means that it may not meet the aesthetic requirements due to its white/grey appearance in polycrystalline form. To improve optical property of the Zirconia, precise evaluation of its refractive index is of significance. Zirconia`s optical properties need to be studied more in depth. Number of studies assumed, scattered light is isotropically distributed over all angles from biological media when defining optical parameters. Nevertheless, optical behaviour of real biological material depends on angular scattering of light by anisotropy material. Therefore, the average cosine of the scattering angle (which represent recovery phase function in the scattering angular distribution) usually characterized by anisotropy material. It has been identified that yttrium anti-sites present in the space charge layer have no significant role in the absorption of light in the visible range. Addition of cation dopant to polycrystalline zirconia results in segregate to grain boundaries and grain growth. Intrinsic and extrinsic properties of ZrO2 and their effect on optical properties need to be investigated. Intrinsic properties such as chemical composition, defect structure (oxygen vacancy), phase configuration (porosity, second phase) and distribution of phase need to be studied to comprehend their effect on refraction index, absorption/reflection and scattering. Extrinsic properties such as surface structure, thickness, underlying tooth structure, cement layer (type, thickness), and light source (natural, curing, artificial) of ZrO2 need to be studied to understand their effect on colour and translucency of material. This research reviewed effect of stabilization of tetragonal zirconia on optical property of zirconia for dental application.

Keywords: optical properties, zirconia dental biomaterial, chemical composition, phase composition

Procedia PDF Downloads 395
3078 Quality-Of-Service-Aware Green Bandwidth Allocation in Ethernet Passive Optical Network

Authors: Tzu-Yang Lin, Chuan-Ching Sue

Abstract:

Sleep mechanisms are commonly used to ensure the energy efficiency of each optical network unit (ONU) that concerns a single class delay constraint in the Ethernet Passive Optical Network (EPON). How long the ONUs can sleep without violating the delay constraint has become a research problem. Particularly, we can derive an analytical model to determine the optimal sleep time of ONUs in every cycle without violating the maximum class delay constraint. The bandwidth allocation considering such optimal sleep time is called Green Bandwidth Allocation (GBA). Although the GBA mechanism guarantees that the different class delay constraints do not violate the maximum class delay constraint, packets with a more relaxed delay constraint will be treated as those with the most stringent delay constraint and may be sent early. This means that the ONU will waste energy in active mode to send packets in advance which did not need to be sent at the current time. Accordingly, we proposed a QoS-aware GBA using a novel intra-ONU scheduling to control the packets to be sent according to their respective delay constraints, thereby enhancing energy efficiency without deteriorating delay performance. If packets are not explicitly classified but with different packet delay constraints, we can modify the intra-ONU scheduling to classify packets according to their packet delay constraints rather than their classes. Moreover, we propose the switchable ONU architecture in which the ONU can switch the architecture according to the sleep time length, thus improving energy efficiency in the QoS-aware GBA. The simulation results show that the QoS-aware GBA ensures that packets in different classes or with different delay constraints do not violate their respective delay constraints and consume less power than the original GBA.

Keywords: Passive Optical Networks, PONs, Optical Network Unit, ONU, energy efficiency, delay constraint

Procedia PDF Downloads 284
3077 Matter of the Artistic Content of Music (The Symphonies of Jovdat Hajiyev and the Creativity of Fikrat Amirov)

Authors: Vusala Amirbayova Yusif

Abstract:

As we know the formation of new composer’s schools is determined not with the national belonging, but firstly with the development of the national spirit and eternal traditions. The formation of ancient musical traditions with the classical European genre and forms stand in the center of music art with Azerbaijani written tradition. Though this duty is actual for the neighboring eastern countries (for example, Iran, Turkey, Arabian countries, India), it has not been realized in the same level in real creative practice. It is necessary to mention that, the symphonic mughams formed from the joining of Eastern mugham-magam and classical music forms of Western symphony have been greeted with amazement and it was valuable practice in national composer’s art. It is true that, the new examples of the genre were formed in the next years (S.Alasgarov, T.Bakikhanov and etc.) and F.Amirov came back to the genre of symphonic mugham as he created Gulustani-Bayati-Shiraz”in,-1970. New tendency has begun to show itself in the development of national symphonic genre. The new attitude for mugham traditions showed itself in symphonic creative work of A.Malikov, A.Alizada, M.Guliyev,V.Adigozalov. The voice of mugham mentality has entered the depth of the Azerbaijan symphony, has determined the meditation spirit, dramatist process and content. This movement has formed the new notion of “mugham mphonism” with new meaning by our musicologists. In the modern musical science, in addition to traditional methods and procedures, the formation of new theories and approaches caused to the further increase of scientific interest towards the problem of artistic content in the art of composition. The initiative has been made to have overall look on this important subject as an example of the creativity of FikratAmirov (1922-1984)and JovdatHaciyev(1917-2000), the great composers of Azerbaijan and to analyze his some symphonic works from this point of view in the current report. In this connection, main provisions of the new theoretical concept that were comprehensively annotated in the article of Russian musicologist V. Kholopova named "Special and non-special musical content" were used.

Keywords: content, composer, music, mugham symphony

Procedia PDF Downloads 487
3076 Automatic Landmark Selection Based on Feature Clustering for Visual Autonomous Unmanned Aerial Vehicle Navigation

Authors: Paulo Fernando Silva Filho, Elcio Hideiti Shiguemori

Abstract:

The selection of specific landmarks for an Unmanned Aerial Vehicles’ Visual Navigation systems based on Automatic Landmark Recognition has significant influence on the precision of the system’s estimated position. At the same time, manual selection of the landmarks does not guarantee a high recognition rate, which would also result on a poor precision. This work aims to develop an automatic landmark selection that will take the image of the flight area and identify the best landmarks to be recognized by the Visual Navigation Landmark Recognition System. The criterion to select a landmark is based on features detected by ORB or AKAZE and edges information on each possible landmark. Results have shown that disposition of possible landmarks is quite different from the human perception.

Keywords: clustering, edges, feature points, landmark selection, X-means

Procedia PDF Downloads 279
3075 Bit Error Rate Monitoring for Automatic Bias Control of Quadrature Amplitude Modulators

Authors: Naji Ali Albakay, Abdulrahman Alothaim, Isa Barshushi

Abstract:

The most common quadrature amplitude modulator (QAM) applies two Mach-Zehnder Modulators (MZM) and one phase shifter to generate high order modulation format. The bias of MZM changes over time due to temperature, vibration, and aging factors. The change in the biasing causes distortion to the generated QAM signal which leads to deterioration of bit error rate (BER) performance. Therefore, it is critical to be able to lock MZM’s Q point to the required operating point for good performance. We propose a technique for automatic bias control (ABC) of QAM transmitter using BER measurements and gradient descent optimization algorithm. The proposed technique is attractive because it uses the pertinent metric, BER, which compensates for bias drifting independently from other system variations such as laser source output power. The proposed scheme performance and its operating principles are simulated using OptiSystem simulation software for 4-QAM and 16-QAM transmitters.

Keywords: automatic bias control, optical fiber communication, optical modulation, optical devices

Procedia PDF Downloads 189
3074 Optical Properties of Tetrahydrofuran Clathrate Hydrates at Terahertz Frequencies

Authors: Hyery Kang, Dong-Yeun Koh, Yun-Ho Ahn, Huen Lee

Abstract:

Terahertz time-domain spectroscopy (THz-TDS) was used to observe the THF clathrate hydrate system with dosage of polyvinylpyrrolidone (PVP) with three different average molecular weights (10,000 g/mol, 40,000 g/mol, 360,000 g/mol). Distinct footprints of phase transition in the THz region (0.4 - 2.2 THz) were analyzed and absorption coefficients and complex refractive indices are obtained and compared in the temperature range of 253 K to 288 K. Along with the optical properties, ring breathing and stretching modes for different molecular weights of PVP in THF hydrate are analyzed by Raman spectroscopy.

Keywords: clathrate hydrate, terahertz, polyvinylpyrrolidone (PVP), THz-TDS, inhibitor

Procedia PDF Downloads 379
3073 Optical Vortex in Asymmetric Arcs of Rotating Intensity

Authors: Mona Mihailescu, Rebeca Tudor, Irina A. Paun, Cristian Kusko, Eugen I. Scarlat, Mihai Kusko

Abstract:

Specific intensity distributions in the laser beams are required in many fields: optical communications, material processing, microscopy, optical tweezers. In optical communications, the information embedded in specific beams and the superposition of multiple beams can be used to increase the capacity of the communication channels, employing spatial modulation as an additional degree of freedom, besides already available polarization and wavelength multiplexing. In this regard, optical vortices present interest due to their potential to carry independent data which can be multiplexed at the transmitter and demultiplexed at the receiver. Also, in the literature were studied their combinations: 1) axial or perpendicular superposition of multiple optical vortices or 2) with other laser beam types: Bessel, Airy. Optical vortices, characterized by stationary ring-shape intensity and rotating phase, are achieved using computer generated holograms (CGH) obtained by simulating the interference between a tilted plane wave and a wave passing through a helical phase object. Here, we propose a method to combine information through the reunion of two CGHs. One is obtained using the helical phase distribution, characterized by its topological charge, m. The other is obtained using conical phase distribution, characterized by its radial factor, r0. Each CGH is obtained using plane wave with different tilts: km and kr for CGH generated from helical phase object and from conical phase object, respectively. These reunions of two CGHs are calculated to be phase optical elements, addressed on the liquid crystal display of a spatial light modulator, to optically process the incident beam for investigations of the diffracted intensity pattern in far field. For parallel reunion of two CGHs and high values of the ratio between km and kr, the bright ring from the first diffraction order, specific for optical vortices, is changed in an asymmetric intensity pattern: a number of circle arcs. Both diffraction orders (+1 and -1) are asymmetrical relative to each other. In different planes along the optical axis, it is observed that this asymmetric intensity pattern rotates around its centre: in the +1 diffraction order the rotation is anticlockwise and in the -1 diffraction order, the rotation is clockwise. The relation between m and r0 controls the diameter of the circle arcs and the ratio between km and kr controls the number of arcs. For perpendicular reunion of the two CGHs and low values of the ratio between km and kr, the optical vortices are multiplied and focalized in different planes, depending on the radial parameter. The first diffraction order contains information about both phase objects. It is incident on the phase masks placed at the receiver, computed using the opposite values for topological charge or for the radial parameter and displayed successively. In all, the proposed method is exploited in terms of constructive parameters, for the possibility offered by the combination of different types of beams which can be used in robust optical communications.

Keywords: asymmetrical diffraction orders, computer generated holograms, conical phase distribution, optical vortices, spatial light modulator

Procedia PDF Downloads 310
3072 Sports Fans and Non-Interested Public Recognition of the Problems of Sports in Egypt through Caricature

Authors: Alaaeldin Hamdy Ahmed Mohammed

Abstract:

Introduction: This study examines sports’ fans and non-interested public perception and recognition of the problems that have negative impacts upon the Egyptian sports, particularly football, through caricatures. Eight caricature paintings were designed to express eight problems affecting the Egyptian sports and its development. These paintings were distributed on two groups of the fans and the non-interested public. Methods: The study was limited to eight caricatures representing the eight issues which are: the impact of stopping the sports activity on athletes, the effect of clubs’ disagreement, fanaticism between the members of the ultras of different clubs, the negative impact of the mingling of politics into sports, the negative role of the clubs affects the professionalism of the promising players, the conflict between the national organization responsible for sports, the breaking in of the fans to the playgrounds, the impact of the lack of planning on the national team. The Results: The results showed that both sports fans and those who are not interested in sports recognized the problems that the caricatures refer to and criticizes exaggeration although the rate was higher for the fans. These caricatures contributed also in their recognition of the danger of the negative impact of these problems on the Egyptian sports, particularly football which is the most common at the Egyptian sports fans. Discussion: This finding echoes the conclusion that caricatures are distinctive in the adults’ facial stimuli that are either systematically exaggerated recognition of them.

Keywords: caricature, fans, football, sports

Procedia PDF Downloads 317
3071 Low-Surface Roughness and High Optical Quality CdS Thin Film Deposited on Heated Substrate Using Room-Temperature Chemical Solution

Authors: A. Elsayed, M. H. Dewaidar, M. Ghali, M. Elkemary

Abstract:

The high production cost of the conventional solar cells requires the search for economic methods suitable for solar energy conversion. Cadmium Sulfide (CdS) is one of the most important semiconductors used in photovoltaics, especially in large area solar cells; and can be prepared in a thin film form by a wide variety of deposition techniques. The preparation techniques include vacuum evaporation, sputtering and molecular beam epitaxy. Other techniques, based on chemical solutions, are also used for depositing CdS films with dramatically low-cost compared to other vacuum-based methods. Although this technique is widely used during the last decades, due to simplicity and low-deposition temperature (~100°C), there is still a strong need for more information on the growth process and its relation with the quality of the deposited films. Here, we report on deposition of high-quality CdS thin films; with low-surface roughness ( < 3.0 nm) and sharp optical absorption edge; on low-temperature glass substrates (70°C) using a new method based on the room-temperature chemical solution. In this method, a mixture solution of cadmium acetate and thiourea at room temperature was used under special growth conditions for deposition of CdS films. X-ray diffraction (XRD) measurements were used to examine the crystal structure properties of the deposited CdS films. In addition, UV-VIS transmittance and low-temperature (4K) photoluminescence (PL) measurements were performed for quantifying optical properties of the deposited films. The deposited films show high optical quality as confirmed by observation of both, sharp edge in the transmittance spectra and strong PL intensity at room temperature. Furthermore, we found a strong effect of the growth conditions on the optical band gap of the deposited films; where remarkable red-shift in the absorption edge with temperature is clearly seen in both transmission and PL spectra. Such tuning of both optical band gap of the deposited CdS films can be utilized for tuning the electronic bands' alignments between CdS and other light-harvesting materials, like CuInGaSe or CdTe, for potential improvement in the efficiency of solar cells devices based on these heterostructures.

Keywords: chemical deposition, CdS, optical properties, surface, thin film

Procedia PDF Downloads 162
3070 A Speeded up Robust Scale-Invariant Feature Transform Currency Recognition Algorithm

Authors: Daliyah S. Aljutaili, Redna A. Almutlaq, Suha A. Alharbi, Dina M. Ibrahim

Abstract:

All currencies around the world look very different from each other. For instance, the size, color, and pattern of the paper are different. With the development of modern banking services, automatic methods for paper currency recognition become important in many applications like vending machines. One of the currency recognition architecture’s phases is Feature detection and description. There are many algorithms that are used for this phase, but they still have some disadvantages. This paper proposes a feature detection algorithm, which merges the advantages given in the current SIFT and SURF algorithms, which we call, Speeded up Robust Scale-Invariant Feature Transform (SR-SIFT) algorithm. Our proposed SR-SIFT algorithm overcomes the problems of both the SIFT and SURF algorithms. The proposed algorithm aims to speed up the SIFT feature detection algorithm and keep it robust. Simulation results demonstrate that the proposed SR-SIFT algorithm decreases the average response time, especially in small and minimum number of best key points, increases the distribution of the number of best key points on the surface of the currency. Furthermore, the proposed algorithm increases the accuracy of the true best point distribution inside the currency edge than the other two algorithms.

Keywords: currency recognition, feature detection and description, SIFT algorithm, SURF algorithm, speeded up and robust features

Procedia PDF Downloads 235
3069 Active Surface Tracking Algorithm for All-Fiber Common-Path Fourier-Domain Optical Coherence Tomography

Authors: Bang Young Kim, Sang Hoon Park, Chul Gyu Song

Abstract:

A conventional optical coherence tomography (OCT) system has limited imaging depth, which is 1-2 mm, and suffers unwanted noise such as speckle noise. The motorized-stage-based OCT system, using a common-path Fourier-domain optical coherence tomography (CP-FD-OCT) configuration, provides enhanced imaging depth and less noise so that we can overcome these limitations. Using this OCT systems, OCT images were obtained from an onion, and their subsurface structure was observed. As a result, the images obtained using the developed motorized-stage-based system showed enhanced imaging depth than the conventional system, since it is real-time accurate depth tracking. Consequently, the developed CP-FD-OCT systems and algorithms have good potential for the further development of endoscopic OCT for microsurgery.

Keywords: common-path OCT, FD-OCT, OCT, tracking algorithm

Procedia PDF Downloads 380
3068 An Ensemble-based Method for Vehicle Color Recognition

Authors: Saeedeh Barzegar Khalilsaraei, Manoocheher Kelarestaghi, Farshad Eshghi

Abstract:

The vehicle color, as a prominent and stable feature, helps to identify a vehicle more accurately. As a result, vehicle color recognition is of great importance in intelligent transportation systems. Unlike conventional methods which use only a single Convolutional Neural Network (CNN) for feature extraction or classification, in this paper, four CNNs, with different architectures well-performing in different classes, are trained to extract various features from the input image. To take advantage of the distinct capability of each network, the multiple outputs are combined using a stack generalization algorithm as an ensemble technique. As a result, the final model performs better than each CNN individually in vehicle color identification. The evaluation results in terms of overall average accuracy and accuracy variance show the proposed method’s outperformance compared to the state-of-the-art rivals.

Keywords: Vehicle Color Recognition, Ensemble Algorithm, Stack Generalization, Convolutional Neural Network

Procedia PDF Downloads 85
3067 Sarcasm Recognition System Using Hybrid Tone-Word Spotting Audio Mining Technique

Authors: Sandhya Baskaran, Hari Kumar Nagabushanam

Abstract:

Sarcasm sentiment recognition is an area of natural language processing that is being probed into in the recent times. Even with the advancements in NLP, typical translations of words, sentences in its context fail to provide the exact information on a sentiment or emotion of a user. For example, if something bad happens, the statement ‘That's just what I need, great! Terrific!’ is expressed in a sarcastic tone which could be misread as a positive sign by any text-based analyzer. In this paper, we are presenting a unique real time ‘word with its tone’ spotting technique which would provide the sentiment analysis for a tone or pitch of a voice in combination with the words being expressed. This hybrid approach increases the probability for identification of special sentiment like sarcasm much closer to the real world than by mining text or speech individually. The system uses a tone analyzer such as YIN-FFT which extracts pitch segment-wise that would be used in parallel with a speech recognition system. The clustered data is classified for sentiments and sarcasm score for each of it determined. Our Simulations demonstrates the improvement in f-measure of around 12% compared to existing detection techniques with increased precision and recall.

Keywords: sarcasm recognition, tone-word spotting, natural language processing, pitch analyzer

Procedia PDF Downloads 293
3066 All-Silicon Raman Laser with Quasi-Phase-Matched Structures and Resonators

Authors: Isao Tomita

Abstract:

The principle of all-silicon Raman lasers for an output wavelength of 1.3 μm is presented, which employs quasi-phase-matched structures and resonators to enhance the output power. 1.3-μm laser beams for GE-PONs in FTTH systems generated from a silicon device are very important because such a silicon device can be monolithically integrated with the silicon planar lightwave circuits (Si PLCs) used in the GE-PONs. This reduces the device fabrication processes and time and also optical losses at the junctions between optical waveguides of the Si PLCs and Si laser devices when compared with 1.3-μm III-V semiconductor lasers set on the Si PLCs employed at present. We show that the quasi-phase-matched Si Raman laser with resonators can produce about 174 times larger laser power at 1.3 μm (at maximum) than that without resonators for a Si waveguide of Raman gain 20 cm/GW and optical loss 1.2 dB/cm, pumped at power 10 mW, where the length of the waveguide is 3 mm and its cross-section is (1.5 μm)2.

Keywords: All-Silicon Raman Laser, FTTH, GE-PON, Quasi-Phase-Matched Structure, resonator

Procedia PDF Downloads 253
3065 Finding a Paraguayan Voice: The Indigenous Language Guarani in Performances of Paraguayan Female Singers

Authors: Romy Martinez

Abstract:

This paper focuses on the use of the indigenous language Guarani in Paraguayan popular song and on some key interpreters born between the 1930s and 1980s. It analyses two representative musical genres of Paraguay, the Polka Paraguaya and Guarania. The lyrics of these genres follow one of four poetic-linguistic forms: to be entirely in Guarani, entirely in Spanish, bilingual (alternating verses in Guarani and Spanish), or in Jopará; the last being a form where words of both languages may be mixed in a single verse. Through these forms, the lyrics alternate and combine the indigenous voice with the one introduced with colonisation, in turn reflecting how Guarani seems to constantly transit, to and from, between a position of disdain and of value within Paraguayan society. Through analysing recordings of Polkas, Paraguayas, and Guaranias, it identifies three styles of singing adopted by female singers who include these genres in their repertoires, namely Paraguayan classical folk, Paraguayan folk, and Paraguayan pop-folk. This analysis is informed by a pilot study which consisted of online interviews with several Paraguayan artists, revealing significant aspects of their backgrounds and musical influences. In addition, it draws on autoethnographic approaches, building on the experience of the music researcher and singer. From a decolonising perspective, the paper brings together the distinctive voices and sounds expressed in popular songs from a marginalised country, language, and gender.

Keywords: female singers, Guarani, Paraguayan song, performance

Procedia PDF Downloads 201
3064 Intelligent Campus Monitoring: YOLOv8-Based High-Accuracy Activity Recognition

Authors: A. Degale Desta, Tamirat Kebamo

Abstract:

Background: Recent advances in computer vision and pattern recognition have significantly improved activity recognition through video analysis, particularly with the application of Deep Convolutional Neural Networks (CNNs). One-stage detectors now enable efficient video-based recognition by simultaneously predicting object categories and locations. Such advancements are highly relevant in educational settings where CCTV surveillance could automatically monitor academic activities, enhancing security and classroom management. However, current datasets and recognition systems lack the specific focus on campus environments necessary for practical application in these settings.Objective: This study aims to address this gap by developing a dataset and testing an automated activity recognition system specifically tailored for educational campuses. The EthioCAD dataset was created to capture various classroom activities and teacher-student interactions, facilitating reliable recognition of academic activities using deep learning models. Method: EthioCAD, a novel video-based dataset, was created with a design science research approach to encompass teacher-student interactions across three domains and 18 distinct classroom activities. Using the Roboflow AI framework, the data was processed, with 4.224 KB of frames and 33.485 MB of images managed for frame extraction, labeling, and organization. The Ultralytics YOLOv8 model was then implemented within Google Colab to evaluate the dataset’s effectiveness, achieving high mean Average Precision (mAP) scores. Results: The YOLOv8 model demonstrated robust activity recognition within campus-like settings, achieving an mAP50 of 90.2% and an mAP50-95 of 78.6%. These results highlight the potential of EthioCAD, combined with YOLOv8, to provide reliable detection and classification of classroom activities, supporting automated surveillance needs on educational campuses. Discussion: The high performance of YOLOv8 on the EthioCAD dataset suggests that automated activity recognition for surveillance is feasible within educational environments. This system addresses current limitations in campus-specific data and tools, offering a tailored solution for academic monitoring that could enhance the effectiveness of CCTV systems in these settings. Conclusion: The EthioCAD dataset, alongside the YOLOv8 model, provides a promising framework for automated campus activity recognition. This approach lays the groundwork for future advancements in CCTV-based educational surveillance systems, enabling more refined and reliable monitoring of classroom activities.

Keywords: deep CNN, EthioCAD, deep learning, YOLOv8, activity recognition

Procedia PDF Downloads 10
3063 Strong Down-Conversion Emission of Sm3+ Doped Borotellurite Glass under the 480nm Excitation Wavelength

Authors: M. R. S. Nasuha, K. Azman, H. Azhan, S. A. Senawi, A. Mardhiah

Abstract:

Studies on Samarium doped glasses possess lot of interest due to their potential applications for high-density optical memory, optical communication device, the design of laser and color display etc. Sm3+ doped borotellurite glasses of the system (70-x) TeO2-20B2O3-10ZnO-xSm2O3 (where x = 0.0, 0.5, 1.0, 1.5, 2.0 and 2.5 mol%) have been prepared using melt-quenching method. Their physical properties such as density, molar volume and oxygen packing density as well as the optical measurements by mean of their absorption and emission characteristic have been carried out at room temperature using UV/VIS and photoluminescence spectrophotometer. The results of physical properties are found to vary with respect to Sm3+ ions content. Meanwhile, three strong absorption peaks are observed and are well resolved in the ultra violet and visible regions due to transitions between the ground state and various excited state of Sm3+ ions. Thus, the photoluminescence spectra exhibit four emission bands from the initial state, which correspond to the 4G5/2 → 6H5/2, 4G5/2 → 6H7/2, 4G5/2 → 6H9/2 and 4G5/2 → 6H11/2 fluorescence transitions at 562 nm, 599 nm, 645 nm and 706 nm respectively.

Keywords: absorption, borotellurite, down-conversion, emission

Procedia PDF Downloads 684
3062 Musicals in Film Adaptation in Bollywood with Special Reference to Basu Bhattacharya's Film Teesari Kasam

Authors: Gokul G. Kshirsagar

Abstract:

Native folk theatre and folk songs have a significant influence on the origin and development of Indian cinema. Therefore, the presence of songs and music has been an integral part and special characteristics of Indian cinema which is popularly known as Bollywood. An Indian cinema without songs, either in Hindi or other regional languages, is simply unimaginable. The present paper, in the first part, attempts to explain the use and need of musical songs and also the psychology of Indian audience in this respect with reference to some of the films which give primary importance to songs. In the second part, the paper tries to situate the present study in the context by referring to the Hindi language drama film Teesari Kasam directed by Basu Bhattacharys. The film is based on the Hindi novelist Phanishwarnath Renu’s short story Teesari Kasam (Mare Gaye Gulfam) in this adapted film, the director has made use of eight songs, but these songs are the extensive versions of the songs as used in the original story. Thus, the main aim of the paper is to underscore the fact that through artistic use of the musical, the director has succeeded in transforming the central feelings conveyed in the original story. Eventually, through the present study of the film adaptation, the relevance of songs in films will be illustrated and understood.

Keywords: Bollywood, folk theatre, folk songs, film adaptation

Procedia PDF Downloads 219
3061 Human-Machine Cooperation in Facial Comparison Based on Likelihood Scores

Authors: Lanchi Xie, Zhihui Li, Zhigang Li, Guiqiang Wang, Lei Xu, Yuwen Yan

Abstract:

Image-based facial features can be classified into category recognition features and individual recognition features. Current automated face recognition systems extract a specific feature vector of different dimensions from a facial image according to their pre-trained neural network. However, to improve the efficiency of parameter calculation, an algorithm generally reduces the image details by pooling. The operation will overlook the details concerned much by forensic experts. In our experiment, we adopted a variety of face recognition algorithms based on deep learning, compared a large number of naturally collected face images with the known data of the same person's frontal ID photos. Downscaling and manual handling were performed on the testing images. The results supported that the facial recognition algorithms based on deep learning detected structural and morphological information and rarely focused on specific markers such as stains and moles. Overall performance, distribution of genuine scores and impostor scores, and likelihood ratios were tested to evaluate the accuracy of biometric systems and forensic experts. Experiments showed that the biometric systems were skilled in distinguishing category features, and forensic experts were better at discovering the individual features of human faces. In the proposed approach, a fusion was performed at the score level. At the specified false accept rate, the framework achieved a lower false reject rate. This paper contributes to improving the interpretability of the objective method of facial comparison and provides a novel method for human-machine collaboration in this field.

Keywords: likelihood ratio, automated facial recognition, facial comparison, biometrics

Procedia PDF Downloads 130
3060 Investigating Activity Recognition Using 9-Axis Sensors and Filters in Wearable Devices

Authors: Jun Gil Ahn, Jong Kang Park, Jong Tae Kim

Abstract:

In this paper, we analyze major components of activity recognition (AR) in wearable device with 9-axis sensors and sensor fusion filters. 9-axis sensors commonly include 3-axis accelerometer, 3-axis gyroscope and 3-axis magnetometer. We chose sensor fusion filters as Kalman filter and Direction Cosine Matrix (DCM) filter. We also construct sensor fusion data from each activity sensor data and perform classification by accuracy of AR using Naïve Bayes and SVM. According to the classification results, we observed that the DCM filter and the specific combination of the sensing axes are more effective for AR in wearable devices while classifying walking, running, ascending and descending.

Keywords: accelerometer, activity recognition, directiona cosine matrix filter, gyroscope, Kalman filter, magnetometer

Procedia PDF Downloads 333
3059 Electronic and Optical Properties of YNi4Si-Type DyNi4Si Compound: A Full Potential Study

Authors: Dinesh Kumar Maurya, Sapan Mohan Saini

Abstract:

A theoretical formalism to calculate the structural, electronic and optical properties of orthorhombic crystals from first principle calculations is described. This is applied first time to new YNi4Si-type DyNi4Si compound. Calculations are performed using full-potential augmented plane wave (FPLAPW) method in the framework of density functional theory (DFT). The Coulomb corrected local-spin density approximation (LSDA+U) in the self-interaction correction (SIC) has been used for exchange-correlation potential. Our optimized results of lattice parameters show good agreement to the previously reported experimental study. Analysis of the calculated band structure of DyNi4Si compound demonstrates their metallic character. We found Ni-3d states mainly contribute to density of states from -5.0 eV to the Fermi level while the Dy-f states peak stands tall in comparison to the small contributions made by the Ni-d and R-d states above Fermi level, which is consistent with experiment, in DNi4Si compound. Our calculated optical conductivity compares well with the experimental data and the results are analyzed in the light of band-to-band transitions. We also report the frequency-dependent refractive index n(ω) and the extinction coefficient k(ω) of the compound.

Keywords: band structure, density of states, optical properties, LSDA+U approximation, YNi4Si- type DyNi4Si compound

Procedia PDF Downloads 349
3058 Facial Emotion Recognition with Convolutional Neural Network Based Architecture

Authors: Koray U. Erbas

Abstract:

Neural networks are appealing for many applications since they are able to learn complex non-linear relationships between input and output data. As the number of neurons and layers in a neural network increase, it is possible to represent more complex relationships with automatically extracted features. Nowadays Deep Neural Networks (DNNs) are widely used in Computer Vision problems such as; classification, object detection, segmentation image editing etc. In this work, Facial Emotion Recognition task is performed by proposed Convolutional Neural Network (CNN)-based DNN architecture using FER2013 Dataset. Moreover, the effects of different hyperparameters (activation function, kernel size, initializer, batch size and network size) are investigated and ablation study results for Pooling Layer, Dropout and Batch Normalization are presented.

Keywords: convolutional neural network, deep learning, deep learning based FER, facial emotion recognition

Procedia PDF Downloads 273
3057 Random Subspace Neural Classifier for Meteor Recognition in the Night Sky

Authors: Carlos Vera, Tetyana Baydyk, Ernst Kussul, Graciela Velasco, Miguel Aparicio

Abstract:

This article describes the Random Subspace Neural Classifier (RSC) for the recognition of meteors in the night sky. We used images of meteors entering the atmosphere at night between 8:00 p.m.-5: 00 a.m. The objective of this project is to classify meteor and star images (with stars as the image background). The monitoring of the sky and the classification of meteors are made for future applications by scientists. The image database was collected from different websites. We worked with RGB-type images with dimensions of 220x220 pixels stored in the BitMap Protocol (BMP) format. Subsequent window scanning and processing were carried out for each image. The scan window where the characteristics were extracted had the size of 20x20 pixels with a scanning step size of 10 pixels. Brightness, contrast and contour orientation histograms were used as inputs for the RSC. The RSC worked with two classes and classified into: 1) with meteors and 2) without meteors. Different tests were carried out by varying the number of training cycles and the number of images for training and recognition. The percentage error for the neural classifier was calculated. The results show a good RSC classifier response with 89% correct recognition. The results of these experiments are presented and discussed.

Keywords: contour orientation histogram, meteors, night sky, RSC neural classifier, stars

Procedia PDF Downloads 138
3056 Healthcare-SignNet: Advanced Video Classification for Medical Sign Language Recognition Using CNN and RNN Models

Authors: Chithra A. V., Somoshree Datta, Sandeep Nithyanandan

Abstract:

Sign Language Recognition (SLR) is the process of interpreting and translating sign language into spoken or written language using technological systems. It involves recognizing hand gestures, facial expressions, and body movements that makeup sign language communication. The primary goal of SLR is to facilitate communication between hearing- and speech-impaired communities and those who do not understand sign language. Due to the increased awareness and greater recognition of the rights and needs of the hearing- and speech-impaired community, sign language recognition has gained significant importance over the past 10 years. Technological advancements in the fields of Artificial Intelligence and Machine Learning have made it more practical and feasible to create accurate SLR systems. This paper presents a distinct approach to SLR by framing it as a video classification problem using Deep Learning (DL), whereby a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) has been used. This research targets the integration of sign language recognition into healthcare settings, aiming to improve communication between medical professionals and patients with hearing impairments. The spatial features from each video frame are extracted using a CNN, which captures essential elements such as hand shapes, movements, and facial expressions. These features are then fed into an RNN network that learns the temporal dependencies and patterns inherent in sign language sequences. The INCLUDE dataset has been enhanced with more videos from the healthcare domain and the model is evaluated on the same. Our model achieves 91% accuracy, representing state-of-the-art performance in this domain. The results highlight the effectiveness of treating SLR as a video classification task with the CNN-RNN architecture. This approach not only improves recognition accuracy but also offers a scalable solution for real-time SLR applications, significantly advancing the field of accessible communication technologies.

Keywords: sign language recognition, deep learning, convolution neural network, recurrent neural network

Procedia PDF Downloads 27
3055 Characterization of Chemically Deposited CdS Thin Films Annealed in Different Atmospheres

Authors: J. Pantoja Enríquez, G. P. Hernández, G. I. Duharte, X. Mathew, J. Moreira, P. J. Sebastian

Abstract:

Cadmium sulfide films were deposited onto glass substrates by chemical bath deposition (CBD) from a bath containing cadmium acetate, ammonium acetate, thiourea, and ammonium hydroxide. The CdS thin films were annealed in air, argon, hydrogen and nitrogen for 1 h at various temperatures (300, 350, 400, 450 and 500 °C). The changes in optical and electrical properties of annealed treated CdS thin films were analyzed. The results showed that, the band-gap and resistivity depend on the post-deposition annealing atmosphere and temperatures. Thus, it was found that these properties of the films, were found to be affected by various processes with opposite effects, some beneficial and others unfavorable. The energy gap and resistivity for different annealing atmospheres was seen to oscillate by thermal annealing. Recrystallization, oxidation, surface passivation, sublimation and materials evaporation were found the main factors of the heat-treatment process responsible for this oscillating behavior. Annealing over 400 °C was seen to degrade the optical and electrical properties of the film.

Keywords: cds, thin films, annealing, optical, electrical properties

Procedia PDF Downloads 510