Search results for: biological data mining
25663 A Study of Safety of Data Storage Devices of Graduate Students at Suan Sunandha Rajabhat University
Authors: Komol Phaisarn, Natcha Wattanaprapa
Abstract:
This survey study examines the safety of the data storage devices used by graduate students at Suan Sunandha Rajabhat University in academic year 2013. Data were collected with a questionnaire on the safety of data storage devices based on the CIA (confidentiality, integrity, availability) principle. A sample of 81 students was drawn from the population by purposive sampling. The results show that most graduate students of academic year 2013 at Suan Sunandha Rajabhat University use handy drives to store their data and that the safety level of the devices is good.
Keywords: security, safety, storage devices, graduate students
Procedia PDF Downloads 353
25662 Simulation of a Cost Model Response Requests for Replication in Data Grid Environment
Authors: Kaddi Mohammed, A. Benatiallah, D. Benatiallah
Abstract:
Data grid technology has brought new challenges, such as the heterogeneity and availability of geographically distributed resources, fast data access, latency minimization, and fault tolerance. Researchers interested in this technology address problems found in various industrial systems, such as task scheduling, load balancing, and replication. Replication is an effective way to achieve good performance in terms of data access and grid resources, and better data availability and cost. In a system with replication, a coherence protocol is used to impose some degree of synchronization between the copies and some order on updates. In this project, we present an approach for placing replicas that minimizes the response cost of read and write requests, and we implement our model in a simulation environment. The placement techniques are based on a cost model that depends on several factors, such as bandwidth, data size, and storage nodes.
Keywords: response time, query, consistency, bandwidth, storage capacity, CERN
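The abstract does not give the cost model itself, so the following is only an illustrative sketch of how a replica placement cost based on bandwidth and data size might be evaluated. The formulas (a read is served by the cheapest replica, a write is propagated to every replica), the function names, and the exhaustive search are all assumptions, not the authors' model.

```python
from itertools import combinations

def transfer_cost(size_mb, bandwidth_mbps):
    """Seconds needed to move size_mb megabytes over a link of bandwidth_mbps megabits/s."""
    return size_mb * 8 / bandwidth_mbps

def placement_cost(replicas, requests, bandwidth, size_mb):
    """Total response cost of all requests for one candidate placement (assumed model).

    replicas:  iterable of storage node names holding a copy
    requests:  list of (client, kind) pairs, kind being "read" or "write"
    bandwidth: dict mapping (client, node) to link bandwidth in Mbit/s
    """
    total = 0.0
    for client, kind in requests:
        costs = [transfer_cost(size_mb, bandwidth[(client, node)]) for node in replicas]
        # Assumption: a read uses the cheapest replica, a write updates every replica.
        total += min(costs) if kind == "read" else sum(costs)
    return total

def best_placement(nodes, k, requests, bandwidth, size_mb):
    """Exhaustively pick the k storage nodes with the lowest total cost."""
    return min(combinations(nodes, k),
               key=lambda r: placement_cost(r, requests, bandwidth, size_mb))

# Hypothetical two-node grid with two clients.
bandwidth = {("c1", "A"): 100, ("c1", "B"): 10, ("c2", "A"): 10, ("c2", "B"): 100}
requests = [("c1", "read")] * 3 + [("c2", "read")]
print(best_placement(["A", "B"], 1, requests, bandwidth, size_mb=100))
```

Exhaustive search is only practical for small grids; it is used here to keep the sketch short.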
Procedia PDF Downloads 271
25661 Prompt Design for Code Generation in Data Analysis Using Large Language Models
Authors: Lu Song Ma Li Zhi
Abstract:
With the rapid advancement of artificial intelligence technology, large language models (LLMs) have become a milestone in the field of natural language processing, demonstrating remarkable capabilities in semantic understanding, intelligent question answering, and text generation. These models are gradually penetrating various industries, particularly showcasing significant application potential in the data analysis domain. However, retraining or fine-tuning these models requires substantial computational resources and ample downstream task datasets, which poses a significant challenge for many enterprises and research institutions. Without modifying the internal parameters of the large models, prompt engineering techniques can rapidly adapt these models to new domains. This paper proposes a prompt design strategy aimed at leveraging the capabilities of large language models to automate the generation of data analysis code. By carefully designing prompts, data analysis requirements can be described in natural language, which the large language model can then understand and convert into executable data analysis code, thereby greatly enhancing the efficiency and convenience of data analysis. This strategy not only lowers the threshold for using large models but also significantly improves the accuracy and efficiency of data analysis. Our approach includes requirements for the precision of natural language descriptions, coverage of diverse data analysis needs, and mechanisms for immediate feedback and adjustment. Experimental results show that with this prompt design strategy, large language models perform exceptionally well in multiple data analysis tasks, generating high-quality code and significantly shortening the data analysis cycle. 
This method provides an efficient and convenient tool for the data analysis field and demonstrates the enormous potential of large language models in practical applications.
Keywords: large language models, prompt design, data analysis, code generation
Procedia PDF Downloads 42
25660 Comparison of Different Methods to Produce Fuzzy Tolerance Relations for Rainfall Data Classification in the Region of Central Greece
Authors: N. Samarinas, C. Evangelides, C. Vrekos
Abstract:
This paper compares three methods for producing fuzzy tolerance relations for rainfall data classification: the correlation coefficient, cosine amplitude, and max-min methods. The data were obtained from seven rainfall stations in the region of central Greece and consist of 20-year time series of average monthly rainfall height. Each of the three methods was used to express these data as a fuzzy relation. The resulting fuzzy tolerance relation is transformed into an equivalence relation by max-min composition for all three methods. From the equivalence relation, the rainfall stations were categorized and classified according to the degree of confidence. The classification shows the similarities among the rainfall stations. Stations with high similarity can be used interchangeably in water resource management scenarios, or to augment each other's data. Because the calculations are complex, it is important to find out which of the methods is computationally simplest and needs the fewest compositions to give reliable results.
Keywords: classification, fuzzy logic, tolerance relations, rainfall data
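As a rough illustration of the pipeline described above, the sketch below builds a tolerance relation with the cosine amplitude method, turns it into an equivalence relation by repeated max-min composition, and reads off classes at a chosen confidence level. The function names and the example matrix are illustrative assumptions; the rainfall data from the paper are not reproduced here.

```python
import math

def cosine_amplitude(data):
    """Fuzzy tolerance relation: r_ij = |sum_k x_ik*x_jk| / sqrt(sum_k x_ik^2 * sum_k x_jk^2)."""
    n = len(data)
    rel = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            num = abs(sum(a * b for a, b in zip(data[i], data[j])))
            den = math.sqrt(sum(a * a for a in data[i]) * sum(b * b for b in data[j]))
            rel[i][j] = num / den
    return rel

def max_min_compose(r, s):
    """Max-min composition: (r o s)_ij = max_k min(r_ik, s_kj)."""
    n = len(r)
    return [[max(min(r[i][k], s[k][j]) for k in range(n)) for j in range(n)]
            for i in range(n)]

def to_equivalence(rel):
    """Compose the relation with itself until it stops changing (transitive closure)."""
    compositions = 0
    while True:
        nxt = max_min_compose(rel, rel)
        if nxt == rel:
            return rel, compositions
        rel, compositions = nxt, compositions + 1

def lambda_cut_classes(eq, lam):
    """Group items whose membership in the equivalence relation is at least lam."""
    classes, seen = [], set()
    for i in range(len(eq)):
        if i in seen:
            continue
        members = [j for j in range(len(eq)) if eq[i][j] >= lam]
        seen.update(members)
        classes.append(members)
    return classes

# Hypothetical 5x5 tolerance relation (reflexive and symmetric, not transitive).
tolerance = [
    [1.0, 0.8, 0.0, 0.1, 0.2],
    [0.8, 1.0, 0.4, 0.0, 0.9],
    [0.0, 0.4, 1.0, 0.0, 0.0],
    [0.1, 0.0, 0.0, 1.0, 0.5],
    [0.2, 0.9, 0.0, 0.5, 1.0],
]
equivalence, n_compositions = to_equivalence(tolerance)
print(lambda_cut_classes(equivalence, 0.8))
```

The composition count returned by `to_equivalence` is one simple way to compare how many compositions each method needs, which is the question the abstract raises.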
Procedia PDF Downloads 314
25659 Surface Sediment Quality Assessment in a Coastal Lagoon (NW Adriatic Sea) Based on SEM-AVS Analysis
Authors: Roberta Guerra, Juan Pablo Pozo Hernandez
Abstract:
Surface sediments from the coastal lagoon of Pialassa Piomboni in the NW Adriatic Sea were collected and analysed, and the potential ecological risks in the area were assessed based on the acid-volatile sulphide (AVS) model. The AVS levels are between 0.03 and 8.8 µmol g⁻¹, with an average of 3.1 µmol g⁻¹. The simultaneously extracted metals (∑SEM), the molar sum of Cd, Cu, Ni, Pb, and Zn, range from 0.3 to 6.6 µmol g⁻¹, with an average of 1.7 µmol g⁻¹. Most of the high ∑SEM concentrations are located in the southern area of the lagoon. [SEM]Zn had a comparatively high mean concentration (1.4 µmol g⁻¹) and a maximum value of 6.1 µmol g⁻¹. Concentrations of [SEM]Cd, [SEM]Cu, [SEM]Ni, and [SEM]Pb were consistently lower, with maximum values of 0.007 µmol g⁻¹, 1.4 µmol g⁻¹, 0.3 µmol g⁻¹, and 0.2 µmol g⁻¹, respectively. Compared to the other metals, [SEM]Zn was the dominant component in all samples and accounted for approximately 31 to 93% of the ∑SEM, whereas the contribution of Cd, the most toxic metal studied, to ∑SEM was no more than 1%. According to the USEPA evaluation method, the sediment samples can be divided into three categories: category 1, adverse biological effects on aquatic life may be expected when ([SEM]–[AVS])/fOC > 3,000; category 2, adverse effects on aquatic life are uncertain when ([SEM]–[AVS])/fOC = 130 to 3,000; and category 3, no indication of adverse effects when ([SEM]–[AVS])/fOC < 130. Most of the surface sediments of the Pialassa Piomboni lagoon (>90%) showed no adverse biological effects according to the criterion proposed by the USEPA, while adverse effects were uncertain at a few stations (~2%).
Keywords: sediment quality, heavy metals, coastal lagoon, bioavailability, SEM, AVS
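The three USEPA categories quoted above can be expressed as a small classification function. This is a minimal sketch: the function name is hypothetical, the thresholds are assumed to be in µmol per gram of organic carbon with fOC given as a mass fraction, and the fOC values in the usage lines are made up for illustration (the abstract reports none).

```python
def usepa_category(sem, avs, f_oc):
    """Classify a sediment sample by its organic-carbon-normalised excess SEM.

    sem, avs: simultaneously extracted metals and acid-volatile sulphide (µmol/g dry sediment)
    f_oc:     fraction of organic carbon in the sediment (assumed unit: g OC per g sediment)
    Returns 1 (effects may be expected), 2 (effects uncertain) or 3 (no indication of effects).
    """
    if f_oc <= 0:
        raise ValueError("f_oc must be positive")
    excess = (sem - avs) / f_oc
    if excess > 3000:
        return 1
    if excess >= 130:
        return 2
    return 3

# Mean values from the abstract with a hypothetical fOC of 2%: AVS exceeds SEM.
print(usepa_category(sem=1.7, avs=3.1, f_oc=0.02))
# Maximum SEM with minimum AVS from the abstract and a hypothetical fOC of 1%.
print(usepa_category(sem=6.6, avs=0.03, f_oc=0.01))
```

When AVS exceeds ∑SEM the excess is negative, so the sample falls in category 3 regardless of fOC.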
Procedia PDF Downloads 406
25658 Customer Satisfaction and Effective HRM Policies: Customer and Employee Satisfaction
Authors: S. Anastasiou, C. Nathanailides
Abstract:
The purpose of this study is to examine the possible link between employee and customer satisfaction. The service provided by employees helps to build a good relationship with customers and can help increase their loyalty. Data on job satisfaction and indicators of customer service were gathered from relevant published works, which included data from five different countries. The reviewed data indicate a significant correlation between indicators of customer and employee satisfaction in the banking sector (Pearson correlation, R² = 0.52, P < 0.05). The reviewed data thus provide some practical evidence linking these two parameters.
Keywords: job satisfaction, job performance, customer service, banks, human resources management
Procedia PDF Downloads 321
25657 Spectroscopic Autoradiography of Alpha Particles on Geologic Samples at the Thin Section Scale Using a Parallel Ionization Multiplier Gaseous Detector
Authors: Hugo Lefeuvre, Jerôme Donnard, Michael Descostes, Sophie Billon, Samuel Duval, Tugdual Oger, Herve Toubon, Paul Sardini
Abstract:
Spectroscopic autoradiography is a method of interest for geological sample analysis. In environmental studies, researchers may face issues such as radioelement identification and quantification. Imaging gaseous ionization detectors find their place in geosciences for specific radioactivity measurements that improve the monitoring of natural processes using naturally occurring radioactive tracers, and also in the nuclear industry linked to the mining sector. In geological samples, locating and identifying the radioactive-bearing minerals at the thin-section scale remains a major challenge, as the detection limit of the usual elemental microprobe techniques is far higher than the concentration of most natural radioactive decay products. The spatial distribution of each decay product, in the case of uranium in a geomaterial, is of interest for relating radionuclide concentrations to the mineralogy. The present study aims to provide a spectroscopic autoradiography analysis method for measuring the initial energy of alpha particles with a parallel ionization multiplier gaseous detector. The analysis method was developed using Geant4 modelling of the detector. The tracks of alpha particles recorded in the gas detector allow the simultaneous measurement of the initial point of emission and the reconstruction of the initial particle energy by a selection based on the linear energy distribution. This spectroscopic autoradiography method was successfully used to reproduce the alpha spectra from a 238U decay chain on a geological sample at the thin-section scale. The measurement is characterized by an energy spectrum resolution of 17.2% (FWHM) at 4647 keV and a spatial resolution of at least 50 µm.
Although the efficiency of energy spectrum reconstruction is low (4.4%) compared to that of a simple autoradiograph (50%), this novel measurement approach makes it possible to select areas on an autoradiograph and perform an energy spectrum analysis within them. This opens up possibilities for the detailed analysis of heterogeneous geological samples containing natural alpha emitters such as uranium-238 and radium-226. The measurement will allow the study of the spatial distribution of uranium and its descendants in geomaterials by coupling it with scanning electron microscope characterizations. The direct application of this dual modality (energy-position) of analysis will be the subject of future developments. The measurement of the radioactive equilibrium state of heterogeneous geological structures and the quantitative mapping of 226Ra radioactivity are now being actively studied.
Keywords: alpha spectroscopy, digital autoradiography, mining activities, natural decay products
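To put the quoted resolution in absolute terms, the short calculation below converts the relative FWHM at 4647 keV into keV and, assuming a Gaussian peak shape (an assumption, not stated in the abstract), into a standard deviation. It also compares the two detection efficiencies quoted above.

```python
import math

def fwhm_kev(relative_fwhm, energy_kev):
    """Absolute full width at half maximum for a relative resolution at a given energy."""
    return relative_fwhm * energy_kev

def gaussian_sigma(fwhm):
    """Standard deviation of a Gaussian peak: FWHM = 2*sqrt(2*ln 2)*sigma."""
    return fwhm / (2 * math.sqrt(2 * math.log(2)))

fwhm = fwhm_kev(0.172, 4647)           # 17.2% at 4647 keV
sigma = gaussian_sigma(fwhm)           # only meaningful if the peak is Gaussian
efficiency_ratio = 0.044 / 0.50        # spectrum reconstruction vs. simple autoradiograph

print(f"FWHM  = {fwhm:.0f} keV")
print(f"sigma = {sigma:.0f} keV (Gaussian assumption)")
print(f"spectroscopic events per autoradiograph event: {efficiency_ratio:.3f}")
```

A FWHM of roughly 800 keV means that alpha lines lying closer together than that will overlap in the reconstructed spectrum.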
Procedia PDF Downloads 151
25656 Evaluation of Australian Open Banking Regulation: Balancing Customer Data Privacy and Innovation
Authors: Suman Podder
Abstract:
As Australian ‘Open Banking’ allows customers to share their financial data with accredited Third-Party Providers (‘TPPs’), it is necessary to evaluate whether the regulators have achieved the balance between protecting customer data privacy and promoting data-related innovation. Recognising the need to increase customers’ influence on their own data, and the benefits of data-related innovation, the Australian Government introduced the ‘Consumer Data Right’ (‘CDR’) to the banking sector through Open Banking regulation. Under Open Banking, TPPs can access customers’ banking data, which allows them to tailor their products and services to meet customer needs at a more competitive price. This facilitated access to and use of customer data will promote innovation by providing opportunities for new products and business models to emerge and grow. However, the success of Open Banking depends on the willingness of customers to share their data, so the regulators have augmented the protection of data by introducing new privacy safeguards to instill confidence and trust in the system. The dilemma in policymaking is that, on the one hand, lenient data privacy laws will help the flow of information, but at the risk of individuals’ loss of privacy; on the other hand, stringent laws that adequately protect privacy may dissuade innovation. Using theoretical and doctrinal methods, this paper examines whether the privacy safeguards under Open Banking will add to the compliance burden of the participating financial institutions, resulting in the undesirable effect of stifling other policy objectives such as innovation. The contribution of this research is three-fold. In the emerging field of customer data sharing, this research is one of the few academic studies on the objectives and impact of Open Banking in the Australian context.
Additionally, Open Banking is still in the early stages of implementation, so this research traces the evolution of Open Banking through policy debates regarding the desirability of customer data-sharing. Finally, the research not only focuses on customers’ data privacy and juxtaposes it with another important objective, promoting innovation, but also highlights the critical issues facing the data-sharing regime. This paper argues that while it is challenging to develop a regulatory framework for protecting data privacy without impeding innovation and jeopardising yet unknown opportunities, data privacy and innovation promote different aspects of customer welfare. This paper concludes that if a regulation is appropriately designed and implemented, the benefits of data-sharing will outweigh the cost of compliance with the CDR.
Keywords: consumer data right, innovation, open banking, privacy safeguards
Procedia PDF Downloads 141
25655 Synthetic, Characterization and Biological Studies of Bis(Tetrathiomolybdate) Compounds of Pt (II), Pd (II) and Ni (II)
Authors: V. K. Srivastava
Abstract:
The chemistry of compounds containing transition metals bound to sulfur-containing ligands has been actively studied. Interest in these compounds arises from the identification of the biological importance of iron-sulfur-containing proteins as well as the unusual behaviour of several types of synthetic metal-sulfur complexes. The metal complexes [(C₆H₅)₄P]₂Pt(MoS₄)₂, [(C₆H₅)₄P]₂Pd(MoS₄)₂, and [(C₆H₅)₄P]₂Ni(MoS₄)₂, which are of bioinorganic relevance, were investigated. The complexes [M(M'S₄)₂]²⁻ were prepared in high yield and purity as salts of a variety of organic cations. The diamagnetism and spectroscopic properties of these complexes confirmed that their structures are essentially equivalent, with two bidentate M'S₄²⁻ ligands coordinated to the central d⁸ metal in a square planar geometry. The interaction of the complexes with CT-DNA was studied. The results showed that the metal complexes increased the relative viscosity of DNA and quenched the fluorescence intensity of EB bound to DNA. All complexes showed good antimicrobial activity, higher than that of the ligand, against gram-positive bacteria, gram-negative bacteria, and fungi. The antitumor properties were tested in vitro against two human tumor cell lines, HeLa (derived from cervical cancer) and MCF-7 (derived from breast cancer), using metabolic activity tests. The results showed that the complexes are promising chemotherapeutic alternatives in the search for anticancer agents.
Keywords: anti cancer, biocidal, DNA binding, spectra
Procedia PDF Downloads 159
25654 Generation of Automated Alarms for Plantwide Process Monitoring
Authors: Hyun-Woo Cho
Abstract:
Early detection of incipient abnormal operations is necessary in plant-wide process management in order to improve product quality and process safety. Generating warning signals or alarms for operating personnel plays an important role in process automation and intelligent plant health monitoring. Various methodologies have been developed and used in this area, such as expert systems, mathematical model-based approaches, and multivariate statistical approaches. This work presents a nonlinear empirical monitoring methodology based on the real-time analysis of massive process data. Unfortunately, such big data include measurement noise and unwanted variations unrelated to true process behavior. These unnecessary patterns are therefore eliminated in the data processing step to enhance detection speed and accuracy. The performance of the methodology was demonstrated using simulated process data. The case study showed that detection speed and performance improved significantly, irrespective of the size and location of abnormal events.
Keywords: detection, monitoring, process data, noise
Procedia PDF Downloads 252
25653 Integrated On-Board Diagnostic-II and Direct Controller Area Network Access for Vehicle Monitoring System
Authors: Kavian Khosravinia, Mohd Khair Hassan, Ribhan Zafira Abdul Rahman, Syed Abdul Rahman Al-Haddad
Abstract:
The controller area network (CAN) bus is a multi-master, message-broadcast system. Messages sent on the CAN bus communicate state information, referred to as signals, between different ECUs, which provides data consistency in every node of the system. OBD-II dongles, which are based on a request-response method, are the most widespread solution among researchers for extracting sensor data from cars. Unfortunately, most past research does not consider the resolution and quantity of the input data extracted through OBD-II technology. With the well-known ELM327 OBD-II dongle, the maximum feasible scan rate is only 9 queries per second, which provides 8 data points per second. This study aims to design and develop a programmable, latency-sensitive vehicle data acquisition system that improves modularity and flexibility in order to extract exact, trustworthy, and fresh car sensor data at higher frequency rates. Furthermore, the researcher must break apart, thoroughly inspect, and observe the internal network of the vehicle, which may cause severe damage to the vehicle's expensive ECUs due to intrinsic vulnerabilities of the CAN bus during initial research. The desired sensor data were collected from various vehicles using a Raspberry Pi3 as the computing and processing unit, with the OBD (request-response) and direct CAN methods used at the same time. Two types of data were collected for this study. The first is CAN bus frame data, in which each line of hex data sent from an ECU is recorded; the second is OBD data, a limited set of data requested from the ECU under standard conditions. The proposed system is a reconfigurable, human-readable, multi-task telematics device that can be fitted into any vehicle with minimum effort and minimum time lag in the data extraction process.
A standard operational procedure experimental vehicle network test bench was developed and can be used for future vehicle network testing experiments.
Keywords: CAN bus, OBD-II, vehicle data acquisition, connected cars, telemetry, Raspberry Pi3
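To make the request-response method concrete, the sketch below builds the payload of a standard OBD-II mode 01 query as it would be sent over CAN and decodes two common responses (engine speed, PID 0x0C, and vehicle speed, PID 0x0D). It works on raw bytes only and does not talk to a bus; the helper names are hypothetical, and the zero padding of unused bytes is an assumption (padding values vary between tools). This is not the authors' acquisition code.

```python
OBD_REQUEST_ID = 0x7DF        # functional (broadcast) request ID on 11-bit CAN
OBD_FIRST_RESPONSE_ID = 0x7E8 # ECUs answer on 0x7E8..0x7EF

PID_ENGINE_RPM = 0x0C
PID_VEHICLE_SPEED = 0x0D

def build_request(pid, mode=0x01):
    """8-byte CAN payload: length byte, service (mode), PID, then padding."""
    return bytes([0x02, mode, pid, 0x00, 0x00, 0x00, 0x00, 0x00])

def decode_response(payload):
    """Decode a single-frame mode 01 response into (pid, value).

    Layout: [n_bytes, 0x41, pid, A, B, ...] where 0x41 = 0x01 + 0x40 marks a mode 01 reply.
    """
    if len(payload) < 4 or payload[1] != 0x41:
        raise ValueError("not a mode 01 response")
    pid, a = payload[2], payload[3]
    if pid == PID_ENGINE_RPM:
        b = payload[4]
        return pid, (256 * a + b) / 4      # revolutions per minute
    if pid == PID_VEHICLE_SPEED:
        return pid, float(a)               # km/h
    raise ValueError(f"PID 0x{pid:02X} not handled in this sketch")

request = build_request(PID_ENGINE_RPM)
print(hex(OBD_REQUEST_ID), request.hex(" "))

# Example reply bytes, made up for illustration.
print(decode_response(bytes([0x04, 0x41, 0x0C, 0x1A, 0xF8, 0x00, 0x00, 0x00])))
```

Each value obtained this way costs one request and one reply, which is why the request-response route yields far fewer data points per second than passively reading broadcast CAN frames.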
Procedia PDF Downloads 205
25652 Response of Chickpea (Cicer arietinum L.) Genotypes to Drought Stress at Different Growth Stages
Authors: Ali. Marjani, M. Farsi, M. Rahimizadeh
Abstract:
Chickpea (Cicer arietinum L.) is one of the most important grain legume crops in the world. However, drought stress is a serious threat to chickpea production, and the development of drought-resistant varieties is a necessity. Field experiments were conducted to evaluate the response of 8 chickpea genotypes (MCC* 696, 537, 80, 283, 392, 361, 252, 397) to drought stress applied at different growth stages (S1: non-stress, S2: stress at vegetative growth stage, S3: stress at early bloom, S4: stress at early pod visible). The experiment was arranged in a split-plot design with four replications. The differences among drought stress timings were significant for all investigated traits except biological yield. Differences among genotypes were observed in flowering time, pod formation time, physiological maturation time, and yield. Plant height was reduced by drought stress at the vegetative growth stage. Stem dry weight was reduced by drought stress at the early pod visible stage. Drought stress at flowering also reduced flowering time, maturation time, pod number, number of seeds per plant, and yield. Yield was positively correlated with the number of seeds per plant and with biological yield. MCC283 and MCC696 were the high-tolerance genotypes. These results demonstrate that drought stress delays phenological growth in chickpea and that the flowering stage is sensitive.
Keywords: chickpea, drought stress, growth stage, tolerance
Procedia PDF Downloads 261
25651 Comparison of Classical and Ultrasound-Assisted Extractions of Hyphaene thebaica Fruit and Evaluation of Its Extract as Antibacterial Activity in Reducing Severity of Erwinia carotovora
Authors: Hanan Moawad, Naglaa M. Abd EL-Rahman
Abstract:
Erwinia carotovora var. carotovora is the main cause of soft rot in potatoes. Hyphaene thebaica, which inhibited the growth of E. carotovora on solid medium, was studied for biocontrol of E. carotovora in a comparative study of classical and ultrasound-assisted extractions of Hyphaene thebaica fruit. The use of ultrasound significantly decreased the total treatment time and increased the total amount of crude extract. The crude extract was tested in vitro: a bioassay technique revealed that treating paper disks with the ultrasound extract of Hyphaene thebaica reduced the growth of the pathogen and produced inhibition zones up to 38 mm in diameter. The antioxidant activity of the ultrasound-ethanolic extract of doum fruits (Hyphaene thebaica) was determined. The data showed that the extract contains secondary metabolites such as tannins, saponins, flavonoids, phenols, steroids, terpenoids, glycosides, and alkaloids.
Keywords: ultrasound, classical extract, biological control, Erwinia carotovora, Hyphaene thebaica
Procedia PDF Downloads 520
25650 Parasitological Study and Its Role in Fisheries Management and Stock Assessment of Boops boops (Lineauses, 1758) along the Tunisian Coast
Authors: I. Chebbi, L. Boudaya, L. Neifar
Abstract:
The bogue, Boops boops, is an economically important fishery resource commonly captured in the Mediterranean. Its parasite diversity has been used as a tool to differentiate between stocks along the Tunisian coast, since this approach is widely accepted in fisheries management. In this study, a total of 90 fish were examined from three localities off Tunisia: Kelibia, Mahdia, and Zarzis. Fifteen species of parasites totaling 1270 individuals were collected from B. boops, of which ten were used as biological tags. Based on Mahalanobis distance, each parasite species is of great importance in discriminating between groups. Tetraphyllidea larvae are the most influential parasites in determining the position of samples from Kelibia. Monogenean species and Hysterothylacium sp. are the most important species for determining the position of samples from Mahdia. Specimens from Zarzis are characterized by the absence of the four Monogenean species and the Tetraphyllidea larvae. Parasites allocate B. boops individuals correctly to their communities of origin with an accuracy of 83.3%. These results, corroborated by the discriminant analyses, highlight the presence of three stocks and show that the parasitological method can be considered a reliable key for discriminating among B. boops stocks in Tunisian waters.
Keywords: biological marker, Boops boops, parasite, population structure
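For readers unfamiliar with the measure, the sketch below shows how a squared Mahalanobis distance can be used to assign a fish to the nearest locality centroid. It is restricted to two parasite variables so the covariance matrix can be inverted by hand. The centroids, the covariance matrix, and the function names are hypothetical, and the shared-covariance assignment rule is a common simplification rather than the authors' exact discriminant analysis.

```python
def mahalanobis_sq(x, mean, cov):
    """Squared Mahalanobis distance (x - mean)^T cov^-1 (x - mean) for two variables."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    # Inverse of a 2x2 matrix is (1/det) * [[d, -b], [-c, a]].
    return (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det

def nearest_group(x, centroids, cov):
    """Assign x to the group whose centroid has the smallest squared distance."""
    return min(centroids, key=lambda name: mahalanobis_sq(x, centroids[name], cov))

# Hypothetical mean abundances of two parasite taxa per fish at each locality.
centroids = {"Kelibia": (6.0, 1.0), "Mahdia": (1.0, 5.0), "Zarzis": (0.0, 0.5)}
pooled_cov = [[4.0, 0.0], [0.0, 1.0]]   # made-up pooled covariance

print(nearest_group((5.0, 1.5), centroids, pooled_cov))
```

Unlike the Euclidean distance, this measure scales each variable by its variance, so a highly variable parasite count does not dominate the assignment.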
Procedia PDF Downloads 134
25649 The Effect of Different Cucumber (Cucumis sativus L.) Varieties on Growth and Development Time of Aphis gossypii Glover (Hemiptera: Aphididae)
Authors: Rochelyn Dona, Mohamed F. Nur, Serdar Satar
Abstract:
The biological response of Aphis gossypii Glover (Hom. Aphididae) to seven cucumber (Cucumis sativus L.) varieties, Kitir, Muhika, Ayda, Beit, 14-F1, Ruzgar, and Ptk, was investigated under laboratory conditions at 24±1°C, 65±5% relative humidity (RH), and a photoperiod of 16:8 (L:D) hours. The developmental time of A. gossypii nymphs differed significantly among varieties only at the first instar stage. Development times ranged, from lowest to highest, from 0.98 days on Ruzgar to 1.18 days on Kitir for the first instar, from 0.98 days on Beit Alfa to 1.08 days on Muhika for the second, from 0.94 days on Kitir to 1.16 days on 14-F1 for the third, and from 1.22 days on Ptk to 1.48 days on Kitir for the last instar. The total development time ranged from 4.46 days on Beit Alfa to 4.72 days on Kitir. The number of offspring ranged from 60.42 aphids on Ayda to 83.72 aphids on Muhika; significant differences between varieties were determined by one-way ANOVA (Tukey test). The lifetime of A. gossypii ranged from 19.10 days on Kitir to 27.64 days on Ptk. The results showed that cucumber cultivar affected the biology of A. gossypii. Combining these findings with other IPM tactics can serve as the best strategy for controlling this pest on cucumber varieties in the greenhouse.
Keywords: cucumber cultivars, fecundity, intrinsic rate, mortality, resistance
Procedia PDF Downloads 190
25648 The Impact of Mycotoxins on the Anaerobic Digestion Process
Authors: Harald Lindorfer, Bettina Frauz, Dietmar Ramhold
Abstract:
In addition to well-known inhibitors of anaerobic digestion such as ammonia, antibiotics, and disinfectants, the number of process failures connected with mould growth in the feedstock has increased significantly in recent years. It was assumed that mycotoxins are the cause of the negative effects. The financial damage to plants associated with these process failures is considerable. The aim of this study was to find a way of predicting the failures, as well as strategies for fast process recovery. In a first step, mould-contaminated feedstocks causing process failures in full-scale digesters were sampled and analysed for mycotoxin content. A selection of these samples was subjected to biological inhibition tests. In this test, crystalline cellulose is applied as a standard substrate in addition to the feedstock sample. Affected digesters were also sampled, and analytical process data as well as operational data of the plants were recorded. Additionally, different mycotoxins (deoxynivalenol, zearalenone, aflatoxin B1, mycophenolic acid, and citrinin) were applied as pure substances to lab-scale digesters, individually and in various combinations, and the effects were monitored. As expected, various mycotoxins were detected in all of the mould-contaminated samples. Nevertheless, inhibition effects were observed with only one of the collected samples after it was applied in an inhibition test. With this sample, the biogas yield of the standard substrate was reduced by approx. 20%. This result corresponds with observations made on full-scale plants. However, none of the tested mycotoxins applied as pure substances had a negative effect on biogas production in lab-scale digesters, neither individually nor in combination. In most cases, the process data recorded in full-scale plants affected by process failures showed a severe accumulation of fatty acids alongside a decrease in biogas production and methane concentration.
In the analytical data of the digester samples, a typical distribution of fatty acids with exceptionally high acetic acid concentrations could be identified. This typical fatty acid pattern can be used as a rapid identification parameter pointing to the cause of the process troubles and enabling fast implementation of countermeasures. The results of the study show that more attention needs to be paid to feedstock storage and feedstock conservation before application to anaerobic digesters. This is all the more important since first studies indicate that the occurrence of mycotoxins will likely increase in Europe due to ongoing climate change.
Keywords: Anaerobic digestion, Biogas, Feedstock conservation, Fungal mycotoxins, Inhibition, process failure
Procedia PDF Downloads 130
25647 Minimum Data of a Speech Signal as Special Indicators of Identification in Phonoscopy
Authors: Nazaket Gazieva
Abstract:
Voice biometric data associated with physiological, psychological, and other factors are widely used in forensic phonoscopy. There are various methods for identifying and verifying a person by voice. This article explores the minimum speech signal data as individual parameters of a speech signal. Monozygotic twins are believed to be genetically identical. Using the minimum data of the speech signal, we found that the voice imprint of each monozygotic twin is individual. The experiment indicates that the minimum indicators of the speech signal are more stable and reliable for phonoscopic examinations.
Keywords: phonogram, speech signal, temporal characteristics, fundamental frequency, biometric fingerprints
Procedia PDF Downloads 144
25646 Learning Grammars for Detection of Disaster-Related Micro Events
Authors: Josef Steinberger, Vanni Zavarella, Hristo Tanev
Abstract:
Natural disasters cause tens of thousands of victims and massive material damage. We refer to all events caused by natural disasters, such as damage to people, infrastructure, vehicles, services, and resource supply, as micro events. This paper addresses the problem of micro-event detection in online media sources. We present a natural language grammar learning algorithm and apply it to online news. The algorithm is based on distributional clustering and the detection of word collocations. We also explore the extraction of micro events from social media and describe a Twitter mining robot, which uses combinations of keywords to detect tweets that talk about the effects of disasters.
Keywords: online news, natural language processing, machine learning, event extraction, crisis computing, disaster effects, Twitter
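The keyword-combination idea can be illustrated with a very small filter: a message is kept only if it contains every word of at least one combination, for example a disaster term together with an effect term. The combinations and helper names below are hypothetical and far simpler than the learned grammars the abstract describes.

```python
import re

def tokenize(text):
    """Lower-case the text and return its set of alphabetic word tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))

def mentions_micro_event(text, combinations):
    """True if all keywords of at least one combination occur in the text."""
    tokens = tokenize(text)
    return any(set(combo) <= tokens for combo in combinations)

# Hypothetical keyword combinations: (disaster term, effect term).
COMBINATIONS = [
    ("flood", "damaged"),
    ("earthquake", "collapsed"),
    ("storm", "outage"),
]

tweets = [
    "Bridge damaged by the flood in the old town",
    "A flood of emails this morning",
    "School collapsed after the earthquake, rescue teams on site",
]
for tweet in tweets:
    print(mentions_micro_event(tweet, COMBINATIONS), "-", tweet)
```

Requiring a combination rather than a single keyword is what filters out figurative uses such as the second example.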
Procedia PDF Downloads 478
25645 Synthesis of Biologically Active Heterocyclic Compounds via C-H Bond Activation
Authors: Neeraj Kumar Mishra, In Su Kim
Abstract:
The isoindoline, indazole and indole heterocycles are ubiquitous structural motif found in heterocyclic compounds as they exhibit biological and medicinal applications. For example, isoindoline motif is present in molecules that act as endothelin-A receptor antagonists and dipeptidyl peptidase inhibitors. Moreover, isoindoline derivatives are very crucial constituents in the field of materials science as attractive candidates for organic light-emitting devices. However, compounds containing the indazole motif are known to exhibit to a variety of biological activities, such as estrogen receptor, HIV protease inhibition and anti-tumor activity. The prevalence of indazoles and indoles has led to the development of many useful methods for their preparation. Thus, isoindoline, indazole and indole heterocycles can be new candidates for the next generation of pharmaceuticals. Therefore, the development of highly efficient strategies for the formation of these heterocyclic architectures is an area of great interest in organic synthesis. The past years, transition-metal-catalyzed C−H activation followed by annulation reaction has been frequently used as a powerful tool to construct various heterocycles. Herein, we describe our recent achievements about the transition-metal-catalyzed tandem cyclization reactions of N-benzyltriflamides, 1,2-disubstituted arylhydrazines, acetanilides, etc. via C−H bond activation to access the corresponding bioactive heterocylic scaffolds.Keywords: biologically active, C-H activation, heterocyclic compounds, transition-metal catalysts
Procedia PDF Downloads 309
25644 Evaluation of Airborne Particulate Matter Early Biological Effects in Children with Micronucleus Cytome Assay: The MAPEC_LIFE Project
Authors: E. Carraro, Sa. Bonetta, Si. Bonetta, E. Ceretti, G. C. V. Viola, C. Pignata, S. Levorato, T. Salvatori, S. Vannini, V. Romanazzi, A. Carducci, G. Donzelli, T. Schilirò, A. De Donno, T. Grassi, S. Bonizzoni, A. Bonetti, G. Gilli, U. Gelatti
Abstract:
In 2013, air pollution and particulate matter were classified as carcinogenic to humans by the IARC. At present, PM is Europe's most problematic pollutant in terms of harm to health, as reported by the European Environmental Agency (EEA) in the EEA Technical Report on Air quality in Europe, 2015. Between 17% and 30% of the EU urban population lives in areas where the EU air quality 24-hour limit value for PM10 is exceeded. Many studies have found a consistent association between exposure to PM and the incidence of and mortality from some chronic diseases (i.e. lung cancer, cardiovascular diseases). Among the mechanisms responsible for these adverse effects, genotoxic damage is of particular concern. Children are a high-risk group in terms of the health effects of air pollution, and early exposure during childhood can increase the risk of developing chronic diseases in adulthood. MAPEC_LIFE (Monitoring Air Pollution Effects on Children for supporting public health policy) is a project funded by the EU Life+ Programme (LIFE12 ENV/IT/000614) which intends to evaluate the associations between air pollution and early biological effects in children and to propose a model for estimating the global risk of early biological effects due to air pollutants and other factors in children. This work focuses on the micronuclei frequency in child buccal cells in association with airborne PM levels, taking into account the influence of other factors associated with the lifestyle of children. The micronucleus test was performed in exfoliated buccal cells of 6–8-year-old children from 5 Italian towns with different air pollution levels. Data on air quality during the study period were obtained from the Regional Agency for Environmental Protection. A questionnaire administered to children’s parents was used to obtain details on family socio-economic status, children's health condition, exposures to other indoor and outdoor pollutants (i.e. passive smoke) and lifestyle, with particular reference to eating habits. During the first sampling campaign (winter 2014-15), 1315 children were recruited and sampled for the micronucleus test in buccal cells. In the sampling period the levels of the main pollutants and PM10 were, as expected, higher in the North of Italy (PM10 mean values 62 μg/m3 in Torino and 40 μg/m3 in Brescia) than in the other towns (Pisa, Perugia, Lecce). A higher micronucleus frequency in buccal cells of children was found in Brescia (0.6/1000 cells) than in the other towns (range 0.3-0.5/1000 cells). The statistical analysis underlines a relation of the micronuclei frequency with PM concentrations, traffic level near child residence, and level of education of parents. The results suggest that, in addition to air pollution exposure, some other factors, related to lifestyle or further exposures, may influence micronucleus frequency and cellular response to air pollutants.Keywords: air pollution, buccal cells, children, micronucleus cytome assay
Procedia PDF Downloads 253
25643 A Non-parametric Clustering Approach for Multivariate Geostatistical Data
Authors: Francky Fouedjio
Abstract:
Multivariate geostatistical data have become omnipresent in the geosciences and pose substantial analysis challenges. One of them is the grouping of data locations into spatially contiguous clusters so that data locations within the same cluster are more similar while clusters are different from each other, in some sense. Spatially contiguous clusters can significantly improve interpretation by turning the resulting clusters into meaningful geographical subregions. In this paper, we develop an agglomerative hierarchical clustering approach that takes into account the spatial dependency between observations. It relies on a dissimilarity matrix built from a non-parametric kernel estimator of the spatial dependence structure of the data. It integrates existing methods to find the optimal cluster number and to evaluate the contribution of variables to the clustering. The capability of the proposed approach to provide spatially compact, connected and meaningful clusters is assessed using a bivariate synthetic dataset and a multivariate geochemical dataset. The proposed clustering method gives satisfactory results compared to other similar geostatistical clustering methods.Keywords: clustering, geostatistics, multivariate data, non-parametric
Procedia PDF Downloads 477
25642 Big Data in Telecom Industry: Effective Predictive Techniques on Call Detail Records
Authors: Sara ElElimy, Samir Moustafa
Abstract:
Mobile network operators face many challenges in the digital era, especially with high demands from customers. Mobile network operators are a source of big data, and traditional techniques are not effective in the new era of big data, Internet of things (IoT) and 5G; as a result, handling different big datasets effectively becomes a vital task for operators with the continuous growth of data and the move from long term evolution (LTE) to 5G. There is therefore an urgent need for effective big data analytics to predict future demands, traffic, and network performance to fulfill the requirements of the fifth generation of mobile network technology. In this paper, we introduce data science techniques using machine learning and deep learning algorithms: the autoregressive integrated moving average (ARIMA), Bayesian-based curve fitting, and recurrent neural network (RNN) are employed for a data-driven application to mobile network operators. The main framework of the models includes identification of the parameters of each model, estimation, prediction, and the final data-driven application of this prediction to business and network performance applications. These models are applied to the Telecom Italia Big Data Challenge call detail records (CDRs) datasets. The performance of these models, assessed using well-known evaluation criteria, shows that ARIMA (machine learning-based model) is more accurate as a predictive model on this dataset than the RNN (deep learning model).Keywords: big data analytics, machine learning, CDRs, 5G
Procedia PDF Downloads 139
25641 Parallel Coordinates on a Spiral Surface for Visualizing High-Dimensional Data
Authors: Chris Suma, Yingcai Xiao
Abstract:
This paper presents Parallel Coordinates on a Spiral Surface (PCoSS), a parallel coordinate based interactive visualization method for high-dimensional data, and a test implementation of the method. Plots generated by the test system are compared with those generated by XDAT, a software implementing traditional parallel coordinates. Traditional parallel coordinate plots can be cluttered when the number of data points is large or when the dimensionality of the data is high. PCoSS plots display multivariate data on a 3D spiral surface and allow users to see the whole picture of high-dimensional data with less cluttering. Taking advantage of the 3D display environment in PCoSS, users can further reduce cluttering by zooming into an axis of interest for a closer view or by moving vantage points and by reorienting the viewing angle to obtain a desired view of the plots.Keywords: human computer interaction, parallel coordinates, spiral surface, visualization
Procedia PDF Downloads 12
25640 A Dynamic Ensemble Learning Approach for Online Anomaly Detection in Alibaba Datacenters
Authors: Wanyi Zhu, Xia Ming, Huafeng Wang, Junda Chen, Lu Liu, Jiangwei Jiang, Guohua Liu
Abstract:
Anomaly detection is a first and imperative step needed to respond to unexpected problems and to assure high performance and security in large data center management. This paper presents an online anomaly detection system through an innovative approach of ensemble machine learning and adaptive differentiation algorithms, and applies them to performance data collected from a continuous monitoring system for multi-tier web applications running in Alibaba data centers. We evaluate the effectiveness and efficiency of this algorithm with production traffic data and compare with the traditional anomaly detection approaches such as a static threshold and other deviation-based detection techniques. The experiment results show that our algorithm correctly identifies the unexpected performance variances of any running application, with an acceptable false positive rate. This proposed approach has already been deployed in real-time production environments to enhance the efficiency and stability in daily data center operations.Keywords: Alibaba data centers, anomaly detection, big data computation, dynamic ensemble learning
Procedia PDF Downloads 201
25639 The Role of Synthetic Data in Aerial Object Detection
Authors: Ava Dodd, Jonathan Adams
Abstract:
The purpose of this study is to explore the characteristics of developing a machine learning application using synthetic data. The study is structured to develop the application for the purpose of deploying the computer vision model. The findings discuss the realities of attempting to develop a computer vision model for practical purpose, and detail the processes, tools, and techniques that were used to meet accuracy requirements. The research reveals that synthetic data represents another variable that can be adjusted to improve the performance of a computer vision model. Further, a suite of tools and tuning recommendations are provided.Keywords: computer vision, machine learning, synthetic data, YOLOv4
Procedia PDF Downloads 225
25638 Synthesis of Flavonoid Derivatives Precursors of Active Pharmaceutical Ingredients by Mechanical Chemistry
Authors: Imen Abid, Rachel Calvet, Michel Baltas
Abstract:
Flavonoids are secondary metabolites that belong to a polyphenolic class, present in fruits and vegetables, playing a significant role in biological systems. The structural variations of these flavonoids are associated with many biological and pharmacological activities (antioxidant, anti-inflammatory, anticancer, antibacterial, antifungal, antiviral, and antimalarial). Given their importance in plants and health-promoting roles in humans, significant efforts have been devoted to the isolation of flavonoids and their chemical elaboration (organic synthesis). But with the increasing public concern over environmental degradation and future resources, it is of great importance for chemists to come up with different approaches that are less hazardous to human health and the environment. Being employed in large amounts, the solvents used in organic synthesis are high on the list of environmental pollutants. To overcome these problems, our approach is to develop unconventional processes involving solvent-free conditions. The application of mechanical forces to solvent-free or solvent-less reaction mixtures through the use of ball mills offers many advantages over traditional solvent-based strategies. It is one of the unconventional activation methods that make it possible to avoid the use of solvents, in keeping with green chemistry and greater respect for the environment.Keywords: organic synthesis, green chemistry, mechanochemistry, pharmaceutical molecules
Procedia PDF Downloads 79
25637 Perception-Oriented Model Driven Development for Designing Data Acquisition Process in Wireless Sensor Networks
Authors: K. Indra Gandhi
Abstract:
Wireless Sensor Networks (WSNs) have always been characterized by application-specific sensing, relaying and collection of information for further analysis. However, software development was not considered as a separate entity in this process of data collection, which has posed severe limitations on software development for WSNs. Software development for WSNs is a complex process since the components involved are data-driven, network-driven and application-driven in nature. This implies that there is a tremendous need for the separation of concerns from the software development perspective. A layered approach for developing data acquisition design based on Model Driven Development (MDD) has been proposed, as the sensed data collection process itself varies depending upon the application taken into consideration. This work focuses on the layered view of the data acquisition process so as to ease development from the software point of view. A metamodel has been proposed that enables reusability and realization of the software development as an adaptable component for WSN systems. Further, observation of users' perception indicates that the proposed model helps in improving the programmer's productivity by realizing the collaborative system involved.Keywords: data acquisition, model-driven development, separation of concern, wireless sensor networks
Procedia PDF Downloads 434
25636 Comparative Analysis of Data Gathering Protocols with Multiple Mobile Elements for Wireless Sensor Network
Authors: Bhat Geetalaxmi Jairam, D. V. Ashoka
Abstract:
Wireless Sensor Networks are used in many applications to collect sensed data from different sources. Sensed data has to be delivered through the sensors' wireless interface using multi-hop communication towards the sink. Data collection in wireless sensor networks consumes energy, and energy consumption is the major constraint in WSNs. Reducing the energy consumption while increasing the amount of generated data is a great challenge. In this paper, we have implemented two data gathering protocols with multiple mobile sinks/elements to collect data from sensor nodes. The first is Energy-Efficient Data Gathering with Tour Length-Constrained Mobile Elements in Wireless Sensor Networks (EEDG), in which mobile sinks use a vehicle routing protocol to collect data. The second is An Intelligent Agent-based Routing Structure for Mobile Sinks in WSNs (IAR), in which mobile sinks use Prim’s algorithm to collect data. We have implemented the concepts common to both protocols, such as deployment of mobile sinks, generating the visiting schedule, and collecting data from the cluster members. We have compared the performance of both protocols using statistics based on performance parameters such as Delay, Packet Drop, Packet Delivery Ratio, Energy Available, and Control Overhead. We conclude that EEDG is more efficient than the IAR protocol, but with a few limitations, which include unaddressed issues such as redundancy removal, idle listening, and the mobile sink’s pause/wait state at the node. In future work, we plan to concentrate on these limitations to obtain a new energy-efficient protocol that will help improve the lifetime of the WSN.Keywords: aggregation, consumption, data gathering, efficiency
Procedia PDF Downloads 497
25635 Status and Results from EXO-200
Authors: Ryan Maclellan
Abstract:
EXO-200 has provided one of the most sensitive searches for neutrinoless double-beta decay utilizing 175 kg of enriched liquid xenon in an ultra-low background time projection chamber. This detector has demonstrated excellent energy resolution and background rejection capabilities. Using the first two years of data, EXO-200 has set a limit of 1.1x10^25 years at 90% C.L. on the neutrinoless double-beta decay half-life of Xe-136. The experiment has experienced a brief hiatus in data taking during a temporary shutdown of its host facility: the Waste Isolation Pilot Plant. EXO-200 expects to resume data taking in earnest this fall with upgraded detector electronics. Results from the analysis of EXO-200 data and an update on the current status of EXO-200 will be presented.Keywords: double-beta, Majorana, neutrino, neutrinoless
Procedia PDF Downloads 414
25634 Remaining Useful Life (RUL) Assessment Using Progressive Bearing Degradation Data and ANN Model
Authors: Amit R. Bhende, G. K. Awari
Abstract:
Remaining useful life (RUL) prediction is one of the key technologies for realizing prognostics and health management, which is widely applied in many industrial systems to ensure high system availability over their life cycles. The present work proposes a data-driven method of RUL prediction based on multiple health state assessment for rolling element bearings. Run-to-failure bearing degradation data under three different conditions are used. A RUL prediction model is built separately for each condition. Feed-forward back propagation neural network models are developed for prediction modeling.Keywords: bearing degradation data, remaining useful life (RUL), back propagation, prognosis
Procedia PDF Downloads 436