Search results for: high correlated data
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 38656

Search results for: high correlated data

38656 Partial Least Square Regression for High-Dimentional and High-Correlated Data

Authors: Mohammed Abdullah Alshahrani

Abstract:

The research focuses on investigating the use of partial least squares (PLS) methodology for addressing challenges associated with high-dimensional correlated data. Recent technological advancements have led to experiments producing data characterized by a large number of variables compared to observations, with substantial inter-variable correlations. Such data patterns are common in chemometrics, where near-infrared (NIR) spectrometer calibrations record chemical absorbance levels across hundreds of wavelengths, and in genomics, where thousands of genomic regions' copy number alterations (CNA) are recorded from cancer patients. PLS serves as a widely used method for analyzing high-dimensional data, functioning as a regression tool in chemometrics and a classification method in genomics. It handles data complexity by creating latent variables (components) from original variables. However, applying PLS can present challenges. The study investigates key areas to address these challenges, including unifying interpretations across three main PLS algorithms and exploring unusual negative shrinkage factors encountered during model fitting. The research presents an alternative approach to addressing the interpretation challenge of predictor weights associated with PLS. Sparse estimation of predictor weights is employed using a penalty function combining a lasso penalty for sparsity and a Cauchy distribution-based penalty to account for variable dependencies. The results demonstrate sparse and grouped weight estimates, aiding interpretation and prediction tasks in genomic data analysis. High-dimensional data scenarios, where predictors outnumber observations, are common in regression analysis applications. Ordinary least squares regression (OLS), the standard method, performs inadequately with high-dimensional and highly correlated data. Copy number alterations (CNA) in key genes have been linked to disease phenotypes, highlighting the importance of accurate classification of gene expression data in bioinformatics and biology using regularized methods like PLS for regression and classification.

Keywords: partial least square regression, genetics data, negative filter factors, high dimensional data, high correlated data

Procedia PDF Downloads 11
38655 Network Analysis of Genes Involved in the Biosynthesis of Medicinally Important Naphthodianthrone Derivatives of Hypericum perforatum

Authors: Nafiseh Noormohammadi, Ahmad Sobhani Najafabadi

Abstract:

Hypericins (hypericin and pseudohypericin) are natural napthodianthrone derivatives produced by Hypericum perforatum (St. John’s Wort), which have many medicinal properties such as antitumor, antineoplastic, antiviral, and antidepressant activities. Production and accumulation of hypericin in the plant are influenced by both genetic and environmental conditions. Despite the existence of different high-throughput data on the plant, genetic dimensions of hypericin biosynthesis have not yet been completely understood. In this research, 21 high-quality RNA-seq data on different parts of the plant were integrated into metabolic data to reconstruct a coexpression network. Results showed that a cluster of 30 transcripts was correlated with total hypericin. The identified transcripts were divided into three main groups based on their functions, including hypericin biosynthesis genes, transporters, detoxification genes, and transcription factors (TFs). In the biosynthetic group, different isoforms of polyketide synthase (PKSs) and phenolic oxidative coupling proteins (POCPs) were identified. Phylogenetic analysis of protein sequences integrated into gene expression analysis showed that some of the POCPs seem to be very important in the biosynthetic pathway of hypericin. In the TFs group, six TFs were correlated with total hypericin. qPCR analysis of these six TFs confirmed that three of them were highly correlated. The identified genes in this research are a rich resource for further studies on the molecular breeding of H. perforatum in order to obtain varieties with high hypericin production.

Keywords: hypericin, St. John’s Wort, data mining, transcription factors, secondary metabolites

Procedia PDF Downloads 55
38654 Partners Sharing Resources, Costs, and Risks

Authors: Lee Li

Abstract:

The strategic management literature posits that the major motive of strategic alliances is to share resources, costs and risks. However, the literature also indicates that such sharing leads to transaction costs which are positively correlated with environmental dynamism. As such, it is not clear why firms are willing to cover high transaction costs for sharing resources, costs and risks. This study categorizes resources into firm-specific and general resource; costs into accounting and non-accounting cost; and risks into visible and invisible risks. Using data from 167 Canadian firms in technology industries, we find that sharing firm-specific resources and non-accounting costs are negatively correlated with environmental dynamism but sharing general resources, accounting costs and visible risks are positively correlated with environmental dynamism. Findings suggest that sharing certain resources, costs and risks do not necessarily incur high transaction costs.

Keywords: environmental dynamism, strategic alliances, resource/cost/risk sharing

Procedia PDF Downloads 338
38653 Determination of Relationship among Shape Indexes Used for Land Consolidation

Authors: Firat Arslan, Hasan Degirmenci, Serife Tulin Akkaya Aslan

Abstract:

The aim of the current experiment was to determine the relationship among shape indexes which are used by the researchers in many fields to evaluate parcel shapes which is very important for farming even if these indexes are controversial. In the current study, land consolidation project of Halitaga village in Mersin province in Turkey which has 278 parcel and cover 894.4 ha, was taken as a material. Commonly used indicators such as fractal dimension (FD), shape index (SI), form factor (FORM), areal form factor (AFF) and two distinct area-perimeter ratio (APR-1 and APR2) in land consolidation are used to measure agricultural plot’s shape. FD was positively correlated with SI, APR-1 and APR-2 whereas it was negatively correlated with FORM and AFF. SI was positively correlated with APR-1 and APR-2 whereas it was negatively correlated with FORM and AFF. As a conclusion, it is likely that these indexes involved may be used interchangeably due to high correlations among them.

Keywords: GIS, land consolidation, parcel shape, shape index

Procedia PDF Downloads 157
38652 Estimating the Receiver Operating Characteristic Curve from Clustered Data and Case-Control Studies

Authors: Yalda Zarnegarnia, Shari Messinger

Abstract:

Receiver operating characteristic (ROC) curves have been widely used in medical research to illustrate the performance of the biomarker in correctly distinguishing the diseased and non-diseased groups. Correlated biomarker data arises in study designs that include subjects that contain same genetic or environmental factors. The information about correlation might help to identify family members at increased risk of disease development, and may lead to initiating treatment to slow or stop the progression to disease. Approaches appropriate to a case-control design matched by family identification, must be able to accommodate both the correlation inherent in the design in correctly estimating the biomarker’s ability to differentiate between cases and controls, as well as to handle estimation from a matched case control design. This talk will review some developed methods for ROC curve estimation in settings with correlated data from case control design and will discuss the limitations of current methods for analyzing correlated familial paired data. An alternative approach using Conditional ROC curves will be demonstrated, to provide appropriate ROC curves for correlated paired data. The proposed approach will use the information about the correlation among biomarker values, producing conditional ROC curves that evaluate the ability of a biomarker to discriminate between diseased and non-diseased subjects in a familial paired design.

Keywords: biomarker, correlation, familial paired design, ROC curve

Procedia PDF Downloads 202
38651 High Performance Computing and Big Data Analytics

Authors: Branci Sarra, Branci Saadia

Abstract:

Because of the multiplied data growth, many computer science tools have been developed to process and analyze these Big Data. High-performance computing architectures have been designed to meet the treatment needs of Big Data (view transaction processing standpoint, strategic, and tactical analytics). The purpose of this article is to provide a historical and global perspective on the recent trend of high-performance computing architectures especially what has a relation with Analytics and Data Mining.

Keywords: high performance computing, HPC, big data, data analysis

Procedia PDF Downloads 484
38650 The Relevant Study of Leisure Motivation, Leisure Attitude and Health Promotion Lifestyle of Elderly People in Taiwan

Authors: Cheng-Yu Tsai, Chiung-En Huang, Ming-Tsang Wu

Abstract:

The purpose of this study was to investigate the relationships among leisure motivation, leisure attitude, and health promotion lifestyle. The participants were recruited from a convenience sampling that subjects were at least 55 years of age in Tainan City, Taiwan. Three hundred survey instruments were distributed, and 227 effective instruments were returned, for an effective rate of 75.7%. The collected data were analyzed statistically. The findings of this research were as follows: 1.There is significantly correlated between leisure motivation and leisure attitude. 2. There is significantly correlated between leisure attitude and health promotion lifestyle. 3. There is significantly correlated between leisure motivation and health promotion lifestyle.

Keywords: leisure motivation, leisure attitude, health promotion lifestyle, tourism

Procedia PDF Downloads 329
38649 Recombination Center Levels in Gold and Platinum Doped N-type Silicon for High-Speed Thyristor

Authors: Nam Chol Yu, GyongIl Chu, HoJong Ri

Abstract:

Using DLTS (Deep-level transient spectroscopy) measurement techniques, we determined the dominant recombination center levels (defects of both A and B) in gold and platinum doped n-type silicon. Also, the injection and temperature dependence of the Shockley-Read-Hall (SRH) carrier lifetime was studied under low-level injection and high-level injection. Here measurements show that the dominant level under low-level injection located at EC-0.25 eV (A) correlated to the Pt+G1 and the dominant level under high-level injection located at EC-0.54 eV (B) correlated to the Au+G4. Finally, A and B are the same dominant levels for controlling the lifetime in gold-platinum doped n-silicon.

Keywords: recombination center level, lifetime, carrier lifetime control, Gold, Platinum, Silicon

Procedia PDF Downloads 36
38648 Cobalt Ions Adsorption by Quartz and Illite and Calcite from Waste Water

Authors: Saad A. Aljlil

Abstract:

Adsorption of cobalt ions on quartz and illite and calcite from waste water was investigated. The effect of pH on the adsorption of cobalt ions was studied. The maximum capacities of cobalt ions of the three adsorbents increase with increasing cobalt solution temperature. The maximum capacities were (4.66) mg/g for quartz, (3.94) mg/g for illite, and (3.44) mg/g for calcite. The enthalpy, Gibbs free energy, and entropy for adsorption of cobalt ions on the three adsorbents were calculated. It was found that the adsorption process of the cobalt ions of the adsorbent was an endothermic process. consequently increasing the temperature causes the increase of the cobalt ions adsorption of the adsorbents. Therefore, the adsorption process is preferred at high temperature levels. The equilibrium adsorption data were correlated using Langmuir model, Freundlich model. The experimental data of cobalt ions of the adsorbents correlated well with Freundlich model.

Keywords: adsorption, Langmuir, Freundlich, quartz, illite, calcite, waste water

Procedia PDF Downloads 343
38647 A Data-Driven Monitoring Technique Using Combined Anomaly Detectors

Authors: Fouzi Harrou, Ying Sun, Sofiane Khadraoui

Abstract:

Anomaly detection based on Principal Component Analysis (PCA) was studied intensively and largely applied to multivariate processes with highly cross-correlated process variables. Monitoring metrics such as the Hotelling's T2 and the Q statistics are usually used in PCA-based monitoring to elucidate the pattern variations in the principal and residual subspaces, respectively. However, these metrics are ill suited to detect small faults. In this paper, the Exponentially Weighted Moving Average (EWMA) based on the Q and T statistics, T2-EWMA and Q-EWMA, were developed for detecting faults in the process mean. The performance of the proposed methods was compared with that of the conventional PCA-based fault detection method using synthetic data. The results clearly show the benefit and the effectiveness of the proposed methods over the conventional PCA method, especially for detecting small faults in highly correlated multivariate data.

Keywords: data-driven method, process control, anomaly detection, dimensionality reduction

Procedia PDF Downloads 268
38646 Perceived and Performed E-Health Literacy: Survey and Simulated Performance Test

Authors: Efrat Neter, Esther Brainin, Orna Baron-Epel

Abstract:

Background: Connecting end-users to newly developed ICT technologies and channeling patients to new products requires an assessment of compatibility. End user’s assessment is conveyed in the concept of eHealth literacy. The study examined the association between perceived and performed eHealth literacy (EHL) in a heterogeneous age sample in Israel. Methods: Participants included 100 Israeli adults (mean age 43,SD 13.9) who were first phone interviewed and then tested on a computer simulation of health-related Internet tasks. Performed, perceived and evaluated EHL were assessed. Levels of successful completion of tasks represented EHL performance and evaluated EHL included observed motivation, confidence, and amount of help provided. Results: The skills of accessing, understanding, appraising, applying, and generating new information had a decreasing successful completion rate with increase in complexity of the task. Generating new information, though highly correlated with all other skills, was least correlated with the other skills. Perceived and performed EHL were correlated (r=.40, P=.001), while facets of performance (i.e, digital literacy and EHL) were highly correlated (r=.89, P<.001). Participants low and high in performed EHL were significantly different: low performers were older, had attained less education, used the Internet for less time and perceived themselves as less healthy. They also encountered more difficulties, required more assistance, were less confident in their conduct and exhibited less motivation than high performers. Conclusions: The association in this age-hetrogenous ample was larger than in previous age-homogenous samples. The moderate association between perceived and performed EHL indicates that the two are associated yet distinct, the latter requiring separate assessment. Features of future rapid performed EHL tools are discussed.

Keywords: eHealth, health literacy, performance, simulation

Procedia PDF Downloads 212
38645 Recombination Center Levels in Gold and Platinum Doped N-Type Silicon

Authors: Nam Chol Yu, Kyong Il Chu

Abstract:

Using DLTS measurement techniques, we determined the dominant recombination center levels (defects of both A and B) in gold and platinum doped n-type silicon. Also, the injection and temperature dependence of the Shockley-Read-Hall (SRH) carrier lifetime was studied under low-level injection and high-level injection. Here measurements show that the dominant level under low-level injection located at EC-0.25eV(A) correlated to the Pt+G1 and the dominant level under high-level injection located at EC-0.54eV(B) correlated to the Au+G4. Finally, A and B are the same dominant levels for controlling the lifetime in gold-platinum doped n-silicon.

Keywords: recombination center level, lifetime, carrier lifetime control, gold, platinum, silicon

Procedia PDF Downloads 124
38644 Happiness, Life Satisfaction, Self-Efficacy, Coping Strategies and Perceived Stress among High School Students

Authors: Mustafa Jahanara, Mohsen Shahbakhti

Abstract:

The current study examined the relationship between happiness, life satisfaction, self-efficacy, coping strategies and perceived stress among high school students, Eshtehard city, Alboez province, Iran. One hundred and sixty-seven high school students were asked to complete the Subjective Happiness scale (SHS), The Satisfaction with Life Scale (SWLS), General Self-efficacy (GSE), the Brief COPE, and Perceived Stress Scale (PSS). Results revealed a positive correlation between happiness with life satisfaction, self-efficacy, problem-focused coping, adopted coping and it was a negative correlation with stress. Self-efficacy is a significant positive correlated with life satisfaction, problem-focused coping and it is negative correlated with stress. However, the findings suggest that self-efficacy and problem-focused coping could influence on happiness and life satisfaction.

Keywords: happiness, life satisfaction, self-efficacy, perceived stress and coping strategies

Procedia PDF Downloads 508
38643 Affective Attributes and Second Language Performance of Third Year Maritime Students: A Teacher's Compass

Authors: Sonia Pajaron, Flaviano Sentina, Ranulfo Etulle

Abstract:

Learning a second language calls for a total commitment from the learner whose response is necessary to successfully send and receive linguistic messages. It is relevant to virtually every aspect of human behaviour which is even more challenging when the components on -affective domains- are involved in second language learning. This study investigated the association between the identified affective attributes and second language performance of the one hundred seventeen (117) randomly selected third year maritime students. A descriptive-correlational method was utilized to generate data on their affective attributes while composition writing (2 series) and IELTS-based interview was done for speaking test. Additionally, to establish the respondents’ English language profile, data on their high school grades (GPA), entrance exam results in English subject (written) as well as in the interview was extracted as baseline information. Data were subjected to various statistical treatment (average means, percentages and pearson-r moment coefficient correlation) and found out that, Nautical Science and Marine Engineering students were found to have average high school grade, entrance test results, both written and in the interview turned out to be very satisfactory at 50% passing percentage. Varied results were manifested in their affective attributes towards learning the second language. On attitude, nautical science students had true positive attitude while marine engineering had only a moderate positive one. Secondly, the former were positively motivated to learn English while the latter were just moderately motivated. As regards anxiety, both groups embodied a moderate level of anxiety in the English language. Finally, data showed that nautical science students exuded real confidence while the marine engineering group had only moderate confidence with the second language. Respondents’ English academic achievement (GWA) was significantly correlated with confidence and speaking with anxiety towards the second language among the students from the nautical science group with moderate positive and low negative degree of correlation, respectively. On the other hand, the marine engineering students’ speaking test result was significantly correlated with anxiety and self-confidence with a moderate negative and low positive degree of correlation, respectively while writing was significantly correlated with motivation bearing a low positive degree of correlation.

Keywords: affective attributes, second language, second language performance, anxiety, attitude, self-confidence and motivation

Procedia PDF Downloads 244
38642 Toxicities associated with EBRT and Brachytherapy for Intermediate and High Risk Prostate Cancer, Correlated with Intra-operative Dosing

Authors: Rebecca Dunne, Cormac Small, Geraldine O'Boyle, Nazir Ibrahim, Anisha

Abstract:

Prostate cancer is the most common cancer among men, excluding non-melanoma skin cancers. It is estimated that approximately 12% of men will develop prostate cancer during their lifetime. Patients with intermediate, high risk, and very-high risk prostate cancer often undergo a combination of radiation treatments. These treatments include external beam radiotherapy with a low-dose rate or high-dose rate brachytherapy boost, often with concomitant androgen deprivation therapy. The literature on follow-up of patients that receive brachytherapy is scarce, particularly follow-up of patients that undergo high-dose rate brachytherapy. This retrospective study aims to investigate the biochemical failure and toxicities associated with triple therapy and external beam radiotherapy given in combination with brachytherapy. Reported toxicities and prostate specific antigen (PSA) were retrospectively evaluated in eighty patients that previously underwent external beam radiotherapy with a low-dose rate or high dose-rate brachytherapy boost. The severity of toxicities were correlated with intra-operative dosing during brachytherapy on ultrasound and CT scan. The results of this study will provide further information for clinicians and patients when considering treatment options.

Keywords: toxicities, combination, brachytherapy, intra-operative dosing, biochemical failure

Procedia PDF Downloads 201
38641 International Students in the US: Personality and Cross-Cultural Adaptability

Authors: Nhi Phuoc Thuc Le

Abstract:

Cross-cultural adaptability —one’s readiness to interact with people who are different from oneself or to adapt to living in another culture— is essential to the well-being and experience of international students. This research was set out to find the correlation between certain personality traits of international students and their likelihood to adapt to the U.S., the host culture. The study used Qualtrics, an online survey, to investigate the relationships between international students’ social self-efficacy, ego-resiliency, cultural intelligence, Big Five personality traits and cross-cultural adaptability (sociocultural and psychological adaptability). The data were analysed with the software SPSS. The findings of this quantitative study show that high scores in ego-resiliency, social self-efficacy, cultural intelligence and personality traits (including extraversion, agreeableness, intellect and conscientiousness) are correlated with better cross-cultural adaptation. Meanwhile, the Big-Five trait neuroticism is correlated with lower cross-cultural adaptability. Such insight is suggested to help international students be better prepared for an immersion into the US culture.

Keywords: Big Five, cross-cultural adaptability, cultural intelligence, ego-resiliency, international students, personality, self-efficacy

Procedia PDF Downloads 166
38640 Relay-Augmented Bottleneck Throughput Maximization for Correlated Data Routing: A Game Theoretic Perspective

Authors: Isra Elfatih Salih Edrees, Mehmet Serdar Ufuk Türeli

Abstract:

In this paper, an energy-aware method is presented, integrating energy-efficient relay-augmented techniques for correlated data routing with the goal of optimizing bottleneck throughput in wireless sensor networks. The system tackles the dual challenge of throughput optimization while considering sensor network energy consumption. A unique routing metric has been developed to enable throughput maximization while minimizing energy consumption by utilizing data correlation patterns. The paper introduces a game theoretic framework to address the NP-complete optimization problem inherent in throughput-maximizing correlation-aware routing with energy limitations. By creating an algorithm that blends energy-aware route selection strategies with the best reaction dynamics, this framework provides a local solution. The suggested technique considerably raises the bottleneck throughput for each source in the network while reducing energy consumption by choosing the best routes that strike a compromise between throughput enhancement and energy efficiency. Extensive numerical analyses verify the efficiency of the method. The outcomes demonstrate the significant decrease in energy consumption attained by the energy-efficient relay-augmented bottleneck throughput maximization technique, in addition to confirming the anticipated throughput benefits.

Keywords: correlated data aggregation, energy efficiency, game theory, relay-augmented routing, throughput maximization, wireless sensor networks

Procedia PDF Downloads 32
38639 Fine-Scale Modeling the Influencing Factors of Multi-Time Dimensions of Transit Ridership at Station Level: The Study of Guangzhou City

Authors: Dijiang Lyu, Shaoying Li, Zhangzhi Tan, Zhifeng Wu, Feng Gao

Abstract:

Nowadays, China is experiencing rapidly urban rail transit expansions in the world. The purpose of this study is to finely model factors influencing transit ridership at multi-time dimensions within transit stations’ pedestrian catchment area (PCA) in Guangzhou, China. This study was based on multi-sources spatial data, including smart card data, high spatial resolution images, points of interest (POIs), real-estate online data and building height data. Eight multiple linear regression models using backward stepwise method and Geographic Information System (GIS) were created at station-level. According to Chinese code for classification of urban land use and planning standards of development land, residential land-use were divided into three categories: first-level (e.g. villa), second-level (e.g. community) and third-level (e.g. urban villages). Finally, it concluded that: (1) four factors (CBD dummy, number of feeder bus route, number of entrance or exit and the years of station operation) were proved to be positively correlated with transit ridership, but the area of green land-use and water land-use negative correlated instead. (2) The area of education land-use, the second-level and third-level residential land-use were found to be highly connected to the average value of morning peak boarding and evening peak alighting ridership. But the area of commercial land-use and the average height of buildings, were significantly positive associated with the average value of morning peak alighting and evening peak boarding ridership. (3) The area of the second-level residential land-use was rarely correlated with ridership in other regression models. Because private car ownership is still large in Guangzhou now, and some residents living in the community around the stations go to work by transit at peak time, but others are much more willing to drive their own car at non-peak time. The area of the third-level residential land-use, like urban villages, was highly positive correlated with ridership in all models, indicating that residents who live in the third-level residential land-use are the main passenger source of the Guangzhou Metro. (4) The diversity of land-use was found to have a significant impact on the passenger flow on the weekend, but was non-related to weekday. The findings can be useful for station planning, management and policymaking.

Keywords: fine-scale modeling, Guangzhou city, multi-time dimensions, multi-sources spatial data, transit ridership

Procedia PDF Downloads 119
38638 Nutrition Environments and the Development of Taste Preferences: A Cross-Sectional Study of Primary School Children in Trinidad and Tobago

Authors: Fareena Alladin

Abstract:

In the Caribbean, issues of food security, health and taste are intricately linked, seen most clearly in the increasing incidence of lifestyle diseases among children coupled with a taste for high calorie and Westernized diets. In order to fully appreciate this link, the role of nutrition environments must be examined. To this end, the present study incorporates tenets of Bourdieu’s social constructivist theory with the Community Nutrition Environment Model. The aim of this study was to examine the relationships between availability of and access to healthy/unhealthy foods within nutrition environments, namely the household and school, and the development of taste preferences for healthy/unhealthy foods among primary school children in a selected educational district in Trinidad and Tobago. A cross-sectional survey of 400 children between the ages of 9 and 11 years was conducted. Data analysis was conducted using SPSS 24. Results indicated that availability of healthy food at home was positively correlated with preference for vegetables, and negatively correlated with preference for salty snacks and fast food. The availability of unhealthy food within the home was found to be negatively correlated with preference for vegetables and positively correlated with preference for salty snacks. Access to unhealthy foods at school had a positive correlation with preference for fast food. These findings highlight the role of the food environment in shaping taste preferences, and point to the need for interrogating the centrality of food security concerns in emerging health concerns of Caribbean countries. Such interrogations are a necessary part of the development of research agendas, and policy formulation and implementation.

Keywords: food security, nutrition environment, taste preference, Trinidad and Tobago

Procedia PDF Downloads 106
38637 Discerning Divergent Nodes in Social Networks

Authors: Mehran Asadi, Afrand Agah

Abstract:

In data mining, partitioning is used as a fundamental tool for classification. With the help of partitioning, we study the structure of data, which allows us to envision decision rules, which can be applied to classification trees. In this research, we used online social network dataset and all of its attributes (e.g., Node features, labels, etc.) to determine what constitutes an above average chance of being a divergent node. We used the R statistical computing language to conduct the analyses in this report. The data were found on the UC Irvine Machine Learning Repository. This research introduces the basic concepts of classification in online social networks. In this work, we utilize overfitting and describe different approaches for evaluation and performance comparison of different classification methods. In classification, the main objective is to categorize different items and assign them into different groups based on their properties and similarities. In data mining, recursive partitioning is being utilized to probe the structure of a data set, which allow us to envision decision rules and apply them to classify data into several groups. Estimating densities is hard, especially in high dimensions, with limited data. Of course, we do not know the densities, but we could estimate them using classical techniques. First, we calculated the correlation matrix of the dataset to see if any predictors are highly correlated with one another. By calculating the correlation coefficients for the predictor variables, we see that density is strongly correlated with transitivity. We initialized a data frame to easily compare the quality of the result classification methods and utilized decision trees (with k-fold cross validation to prune the tree). The method performed on this dataset is decision trees. Decision tree is a non-parametric classification method, which uses a set of rules to predict that each observation belongs to the most commonly occurring class label of the training data. Our method aggregates many decision trees to create an optimized model that is not susceptible to overfitting. When using a decision tree, however, it is important to use cross-validation to prune the tree in order to narrow it down to the most important variables.

Keywords: online social networks, data mining, social cloud computing, interaction and collaboration

Procedia PDF Downloads 118
38636 Correlation between Body Mass Index and Blood Sugar/Serum Lipid Levels in Fourth-Grade Boys in Japan

Authors: Kotomi Yamashita, Hiromi Kawasaki, Satoko Yamasaki, Susumu Fukita, Risako Sakai

Abstract:

Lifestyle-related diseases develop from the long-term accumulation of health consequences from a poor lifestyle. Thus, schoolchildren, who have not accumulated long-term lifestyle habits, are believed to be at a lower risk for lifestyle-related diseases. However, schoolchildren rarely receive blood tests unless they are under treatment for a serious disease; without such data on their blood, the impacts of their young lifestyle could not be known. Blood data from physical measurements can help in the implementation of more effective health education. Therefore, we examined the correlation between body mass index (BMI) and blood sugar/serum lipid (BS/SL) levels. From 2014 to 2016, we measured the blood data of fourth-grade students living in a city in Japan. The present study reported on the results of 281 fourth-grade boys only (80.3% of total). We analyzed their BS/SL levels by comparing the blood data against the criteria of the National Center for Child Health and Development in Japan. Next, we examined the correlation between BMI and BS/SL levels. IBM SPSS Statistics for Windows, Version 25 was used for analysis. A total of 69 boys (24.6%) were within the normal range for BMI (18.5–24), whereas 193 (71.5%) and 8 boys (2.8%) had lower and higher BMI, respectively. Regarding BS levels, 280 boys were within the normal range (70–90 mg/dl); 1 boy reported a higher value. All the boys were within the normal range for glycated Hemoglobin (HbA1c) (4.6–6.2%). Regarding SL levels, 271 boys were within the normal range (125–230 mg/dl) for total cholesterol (TC), whereas 5 boys (1.8%) had lower and 5 boys (1.8%) had higher levels. A total of 243 boys (92.7%) were within the normal range (36-138mg/dL) for triglycerides (TG), whereas 19 boys (7.3%) had lower and 19 boys (7.3%) had higher levels. Regarding high-density lipoprotein cholesterol (HDL-C), 276 boys (98.2%) were within the normal range (40-mg/dl), whereas 5 boys (1.8%) reported lower values. All but one boy (280, 99.6%) were within the normal range (-170 mg/dl) for low-density lipoprotein cholesterol (LDL-C); the exception (0.4%) had a higher level. BMI and BS didn’t show a correlation. BMI and HbA1c were moderately positively correlated (r = 0.139, p=0.019). We also observed moderate positive correlations between BMI and TG (r = 0.328, p < 0.01), TC (r=0.239, p< 0.01), LDL-C (r = 0.324, p < 0.01), respectively. BMI and HDL-C were low correlated (r = -0.185, p = 0.002). Most of the boys were within the normal range for BS/SL levels. However, some boys exceeded the normal TG range. Fourth graders with a high TG may develop a lifestyle-related disease in the future. Given its relation to TG, food habits should be improved in this group. Our findings suggested a positive correlation between BMI and BS/SL levels. Fourth-grade schoolboys with a high BMI may be at high risk for developing lifestyle-related diseases. Lifestyle improvement may be recommended to lower the BS/SL levels in this group.

Keywords: blood sugar level, lifestyle-related diseases, school students, serum lipid level

Procedia PDF Downloads 113
38635 A Generalized Sparse Bayesian Learning Algorithm for Near-Field Synthetic Aperture Radar Imaging: By Exploiting Impropriety and Noncircularity

Authors: Pan Long, Bi Dongjie, Li Xifeng, Xie Yongle

Abstract:

The near-field synthetic aperture radar (SAR) imaging is an advanced nondestructive testing and evaluation (NDT&E) technique. This paper investigates the complex-valued signal processing related to the near-field SAR imaging system, where the measurement data turns out to be noncircular and improper, meaning that the complex-valued data is correlated to its complex conjugate. Furthermore, we discover that the degree of impropriety of the measurement data and that of the target image can be highly correlated in near-field SAR imaging. Based on these observations, A modified generalized sparse Bayesian learning algorithm is proposed, taking impropriety and noncircularity into account. Numerical results show that the proposed algorithm provides performance gain, with the help of noncircular assumption on the signals.

Keywords: complex-valued signal processing, synthetic aperture radar, 2-D radar imaging, compressive sensing, sparse Bayesian learning

Procedia PDF Downloads 97
38634 Determines of Professional Competencies among Newly Registered Nurses in Teaching Hospital in Kingdom of Saudi Arabia

Authors: Rana Alkattan

Abstract:

Aim: This study aims to identify and analyze the factors predicting the professional clinical competency among newly recruited registered nurses. In addition, it aims to explore factors significantly correlated with high and low professional clinical competency score. Method: A descriptive analytical is applied in this study, cross-sectional which conducted between June 2012 and June 2013 at King Abdulaziz University Hospital, as one of the largest governmental university tertiary Hospital in Saudi Arabia. A survey questionnaire was designed to collect data. And then, data were analyzed using the SPSS. Results: A total of the 86 nurses provided valid responses. 69 were female and 17 were male. The majority of the participants in this study were married, from the Philippines, between 20-29 years old. The majority had certified university bachelor’s degree in nursing, as well as had prior experience in nursing between 1 to 5 years. There are two categories emerged from the data, which significantly correlated with nurses' professional competence and development. The first was the newly employed registered nurses demographic characteristic (correlation coefficients 0.154 to 0.470, P < 0.05), while the second was the list of studied environmental factors except 'job rotation factor' (correlation coefficients 0.122 to 0.540, P < 0.01). However, nurses' attitude including motivation and confidence were not associated with nurse's professional competency. Conclusion: that nurses' professional competence development is a process affected by certain personal demographic and environmental factors which will enable newly graduates nurses to provide safe effective patients' care and maintain their career responsibilities.

Keywords: clinical, competence, development nurses professional, registered

Procedia PDF Downloads 333
38633 Improving Temporal Correlations in Empirical Orthogonal Function Expansions for Data Interpolating Empirical Orthogonal Function Algorithm

Authors: Ping Bo, Meng Yunshan

Abstract:

Satellite-derived sea surface temperature (SST) is a key parameter for many operational and scientific applications. However, the disadvantage of SST data is a high percentage of missing data which is mainly caused by cloud coverage. Data Interpolating Empirical Orthogonal Function (DINEOF) algorithm is an EOF-based technique for reconstructing the missing data and has been widely used in oceanographic field. The reconstruction of SST images within a long time series using DINEOF can cause large discontinuities and one solution for this problem is to filter the temporal covariance matrix to reduce the spurious variability. Based on the previous researches, an algorithm is presented in this paper to improve the temporal correlations in EOF expansion. Similar with the previous researches, a filter, such as Laplacian filter, is implemented on the temporal covariance matrix, but the temporal relationship between two consecutive images which is used in the filter is considered in the presented algorithm, for example, two images in the same season are more likely correlated than those in the different seasons, hence the latter one is less weighted in the filter. The presented approach is tested for the monthly nighttime 4-km Advanced Very High Resolution Radiometer (AVHRR) Pathfinder SST for the long-term period spanning from 1989 to 2006. The results obtained from the presented algorithm are compared to those from the original DINEOF algorithm without filtering and from the DINEOF algorithm with filtering but without taking temporal relationship into account.

Keywords: data interpolating empirical orthogonal function, image reconstruction, sea surface temperature, temporal filter

Procedia PDF Downloads 287
38632 A Human Factors Approach to Workload Optimization for On-Screen Review Tasks

Authors: Christina Kirsch, Adam Hatzigiannis

Abstract:

Rail operators and maintainers worldwide are increasingly replacing walking patrols in the rail corridor with mechanized track patrols -essentially data capture on trains- and on-screen reviews of track infrastructure in centralized review facilities. The benefit is that infrastructure workers are less exposed to the dangers of the rail corridor. The impact is a significant change in work design from walking track sections and direct observation in the real world to sedentary jobs in the review facility reviewing captured data on screens. Defects in rail infrastructure can have catastrophic consequences. Reviewer performance regarding accuracy and efficiency of reviews within the available time frame is essential to ensure safety and operational performance. Rail operators must optimize workload and resource loading to transition to on-screen reviews successfully. Therefore, they need to know what workload assessment methodologies will provide reliable and valid data to optimize resourcing for on-screen reviews. This paper compares objective workload measures, including track difficulty ratings and review distance covered per hour, and subjective workload assessments (NASA TLX) and analyses the link between workload and reviewer performance, including sensitivity, precision, and overall accuracy. An experimental study was completed with eight on-screen reviewers, including infrastructure workers and engineers, reviewing track sections with different levels of track difficulty over nine days. Each day the reviewers completed four 90-minute sessions of on-screen inspection of the track infrastructure. Data regarding the speed of review (km/ hour), detected defects, false negatives, and false positives were collected. Additionally, all reviewers completed a subjective workload assessment (NASA TLX) after each 90-minute session and a short employee engagement survey at the end of the study period that captured impacts on job satisfaction and motivation. The results showed that objective measures for tracking difficulty align with subjective mental demand, temporal demand, effort, and frustration in the NASA TLX. Interestingly, review speed correlated with subjective assessments of physical and temporal demand, but to mental demand. Subjective performance ratings correlated with all accuracy measures and review speed. The results showed that subjective NASA TLX workload assessments accurately reflect objective workload. The analysis of the impact of workload on performance showed that subjective mental demand correlated with high precision -accurately detected defects, not false positives. Conversely, high temporal demand was negatively correlated with sensitivity and the percentage of detected existing defects. Review speed was significantly correlated with false negatives. With an increase in review speed, accuracy declined. On the other hand, review speed correlated with subjective performance assessments. Reviewers thought their performance was higher when they reviewed the track sections faster, despite the decline in accuracy. The study results were used to optimize resourcing and ensure that reviewers had enough time to review the allocated track sections to improve defect detection rates in accordance with the efficiency-thoroughness trade-off. Overall, the study showed the importance of a multi-method approach to workload assessment and optimization, combining subjective workload assessments with objective workload and performance measures to ensure that recommendations for work system optimization are evidence-based and reliable.

Keywords: automation, efficiency-thoroughness trade-off, human factors, job design, NASA TLX, performance optimization, subjective workload assessment, workload analysis

Procedia PDF Downloads 85
38631 Self-Esteem, Self-Efficacy and Psychological Distress among the High School Teachers in Afghanistan

Authors: Mustafa Jahanara

Abstract:

The purpose of the research was to study the relationship between self-esteem, self-Efficacy with psychological distress in the high school teachers. A total of 245 teachers (92 male and 153 female) in the high school of Kabul and Mazar-e-Sharif from Afghanistan completed inventories General Self-Efficacy, Rosenberg Self-Esteem Scale, and General Health Questionnaire-12 and that assessed their Self-Efficacy, self-esteem with psychological distress. Correlational analysis showed that self-efficacy and self-esteem were significantly and positively correlated with each other. The results of the study indicated that psychological distress is negatively related to self-esteem, and self-efficacy. However, the findings suggest that self-esteem, and self-efficacy could influence on mental health.

Keywords: high school teachers, self-esteem, self-efficacy, psychological distress

Procedia PDF Downloads 343
38630 The Correlation of Physical Activity and Plantar Pressure in Young Adults

Authors: Lovro Štefan

Abstract:

Background: The main purpose of the present study was to explore the correlations between physical activity and peak plantar pressure in dynamic mode. Methods: Participants were one hundred forty-six first-year university students (30.8% girls). Plantar pressure generated under each region of the foot (forefoot, midfoot, and heel) was measured by using Zebris dynamometric platform (Isny, Germany). The level of physical activity (PA) was calculated with the International Physical Activity questionnaire (IPAQ - short form). Results: In boys, forefoot peak plantar pressure was correlated with moderate PA (MPA; r=-0.21), vigorous PA (VPA; r=-0.18), and moderate-to-vigorous PA (MVPA; r=-0.28). No significant correlations with other foot regions (p>0.05) were observed. In girls, forefoot peak plantar pressure was correlated with MPA (r =-0.30), VPA (r=-0.39) and MVPA (r=-0.38). Also, heel peak pressure was significantly correlated with MPA (r=-0.33), while no significant correlations with VPA (r=0.05) and MVPA (r=-0.15) were observed. Conclusion: This study shows that different intensities of PA were mostly correlated with forefoot peak plantar pressure in both boys and girls. Therefore, strategies that reduce plantar pressure through a more active lifestyle should be implemented within the education system.

Keywords: pedobarography, youth, exercise, associations

Procedia PDF Downloads 74
38629 Relationshiop Between Occupants' Behaviour And Indoor Air Quality In Malaysian Public Hospital Outpatient Department

Authors: Farha Ibrahim, Ely Zarina Samsudin, Ahmad Razali Ishak, Jeyanthini Sathasivam

Abstract:

Introduction: Indoor air quality (IAQ) has recently gained substantial traction as the airborne transmission of infectious respiratory disease has become an increasing public health concern. Public hospital outpatient department (OPD). IAQ warrants special consideration as it is the most visited department in which patients and staff are all directly impacted by poor IAQ. However, there is limited evidence on IAQ in these settings. Moreover, occupants’ behavior like occupant’s movement and operation of door, windows and appliances, have been shown to significantly affect IAQ, yet the influence of these determinants on IAQ in such settings have not been established. Objectives: This study aims to examine IAQ in Malaysian public hospitals OPD and assess its relationships with occupants’ behavior. Methodology: A multicenter cross-sectional study in which stratified random sampling of Johor public hospitals OPD (n=6) according to building age was conducted. IAQ measurements include indoor air temperature, relative humidity (RH), air velocity (AV), carbon dioxide (CO2), total bacterial count (TBC) and total fungal count (TFC). Occupants’ behaviors in Malaysian public hospital OPD are assessed using observation forms, and results were analyzed. Descriptive statistics were performed to characterize all study variables, whereas non-parametric Spearman Rank correlation analysis was used to assess the correlation between IAQ and occupants’ behavior. Results: After adjusting for potential cofounder, the study has suggested that occupants’ movement in new building, like seated quietly, is significantly correlated with AV in new building (r 0.642, p-value 0.010), CO2 in new (r 0.772, p-value <0.001) and old building (r -0.559, p-value 0.020), TBC in new (r 0.747, p-value 0.001) and old building (r -0.559, p-value 0.020), and TFC in new (r 0.777, p-value <0.001) and old building (r -0.485, p-value 0.049). In addition, standing relaxed movement is correlated with indoor air temperature (r 0.823, p-value <0.001) in new building, CO2 (r 0.559, p-value 0.020), TBC (r 0.559, p-value 0.020), and TFC (r -0.485, p-value 0.049) in old building, while walking is correlated with AV in new building (r -0.642, p-value 0.001), CO2 in new (r -0.772, p-value <0.001) and old building (r 0.559, p-value 0.020), TBC in new (r -0.747, p-value 0.001) and old building (r 0.559, p-value 0.020), and TFC in old building (r -0.485, p-value 0.049). The indoor air temperature is significantly correlated with number of doors kept opened (r 0.522, p-value 0.046), frequency of door adjustments (r 0.753, p-value 0.001), number of windows kept opened (r 0.522, p-value 0.046), number of air-conditioned (AC) switched on (r 0.698, p-value 0.004) and frequency of AC adjustment (r 0.753, p-value 0.001) in new hospital OPD building. AV is found to be significantly correlated with number of doors kept opened (r 0.642, p-value 0.01), frequency of door adjustments (r 0.553, p-value 0.032), number of windows kept opened (r 0.642, p-value 0.01), and frequency of AC adjustment, number of fans switched on, and frequency of fans adjustment(all with r 0.553, p-value 0.032) in new building. In old hospital OPD building, the number of doors kept opened is significantly correlated with CO₂, TBC (both r -0.559, p-value 0.020) and TFC (r -0.495, p-value 0.049), frequency of door adjustment is significantly correlated with CO₂, TBC (both r-0.559, p-value 0.020) and TFC (r -0.495, p-value 0.049), number of windows kept opened is significantly correlated with CO₂, TBC (both r 0.559, p-value 0.020) and TFC (r 0.495, p-value 0.049), frequency of window adjustment is significantly correlated with CO₂,TBC (both r -0.559, p-value 0.020) and TFC (r -0.495, p-value 0.049), number of AC switched on is significantly correlated with CO₂, TBC (both r -0.559, p-value 0.020) and TFC (r -0.495, p-value 0.049),, frequency of AC adjustment is significantly correlated with CO2 (r 0.559, p-value 0.020), TBC (0.559, p-value 0.020) and TFC (r -0.495, p-value 0.049), number of fans switched on is significantly correlated with CO2, TBC (both r 0.559, p-value 0.020) and TFC (r 0.495, p-value 0.049), and frequency of fans adjustment is significantly correlated with CO2, TBC (both r -0.559, p-value 0.020) and TFC (r -0.495, p-value 0.049). Conclusion: This study provided evidence on IAQ parameters in Malaysian public hospitals OPD and significant factors that may be effective targets of prospective intervention, thus enabling stakeholders to develop appropriate policies and programs to mitigate IAQ issues in Malaysian public hospitals OPD.

Keywords: outpatient department, iaq, occupants practice, public hospital

Procedia PDF Downloads 62
38628 Towards Visual Personality Questionnaires Based on Deep Learning and Social Media

Authors: Pau Rodriguez, Jordi Gonzalez, Josep M. Gonfaus, Xavier Roca

Abstract:

Image sharing in social networks has increased exponentially in the past years. Officially, there are 600 million Instagrammers uploading around 100 million photos and videos per day. Consequently, there is a need for developing new tools to understand the content expressed in shared images, which will greatly benefit social media communication and will enable broad and promising applications in education, advertisement, entertainment, and also psychology. Following these trends, our work aims to take advantage of the existing relationship between text and personality, already demonstrated by multiple researchers, so that we can prove that there exists a relationship between images and personality as well. To achieve this goal, we consider that images posted on social networks are typically conditioned on specific words, or hashtags, therefore any relationship between text and personality can also be observed with those posted images. Our proposal makes use of the most recent image understanding models based on neural networks to process the vast amount of data generated by social users to determine those images most correlated with personality traits. The final aim is to train a weakly-supervised image-based model for personality assessment that can be used even when textual data is not available, which is an increasing trend. The procedure is described next: we explore the images directly publicly shared by users based on those accompanying texts or hashtags most strongly related to personality traits as described by the OCEAN model. These images will be used for personality prediction since they have the potential to convey more complex ideas, concepts, and emotions. As a result, the use of images in personality questionnaires will provide a deeper understanding of respondents than through words alone. In other words, from the images posted with specific tags, we train a deep learning model based on neural networks, that learns to extract a personality representation from a picture and use it to automatically find the personality that best explains such a picture. Subsequently, a deep neural network model is learned from thousands of images associated with hashtags correlated to OCEAN traits. We then analyze the network activations to identify those pictures that maximally activate the neurons: the most characteristic visual features per personality trait will thus emerge since the filters of the convolutional layers of the neural model are learned to be optimally activated depending on each personality trait. For example, among the pictures that maximally activate the high Openness trait, we can see pictures of books, the moon, and the sky. For high Conscientiousness, most of the images are photographs of food, especially healthy food. The high Extraversion output is mostly activated by pictures of a lot of people. In high Agreeableness images, we mostly see flower pictures. Lastly, in the Neuroticism trait, we observe that the high score is maximally activated by animal pets like cats or dogs. In summary, despite the huge intra-class and inter-class variabilities of the images associated to each OCEAN traits, we found that there are consistencies between visual patterns of those images whose hashtags are most correlated to each trait.

Keywords: emotions and effects of mood, social impact theory in social psychology, social influence, social structure and social networks

Procedia PDF Downloads 164
38627 Evaluation of Colour Perception in Different Correlated Colour Temperature of LED Lighting

Authors: Saadet Akbay, Ayşe Nihan Avcı

Abstract:

The perception of colour is a subjective experience which depends on age, gender, race, cultural and educational backgrounds, etc. of an individual. However, colour perception is also affected by the correlated colour temperature (CCT) of a light source which is considered as one of the most fundamental quantitative lighting characteristics. This study focuses on evaluating colour perception in different CCT of light emitting diodes (LED) lighting. The aim is to compare the inherent colours with the perceived colours under two CCT of ‘warm’ (2700K), and ‘cool’ (4000K) LED lights and to understand how different CTT affect the perception of a colour. Analysis and specifications of colour attributes are made with Natural Colour System (NCS) which is an international colour communication system. The outcome of the study reveals the possible tendencies for perceived colours under different illuminance levels of LED lighting.

Keywords: colour perception, correlated colour temperature, inherent and perceived colour, LED lighting, natural colour system (NCS)

Procedia PDF Downloads 237