Search results for: association measures
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 5482

Search results for: association measures

5482 Semi-Automatic Method to Assist Expert for Association Rules Validation

Authors: Amdouni Hamida, Gammoudi Mohamed Mohsen

Abstract:

In order to help the expert to validate association rules extracted from data, some quality measures are proposed in the literature. We distinguish two categories: objective and subjective measures. The first one depends on a fixed threshold and on data quality from which the rules are extracted. The second one consists on providing to the expert some tools in the objective to explore and visualize rules during the evaluation step. However, the number of extracted rules to validate remains high. Thus, the manually mining rules task is very hard. To solve this problem, we propose, in this paper, a semi-automatic method to assist the expert during the association rule's validation. Our method uses rule-based classification as follow: (i) We transform association rules into classification rules (classifiers), (ii) We use the generated classifiers for data classification. (iii) We visualize association rules with their quality classification to give an idea to the expert and to assist him during validation process.

Keywords: association rules, rule-based classification, classification quality, validation

Procedia PDF Downloads 405
5481 A Method for Reduction of Association Rules in Data Mining

Authors: Diego De Castro Rodrigues, Marcelo Lisboa Rocha, Daniela M. De Q. Trevisan, Marcos Dias Da Conceicao, Gabriel Rosa, Rommel M. Barbosa

Abstract:

The use of association rules algorithms within data mining is recognized as being of great value in the knowledge discovery in databases. Very often, the number of rules generated is high, sometimes even in databases with small volume, so the success in the analysis of results can be hampered by this quantity. The purpose of this research is to present a method for reducing the quantity of rules generated with association algorithms. Therefore, a computational algorithm was developed with the use of a Weka Application Programming Interface, which allows the execution of the method on different types of databases. After the development, tests were carried out on three types of databases: synthetic, model, and real. Efficient results were obtained in reducing the number of rules, where the worst case presented a gain of more than 50%, considering the concepts of support, confidence, and lift as measures. This study concluded that the proposed model is feasible and quite interesting, contributing to the analysis of the results of association rules generated from the use of algorithms.

Keywords: data mining, association rules, rules reduction, artificial intelligence

Procedia PDF Downloads 130
5480 Association among Trait Mindfulness, Leukocyte Telomere Length, and Psychological Symptoms in Singaporean Han Chinese

Authors: Shian-Ling Keng, Onn Siong Yim, Poh San Lai, Soo Chong Chew, Anne Chong, Richard Ebstein

Abstract:

Research has demonstrated a positive association between mindfulness meditation and physical health. Little work, however, has examined the association between trait mindfulness and leukocyte telomere length (LTL), an emerging marker of cellular aging. The present study aimed to examine whether facets of trait mindfulness are correlated with longer LTL in a Singaporean Han Chinese sample and whether these facets may mediate the association between psychological symptoms and LTL. 158 adults (mean age = 27.24 years) completed measures assessing trait mindfulness and psychological symptoms (i.e., depression and stress) and provided blood samples for analyses of LTL using qPCR. Multiple regression analyses were conducted to assess the association between facets of trait mindfulness and LTL. Bootstrapping-based mediational analyses were run to examine the role of trait mindfulness as a mediator of the association between psychological symptoms and LTL. Of five facets of trait mindfulness (describe, act with awareness, observe, nonreactivity, and nonjudging), nonreactivity was significantly associated with LTL, after controlling for the effects of age, gender, and education, β = .21, p = .006. Further, there was a trend for overall trait mindfulness, β = .15, p = .06, and nonjudging, β = .13, p = .095, to each predict longer LTL. Nonreactivity significantly mediated the association between depression and LTL, BCa 95% CI [-.004, -.0004], p=.03, as well as the association between stress and LTL, BCa 95% CI [-.004, -.0004], p=.04. The results provide preliminary evidence for a positive association between selected facets of trait mindfulness and slower cellular aging, indexed by LTL. The findings suggest that individuals who are high on equanimity may experience slower aging at the cellular level, presumably through engaging in more effective coping mechanisms and modulation of stress. The findings also highlight the role of nonreactivity as a potential mechanism that underlies the association between LTL and psychological symptoms.

Keywords: depression, mindfulness, stress, telomere length

Procedia PDF Downloads 314
5479 Clustering of Association Rules of ISIS & Al-Qaeda Based on Similarity Measures

Authors: Tamanna Goyal, Divya Bansal, Sanjeev Sofat

Abstract:

In world-threatening terrorist attacks, where early detection, distinction, and prediction are effective diagnosis techniques and for functionally accurate and precise analysis of terrorism data, there are so many data mining & statistical approaches to assure accuracy. The computational extraction of derived patterns is a non-trivial task which comprises specific domain discovery by means of sophisticated algorithm design and analysis. This paper proposes an approach for similarity extraction by obtaining the useful attributes from the available datasets of terrorist attacks and then applying feature selection technique based on the statistical impurity measures followed by clustering techniques on the basis of similarity measures. On the basis of degree of participation of attributes in the rules, the associative dependencies between the attacks are analyzed. Consequently, to compute the similarity among the discovered rules, we applied a weighted similarity measure. Finally, the rules are grouped by applying using hierarchical clustering. We have applied it to an open source dataset to determine the usability and efficiency of our technique, and a literature search is also accomplished to support the efficiency and accuracy of our results.

Keywords: association rules, clustering, similarity measure, statistical approaches

Procedia PDF Downloads 291
5478 Analysis of the Simulation Merger and Economic Benefit of Local Farmers' Associations in Taiwan

Authors: Lu Yung-Hsiang, Chang Kuming, Dai Yi-Fang, Liao Ching-Yi

Abstract:

According to Taiwan’s administrative division of future land planning may lead farmer association and service areas facing recombination or merger. Thus, merger combination and the economic benefit of the farmer association are worth to be discussed. The farmer association in the merger, which may cause some then will not be consolidated, or consolidate two, or ever more to one association. However, under what condition to merge is greatest, as one of observation of this study. In addition, research without using simulation methods and only on the credit department rather whole farmer association. Therefore, this paper will use the simulation approach, and examine both the merge of farmer association and the condition under which the benefits are the greatest. The data of this study set include 266 farmer associations in Taiwan period 2012 to 2013. Empirical results showed that the number of the farmer association optimal simulation combination is 108.After the merger from the first stage can be reduced by 60% of the farmers’ association. The cost saving effects of the post-merger is not different. The cost efficiency of the farmers’ association improved it. The economies of scale and scope would decrease by the merger. The research paper hopes the finding will benefit the future merger of the farmers’ association.

Keywords: simulation merger, farmer association, assurance region, data envelopment analysis

Procedia PDF Downloads 326
5477 Operational Measures for Greenhouse Gas Reduction from Ships

Authors: Gorana Jelic Mrcelic

Abstract:

In order to reduce greenhouse gas emissions from ships, technical and operational measures can be used. Operational measures are easier and cheaper compared to technical measures, so are well recommended. One of the most cost-effective operational measure is fuel consumption. Fuel consumption can be reduced by various options but it sometimes needs investments in new equipment, new procedures and crew education. In order to implement operational measures in everyday procedures and routines on board, good understanding of the mechanisms by which these measures work is essential for the seamen.

Keywords: green shipping, gas emission reduction, operational measures, seamen

Procedia PDF Downloads 481
5476 Empirical Exploration of Correlations between Software Design Measures: A Replication Study

Authors: Jehad Al Dallal

Abstract:

Software engineers apply different measures to quantify the quality of software design. These measures consider artifacts developed at low or high level software design phases. The results are used to point to design weaknesses and to indicate design points that have to be restructured. Understanding the relationship among the quality measures and among the design quality aspects considered by these measures is important to interpreting the impact of a measure for a quality aspect on other potentially related aspects. In addition, exploring the relationship between quality measures helps to explain the impact of different quality measures on external quality aspects, such as reliability and maintainability. In this paper, we report a replication study that empirically explores the correlation between six well known and commonly applied design quality measures. These measures consider several quality aspects, including complexity, cohesion, coupling, and inheritance. The results indicate that inheritance measures are weakly correlated to other measures, whereas complexity, coupling, and cohesion measures are mostly strongly correlated.  

Keywords: quality attribute, quality measure, software design quality, Spearman correlation

Procedia PDF Downloads 253
5475 Association Rules Mining Task Using Metaheuristics: Review

Authors: Abir Derouiche, Abdesslem Layeb

Abstract:

Association Rule Mining (ARM) is one of the most popular data mining tasks and it is widely used in various areas. The search for association rules is an NP-complete problem that is why metaheuristics have been widely used to solve it. The present paper presents the ARM as an optimization problem and surveys the proposed approaches in the literature based on metaheuristics.

Keywords: Optimization, Metaheuristics, Data Mining, Association rules Mining

Procedia PDF Downloads 134
5474 More Precise: Patient-Reported Outcomes after Stroke

Authors: Amber Elyse Corrigan, Alexander Smith, Anna Pennington, Ben Carter, Jonathan Hewitt

Abstract:

Background and Purpose: Morbidity secondary to stroke is highly heterogeneous, but it is important to both patients and clinicians in post-stroke management and adjustment to life after stroke. The consideration of post-stroke morbidity clinically and from the patient perspective has been poorly measured. The patient-reported outcome measures (PROs) in morbidity assessment help improve this knowledge gap. The primary aim of this study was to consider the association between PRO outcomes and stroke predictors. Methods: A multicenter prospective cohort study assessed 549 stroke patients at 19 hospital sites across England and Wales during 2019. Following a stroke event, demographic, clinical, and PRO measures were collected. Prevalence of morbidity within PRO measures was calculated with associated 95% confidence intervals. Predictors of domain outcome were calculated using a multilevel generalized linear model. Associated P -values and 95% confidence intervals are reported. Results: Data were collected from 549 participants, 317 men (57.7%) and 232 women (42.3%) with ages ranging from 25 to 97 (mean 72.7). PRO morbidity was high post-stroke; 93.2% of the cohort report post-stroke PRO morbidity. Previous stroke, diabetes, and gender are associated with worse patient-reported outcomes across both the physical and cognitive domains. Conclusions: This large-scale multicenter cohort study illustrates the high proportion of morbidity in PRO measures. Further, we demonstrate key predictors of adverse outcomes (Diabetes, previous stroke, and gender) congruence with clinical predictors. The PRO has been demonstrated to be an informative and useful stroke when considering patient-reported outcomes and has wider implications for considerations of PROs in clinical management. Future longitudinal follow-up with PROs is needed to consider association of long-term morbidity.

Keywords: morbidity, patient-reported outcome, PRO, stroke

Procedia PDF Downloads 104
5473 An Approach for Association Rules Ranking

Authors: Rihab Idoudi, Karim Saheb Ettabaa, Basel Solaiman, Kamel Hamrouni

Abstract:

Medical association rules induction is used to discover useful correlations between pertinent concepts from large medical databases. Nevertheless, ARs algorithms produce huge amount of delivered rules and do not guarantee the usefulness and interestingness of the generated knowledge. To overcome this drawback, we propose an ontology based interestingness measure for ARs ranking. According to domain expert, the goal of the use of ARs is to discover implicit relationships between items of different categories such as ‘clinical features and disorders’, ‘clinical features and radiological observations’, etc. That’s to say, the itemsets which are composed of ‘similar’ items are uninteresting. Therefore, the dissimilarity between the rule’s items can be used to judge the interestingness of association rules; the more different are the items, the more interesting the rule is. In this paper, we design a distinct approach for ranking semantically interesting association rules involving the use of an ontology knowledge mining approach. The basic idea is to organize the ontology’s concepts into a hierarchical structure of conceptual clusters of targeted subjects, where each cluster encapsulates ‘similar’ concepts suggesting a specific category of the domain knowledge. The interestingness of association rules is, then, defined as the dissimilarity between corresponding clusters. That is to say, the further are the clusters of the items in the AR, the more interesting the rule is. We apply the method in our domain of interest – mammographic domain- using an existing mammographic ontology called Mammo with the goal of deriving interesting rules from past experiences, to discover implicit relationships between concepts modeling the domain.

Keywords: association rule, conceptual clusters, interestingness measures, ontology knowledge mining, ranking

Procedia PDF Downloads 299
5472 Association Between Swallowing Disorders and Cognitive Disorders in Adults: Systematic Review and Metaanalysis

Authors: Shiva Ebrahimian Dehaghani, Afsaneh Doosti, Morteza Zare

Abstract:

Background: There is no consensus regarding the association between dysphagia and cognition. Purpose: The aim of this study was to quantitatively and qualitatively analyze the available evidence on the direction and strength of association between dysphagia and cognition. Methodology: PubMed, Scopus, Embase and Web of Science were searched about the association between dysphagia and cognition. A random-effects model was used to determine weighted odds ratios (OR) and 95% confidence intervals (CI). Sensitivity analysis was performed to determine the impact of each individual study on the pooled results. Results: A total of 1427 participants showed that some cognitive disorders were significantly associated with dysphagia (OR = 3.23; 95% CI, 2.33–4.48). Conclusion: The association between cognition and swallowing disorders suggests that multiple neuroanatomical systems are involved in these two functions.

Keywords: adult, association, cognitive impairment, dysphagia, systematic review

Procedia PDF Downloads 128
5471 Agglomerative Hierarchical Clustering Using the Tθ Family of Similarity Measures

Authors: Salima Kouici, Abdelkader Khelladi

Abstract:

In this work, we begin with the presentation of the Tθ family of usual similarity measures concerning multidimensional binary data. Subsequently, some properties of these measures are proposed. Finally, the impact of the use of different inter-elements measures on the results of the Agglomerative Hierarchical Clustering Methods is studied.

Keywords: binary data, similarity measure, Tθ measures, agglomerative hierarchical clustering

Procedia PDF Downloads 445
5470 Data Stream Association Rule Mining with Cloud Computing

Authors: B. Suraj Aravind, M. H. M. Krishna Prasad

Abstract:

There exist emerging applications of data streams that require association rule mining, such as network traffic monitoring, web click streams analysis, sensor data, data from satellites etc. Data streams typically arrive continuously in high speed with huge amount and changing data distribution. This raises new issues that need to be considered when developing association rule mining techniques for stream data. This paper proposes to introduce an improved data stream association rule mining algorithm by eliminating the limitation of resources. For this, the concept of cloud computing is used. Inclusion of this may lead to additional unknown problems which needs further research.

Keywords: data stream, association rule mining, cloud computing, frequent itemsets

Procedia PDF Downloads 470
5469 Conformational Switch of hRAGE upon Self-Association

Authors: Ikhlas Ahmed, Jamillah Zamoon

Abstract:

The human receptor for advanced glycation end product is a plasma membrane receptor with an intrinsically disordered region. The protein consists of three extracellular domains, a single membrane spanning transmembrane domain, and a cytosolic domain which is intrinsically disordered and responsible for signaling. The disordered nature of the cytosolic domain allows it to be dynamic in solution. This receptor self-associates to higher forms. The association is triggered by ligand, metal or by the extracellular domain. Fluorescence spectroscopy technique is used to test the self-association of the different concentrations of the cytosolic domain. This work has concluded that the cytosolic domain of this receptor also self-associates. Moreover, the self-association does not require ligand or metal.

Keywords: fluorescence spectroscopy, hRAGE, IDP, Self-association

Procedia PDF Downloads 338
5468 A Holistic Approach to Institutional Cyber Security

Authors: Mehmet Kargaci

Abstract:

It is more important to access information than to get the correct information and to transform it to the knowledge in a proper way. Every person, organizations or governments who have the knowledge now become the target. Cyber security involves the range of measures to be taken from individual to the national level. The National institutions refer to academic, military and major public and private institutions, which are very important for the national security. Thus they need further cyber security measures. It appears that the traditional cyber security measures in the national level are alone not sufficient, while the individual measures remain in a restricted level. It is evaluated that the most appropriate method for preventing the cyber vulnerabilities rather than existing measures are to develop institutional measures. This study examines the cyber security measures to be taken, especially in the national institutions.

Keywords: cyber defence, information, critical infrastructure, security

Procedia PDF Downloads 503
5467 A Systematic Review for the Association between Active Smoking and Latent Tuberculosis Infection

Authors: Pui Hong Chung, Wing Chi Ho, Jun Li, Cyrus Leung, Ek Yeoh

Abstract:

Background: Cigarette smoking is associated with poor tuberculosis (TB) outcomes in terms of progression of active TB, relapse of TB and TB-related mortality, but the association with latent tuberculosis infection (LTBI) is unclear. The systematic review conducted aimed at studying the association between active smoking and LTBI, and likelihood of dose-response relationship. Methods: Two independent reviewers searched three electronic databases comprising PudMed, Medline by EBSCOHOST, ExcerptaMedica Database (EMBASE), from inception up to 31st Dec 2015 for studies reporting data on current smoking and the LTBI with tuberculin skin test (TST) or interferon-γ release assays (IGRAs) results, comparing the odds ratios (ORs) of outcome measure of TST or IGRAs among current smokers with 95% confidence intervals (CI). Results: Seven studies were identified, including six cross-sectional studies and one longitudinal cohort study. The outcome measures from three studies were in TST, three studies in IGRAs and one for both tests. For TST, OR ranging from 1.39 to 3.40 (95% CI) with all studies shown positive association between cigarette smoking and LTBI. For IGRAs, OR ranging from 0.47 to 1.89 (95% CI) with one study shown the negative association that might be related to impaired interferon-gamma production in immunosuppressive persons. One identified study demonstrated positive dose-response relationship in TST result. Conclusions: Cigarette smoking is likely to be a risk factor of LTBI. There is the important implication for TB and tobacco control program to halt TB by empowering public health policy. Further study is also needed to provide more evidence of the dose-response model/relationship.

Keywords: latent tuberculosis infection, systematic review, active smoking, model

Procedia PDF Downloads 225
5466 On an Approach for Rule Generation in Association Rule Mining

Authors: B. Chandra

Abstract:

In Association Rule Mining, much attention has been paid for developing algorithms for large (frequent/closed/maximal) itemsets but very little attention has been paid to improve the performance of rule generation algorithms. Rule generation is an important part of Association Rule Mining. In this paper, a novel approach named NARG (Association Rule using Antecedent Support) has been proposed for rule generation that uses memory resident data structure named FCET (Frequent Closed Enumeration Tree) to find frequent/closed itemsets. In addition, the computational speed of NARG is enhanced by giving importance to the rules that have lower antecedent support. Comparative performance evaluation of NARG with fast association rule mining algorithm for rule generation has been done on synthetic datasets and real life datasets (taken from UCI Machine Learning Repository). Performance analysis shows that NARG is computationally faster in comparison to the existing algorithms for rule generation.

Keywords: knowledge discovery, association rule mining, antecedent support, rule generation

Procedia PDF Downloads 292
5465 Assessment of Association Between Microalbuminuria and Lung Function Test Among the Community of Jimma Town

Authors: Diriba Dereje

Abstract:

Background: Cardiac and renal disease are the most prevalent chronic non-communicable diseases (CNCD) affecting the community in a significant manner. The best and recommended method in halting CNCD is by working on prevention as early as possible. This is only possible if early surrogate markers are identified. As part of the stated solution, this study will identify an association between microalbuminuria (an early surrogate marker of renal and cardiac disease) and lung function test among adult in the community. Objective: The main aim of this study was to assess an association between microalbuminuria (an early surrogate marker of renal and cardiac disease) and lung function test among adult in the community. Methodology: Community based cross sectional study was conducted among 384 adult in Jimma town. A systematic sampling technique was used in selecting participants to the study. In searching for the possible association, binary and multivariate logistic regression and t-test was conducted. Finally, the association between microalbuminuria and lung function test was well stated in the form of figures and written description. Result and Conclusion: A significant association was found between microalbuminuria and different lung function test parameters.

Keywords: microalbuminuria, lung function, association, test

Procedia PDF Downloads 163
5464 Proposal of a Model Supporting Decision-Making on Information Security Risk Treatment

Authors: Ritsuko Kawasaki, Takeshi Hiromatsu

Abstract:

Management is required to understand all information security risks within an organization, and to make decisions on which information security risks should be treated in what level by allocating how much amount of cost. However, such decision-making is not usually easy, because various measures for risk treatment must be selected with the suitable application levels. In addition, some measures may have objectives conflicting with each other. It also makes the selection difficult. Therefore, this paper provides a model which supports the selection of measures by applying multi-objective analysis to find an optimal solution. Additionally, a list of measures is also provided to make the selection easier and more effective without any leakage of measures.

Keywords: information security risk treatment, selection of risk measures, risk acceptance, multi-objective optimization

Procedia PDF Downloads 348
5463 Genome-Wide Association Study Identify COL2A1 as a Susceptibility Gene for the Hand Development Failure of Kashin-Beck Disease

Authors: Feng Zhang

Abstract:

Kashin-Beck disease (KBD) is a chronic osteochondropathy. The mechanism of hand growth and development failure of KBD remains elusive now. In this study, we conducted a two-stage genome-wide association study (GWAS) of palmar length-width ratio (LWR) of KBD, totally involving 493 Chinese Han KBD patients. Affymetrix Genome Wide Human SNP Array 6.0 was applied for SNP genotyping. Association analysis was conducted by PLINK software. Imputation analysis was performed by IMPUTE against the reference panel of the 1000 genome project. In the GWAS, the most significant association was observed between palmar LWR and rs2071358 of COL2A1 gene (P value = 4.68×10-8). Imputation analysis identified 3 SNPs surrounding rs2071358 with significant or suggestive association signals. Replication study observed additional significant association signals at both rs2071358 (P value = 0.017) and rs4760608 (P value = 0.002) of COL2A1 gene after Bonferroni correction. Our results suggest that COL2A1 gene was a novel susceptibility gene involved in the growth and development failure of hand of KBD.

Keywords: Kashin-Beck disease, genome-wide association study, COL2A1, hand

Procedia PDF Downloads 181
5462 The Global Relationship between the Prevalence of Diabetes Mellitus and Incidence of Tuberculosis: 2000-2012

Authors: Alaa Badawi, Suzan Sayegh, Mohamed Sallam, Eman Sadoun, Mohamed Al-Thani, Muhammad W. Alam, Paul Arora

Abstract:

Background: The dual burden of tuberculosis (TB) and diabetes mellitus (DM) has increased over the past decade with DM prevalence increasing in countries already afflicted with a high burden of TB. The coexistence of the two conditions presents a serious threat to global public health. Objective: The present study examines the global relationship between the prevalence of DM and the incidence of TB to evaluate their coexistence worldwide and their contribution to one another. Methods: This is an ecological longitudinal study covering the period between years 2000 to 2012. We utilized data from the WHO and World Bank sources and International Diabetes Federation to estimate prevalence of DM (%) and the incidence of TB (per 100,000). Measures of central tendency and dispersion as well as the harmonic mean and linear regression were used for different WHO regions. The association between DM prevalence and TB incidence was examined by quartile of DM prevalence. Results: The worldwide average (±S.D.) prevalence of DM within the study period was 6.6±3.8% whereas TB incidence was 135.0±190.5 per 100,000. DM prevalence was highest in the Eastern Mediterranean (8.3±4.1) and West Pacific (8.2±5.6) regions and lowest in the Africa (3.5±2.6). TB incidence was highest in Africa (313.1±275.9 per 100,000) and South-East Asia (216.7±124.9) and lowest in the European (46.5±68.6) and American (47.2±52.9) regions. Only countries with high DM prevalence (>7.6%) showed a significant positive association with TB incidence (r=0.17, p=0.013). Conclusion: A positive association between DM and TB may exist in some – but not all – world regions, a dual burden that necessitates identifying the nature of this coexistence to assist in developing public health approaches that curb their rising burden.

Keywords: diabetes mellitus, tuberculosis, disease burden, global association

Procedia PDF Downloads 433
5461 A Review of Physiological Measures for Cognitive Workload Assessment of Aircrew

Authors: Naveed Tahir, Adnan Maqsood

Abstract:

Cognitive workload is a significant factor affecting user performance, and it has been broadly investigated for its application in ergonomics as well as in designing and optimizing effective human-machine interactions. It is mentally challenging to maneuver an aircraft, and pilots must control the aircraft and adequately communicate to the verbal-auditory stimuli. Several physiological measures have long been researched and used to demonstrate the cognitive workload. In our current study, we have summarized recent findings of the effectiveness, accuracy, and applicability of commonly used physiological measures in evaluating cognitive workload. We have also highlighted on the advancements in physiological measures. The strength and limitations of physiological measures have also been discussed to assess the cognitive workload of people, especially the aircrews in laboratory settings and real-time situations. We have presented the research findings of the physiological measures to base suggestions on the proper applications of the measures and settings demanding the use of single measure or their combinations.

Keywords: aircrew, cognitive workload, subjective measure, physiological measure, performance measure

Procedia PDF Downloads 126
5460 Association Rules Mining and NOSQL Oriented Document in Big Data

Authors: Sarra Senhadji, Imene Benzeguimi, Zohra Yagoub

Abstract:

Big Data represents the recent technology of manipulating voluminous and unstructured data sets over multiple sources. Therefore, NOSQL appears to handle the problem of unstructured data. Association rules mining is one of the popular techniques of data mining to extract hidden relationship from transactional databases. The algorithm for finding association dependencies is well-solved with Map Reduce. The goal of our work is to reduce the time of generating of frequent itemsets by using Map Reduce and NOSQL database oriented document. A comparative study is given to evaluate the performances of our algorithm with the classical algorithm Apriori.

Keywords: Apriori, Association rules mining, Big Data, Data Mining, Hadoop, MapReduce, MongoDB, NoSQL

Procedia PDF Downloads 133
5459 Predicting Groundwater Areas Using Data Mining Techniques: Groundwater in Jordan as Case Study

Authors: Faisal Aburub, Wael Hadi

Abstract:

Data mining is the process of extracting useful or hidden information from a large database. Extracted information can be used to discover relationships among features, where data objects are grouped according to logical relationships; or to predict unseen objects to one of the predefined groups. In this paper, we aim to investigate four well-known data mining algorithms in order to predict groundwater areas in Jordan. These algorithms are Support Vector Machines (SVMs), Naïve Bayes (NB), K-Nearest Neighbor (kNN) and Classification Based on Association Rule (CBA). The experimental results indicate that the SVMs algorithm outperformed other algorithms in terms of classification accuracy, precision and F1 evaluation measures using the datasets of groundwater areas that were collected from Jordanian Ministry of Water and Irrigation.

Keywords: classification, data mining, evaluation measures, groundwater

Procedia PDF Downloads 251
5458 Exploring Gender Bias in Self-Report Measures of Psychopathy

Authors: Katie Strong, Brian P. O'Connor, Jacqueline M. Kanippayoor

Abstract:

To date, self-report measures of psychopathy have largely been conceptualized with a male-focused understanding of the disorder, with the presumption that psychopathy expression is uniform across genders. However, generalizing this understanding to the female population may be misleading. The objective of this research was to explore gender differences in the expression of psychopathy and to assess current self-report psychopathy measures for gender bias. It was hypothesized that some items in commonly used measures of psychopathy may show gender bias and that existing measures may not contain enough items that are relevant to the manifestation of psychopathy in women. An exploratory investigation was conducted on statistical bias in common measures of psychopathy, and novel, relevant, but previously neglected items and measures were included in a new data collection. The participant pool included a sample of 403 university students and 354 participants recruited using Amazon Mechanical Turk. Item Response Theory methods - including Differential Item Functioning - were used to assess for the item- and test- level bias across several common self-report measures of psychopathy. Analyses indicated occasional and modest levels of item-level bias, and that some additional female-relevant items merit consideration for inclusion in measures of psychopathy. These findings suggest that current self-report measures of psychopathy may be demonstrating gender-bias and warrant further examination.

Keywords: gender, measurement bias, personality, psychopathy

Procedia PDF Downloads 224
5457 An Optimized Association Rule Mining Algorithm

Authors: Archana Singh, Jyoti Agarwal, Ajay Rana

Abstract:

Data Mining is an efficient technology to discover patterns in large databases. Association Rule Mining techniques are used to find the correlation between the various item sets in a database, and this co-relation between various item sets are used in decision making and pattern analysis. In recent years, the problem of finding association rules from large datasets has been proposed by many researchers. Various research papers on association rule mining (ARM) are studied and analyzed first to understand the existing algorithms. Apriori algorithm is the basic ARM algorithm, but it requires so many database scans. In DIC algorithm, less amount of database scan is needed but complex data structure lattice is used. The main focus of this paper is to propose a new optimized algorithm (Friendly Algorithm) and compare its performance with the existing algorithms A data set is used to find out frequent itemsets and association rules with the help of existing and proposed (Friendly Algorithm) and it has been observed that the proposed algorithm also finds all the frequent itemsets and essential association rules from databases as compared to existing algorithms in less amount of database scan. In the proposed algorithm, an optimized data structure is used i.e. Graph and Adjacency Matrix.

Keywords: association rules, data mining, dynamic item set counting, FP-growth, friendly algorithm, graph

Procedia PDF Downloads 389
5456 Expert-Based Validated Measures for Improving Quality Healthcare Services Utilization among Elderly Persons: A Cross-Section Survey

Authors: Uchenna Cosmas Ugwu, Osmond Chukwuemeka Ene

Abstract:

Globally, older adults are considered the most vulnerable groups to age-related diseases including diabetes mellitus, obesity, cardiovascular diseases, cancer and osteoporosis. With improved access to quality healthcare services, these complications can be prevented and the incidence rates reduced to the least occurrence. The aim of this study is to validate appropriate measures for improving quality healthcare services utilization among elderly persons in Nigeria and also to determine the significant association within demographic variables. A cross-sectional survey research design was adopted. Using a convenient sampling technique, a total of 400 experts (150 registered nurses and 250 public health professionals) with minimum of doctoral degree qualification were sampled and studied. A structured instrument titled “Expert-Based Healthcare Services Utilization Questionnaire (EBHSUQ) with .83 reliability index was used for data collection. All the statistical data analysis was completed using frequency counts, percentage scores and chi-square statistics. The results were significant at p≤0.05. It was found that quality healthcare services utilization by elderly persons in Nigeria would be improved if the services are: available (83%), affordable (82%), accessible (79%), suitable (77%), acceptable (77%), continuous (75%) and stress-free (75%). Statistically, significant association existed on quality healthcare services utilization with gender (p=.03<.05) and age (p=.01<.05) while none was observed on work experience (p=.23>.05), marital status (p=.11>.05) and employment category (p=.09>.05). To improve quality healthcare services utilization for elderly persons in Nigeria, the adoption of appropriate measures by Nigerian government and professionals in healthcare sectors are paramount. Therefore, there is need for collaborative efforts by the Nigerian government and healthcare professionals geared towards educating the general public through mass sensitization, awareness campaign, conferences, seminars and workshops for the importance of accessing healthcare services.

Keywords: elderly persons, healthcare services, cross-sectional survey research design, utilization.

Procedia PDF Downloads 29
5455 Traffic Calming Measures at Rural Roads in Dhofar

Authors: Mohammed Bakhit Kashoob, Mohammed Salim Al-Maashani, Ahmed Abdullah Al-Marhoon

Abstract:

Traffic calming measures are different design features or strategies used to reduce the speed of a traveling vehicle on a particular road. These calming measures are common on rural roads of Oman. Some of these measures are road speed limits, vertical deflections, horizontal deflections, and road signs. In general, vertical deflections such as rumble strips, road studs (cat’s eye), speed tables, and speed humps are widely used. In this paper, as vehicle speeding is a major cause of road traffic crashes and high fatalities in Oman, the effectiveness of existing traffic calming measures at current locations on rural roads is assessed. The study was conducted on the rural roads of Dhofar Governorate, which is located in the south of Oman. A special focus is given to the calming measures implemented on the mountain roads of Dhofar. It is shown that vertical deflection calming measures are effective in reducing vehicle speed to 20 to 40 kph, depending on the vertical deflection type and spacing. Calming measures are also proposed at locations with a high probability of traffic crashes based on the number of traffic crashes at these locations, road type, and road geometry.

Keywords: road safety, rural roads, speed, traffic calming measures, traffic crash

Procedia PDF Downloads 86
5454 Applying the Quad Model to Estimate the Implicit Self-Esteem of Patients with Depressive Disorders: Comparing the Psychometric Properties with the Implicit Association Test Effect

Authors: Yi-Tung Lin

Abstract:

Researchers commonly assess implicit self-esteem with the Implicit Association Test (IAT). The IAT’s measure, often referred to as the IAT effect, indicates the strengths of automatic preferences for the self relative to others, which is often considered an index of implicit self-esteem. However, based on the Dual-process theory, the IAT does not rely entirely on the automatic process; it is also influenced by a controlled process. The present study, therefore, analyzed the IAT data with the Quad model, separating four processes on the IAT performance: the likelihood that automatic association is activated by the stimulus in the trial (AC); that a correct response is discriminated in the trial (D); that the automatic bias is overcome in favor of a deliberate response (OB); and that when the association is not activated, and the individual fails to discriminate a correct answer, there is a guessing or response bias drives the response (G). The AC and G processes are automatic, while the D and OB processes are controlled. The AC parameter is considered as the strength of the association activated by the stimulus, which reflects what implicit measures of social cognition aim to assess. The stronger the automatic association between self and positive valence, the more likely it will be activated by a relevant stimulus. Therefore, the AC parameter was used as the index of implicit self-esteem in the present study. Meanwhile, the relationship between implicit self-esteem and depression is not fully investigated. In the cognitive theory of depression, it is assumed that the negative self-schema is crucial in depression. Based on this point of view, implicit self-esteem would be negatively associated with depression. However, the results among empirical studies are inconsistent. The aims of the present study were to examine the psychometric properties of the AC (i.e., test-retest reliability and its correlations with explicit self-esteem and depression) and compare it with that of the IAT effect. The present study had 105 patients with depressive disorders completing the Rosenberg Self-Esteem Scale, Beck Depression Inventory-II and the IAT on the pretest. After at least 3 weeks, the participants completed the second IAT. The data were analyzed by the latent-trait multinomial processing tree model (latent-trait MPT) with the TreeBUGS package in R. The result showed that the latent-trait MPT had a satisfactory model fit. The effect size of test-retest reliability of the AC and the IAT effect were medium (r = .43, p < .0001) and small (r = .29, p < .01) respectively. Only the AC showed a significant correlation with explicit self-esteem (r = .19, p < .05). Neither of the two indexes was correlated with depression. Collectively, the AC parameter was a satisfactory index of implicit self-esteem compared with the IAT effect. Also, the present study supported the results that implicit self-esteem was not correlated with depression.

Keywords: cognitive modeling, implicit association test, implicit self-esteem, quad model

Procedia PDF Downloads 91
5453 The Association between Corporate Social Responsibility Disclosure, Assurance, and Tax Aggressiveness: Evidence from Indonesia

Authors: Eko Budi Santoso

Abstract:

There is a growing interest in Corporate Social Responsibility (CSR) issues in developing countries such as Indonesia. Firms disclose their CSR activities, and some provide assurance to gain recognition as socially responsible firms. However, several of those socially responsible firms involve in tax scandals and raise a question of whether CSR disclosure is used to disguise firm misconduct or as a reflection of socially responsible firms. Specifically, whether firms engage in CSR disclosure and its assurance also responsible for their tax matters. This study examines the association between CSR disclosure and tax aggressiveness and the role of sustainability reporting assurance to the association. This research develops a modified index according to global reporting initiatives to measure CSR disclosure and various measurement for tax aggressiveness. Using a sample of Indonesian go public companies issued CSR disclosure, the empirical result shows that there is an association between CSR disclosure and tax aggressiveness. In addition, results also indicate sustainability reporting assurance moderate those association. The findings suggest that stakeholder in developing countries should examine carefully firms with active CSR disclosure before label it as socially responsible firms. JEL Classification: M14

Keywords: CSR disclosure, tax aggressiveness, assurance, business ethics

Procedia PDF Downloads 113