Search results for: oil palm tree census
394 Predictive Analytics of Student Performance Determinants
Authors: Mahtab Davari, Charles Edward Okon, Somayeh Aghanavesi
Abstract:
Every institute of learning is usually interested in the performance of enrolled students. The level of these performances determines the approach an institute of study may adopt in rendering academic services. The focus of this paper is to evaluate students' academic performance in given courses of study using machine learning methods. This study evaluated various supervised machine learning classification algorithms such as Logistic Regression (LR), Support Vector Machine, Random Forest, Decision Tree, K-Nearest Neighbors, Linear Discriminant Analysis, and Quadratic Discriminant Analysis, using selected features to predict study performance. The accuracy, precision, recall, and F1 score obtained from a 5-Fold Cross-Validation were used to determine the best classification algorithm to predict students’ performances. SVM (using a linear kernel), LDA, and LR were identified as the best-performing machine learning methods. Also, using the LR model, this study identified students' educational habits such as reading and paying attention in class as strong determinants for a student to have an above-average performance. Other important features include the academic history of the student and work. Demographic factors such as age, gender, high school graduation, etc., had no significant effect on a student's performance.Keywords: student performance, supervised machine learning, classification, cross-validation, prediction
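The evaluation procedure described in this abstract (5-fold cross-validation scored by accuracy, precision, recall and F1) can be illustrated with a minimal sketch. The function names `k_fold_indices` and `binary_metrics` are hypothetical, the folds are contiguous and unshuffled for simplicity, and labels are assumed to be binary (1 = above-average performance); none of this is taken from the paper itself.

```python
from typing import Dict, List, Tuple


def k_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Return k (train, test) index pairs; every sample is tested exactly once."""
    # Spread the remainder over the first folds so fold sizes differ by at most one.
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in sizes:
        stop = start + size
        test = list(range(start, stop))
        train = [i for i in range(n_samples) if i < start or i >= stop]
        folds.append((train, test))
        start = stop
    return folds


def binary_metrics(y_true: List[int], y_pred: List[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall and F1 for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    accuracy = (tp + tn) / len(y_true)
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}
```

In practice, each classifier would be fitted on the `train` indices of a fold, scored on the `test` indices with `binary_metrics`, and the per-fold scores averaged before comparing algorithms.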
Procedia PDF Downloads 128
393 Enhancing Precision Agriculture through Object Detection Algorithms: A Study of YOLOv5 and YOLOv8 in Detecting Armillaria spp.
Authors: Christos Chaschatzis, Chrysoula Karaiskou, Pantelis Angelidis, Sotirios K. Goudos, Igor Kotsiuba, Panagiotis Sarigiannidis
Abstract:
Over the past few decades, the rapid growth of the global population has led to the need to increase agricultural production and improve the quality of agricultural goods. There is a growing focus on environmentally eco-friendly solutions, sustainable production, and biologically minimally fertilized products in contemporary society. Precision agriculture has the potential to incorporate a wide range of innovative solutions with the development of machine learning algorithms. YOLOv5 and YOLOv8 are two of the most advanced object detection algorithms capable of accurately recognizing objects in real time. Detecting tree diseases is crucial for improving the food production rate and ensuring sustainability. This research aims to evaluate the efficacy of YOLOv5 and YOLOv8 in detecting the symptoms of Armillaria spp. in sweet cherry trees and determining their health status, with the goal of enhancing the robustness of precision agriculture. Additionally, this study will explore Computer Vision (CV) techniques with machine learning algorithms to improve the detection process’s efficiency.Keywords: Armillaria spp., machine learning, precision agriculture, smart farming, sweet cherries trees, YOLOv5, YOLOv8
Procedia PDF Downloads 115
392 Impact of Rapid Urbanization on Health Sector in India
Authors: Madhvi Bhayani
Abstract:
Introduction: Due to the rapid pace of urbanization, the urban health issues have become one of the significant threats to future development in India. It also poses serious repercussions on the citizen’s health. As urbanization in India is increasing at an unprecedented rate and it has generated the urban health crisis among the city dwellers especially the urban poor. The increasing proportion of the urban poor and vulnerable to the health indicators worse than the rural counterparts, they face social and financial barriers in accessing healthcare services and these conditions make human health at risk. The Local as well as the State and National governments are alike tackling with the challenges of urbanization as it has become very essential for the government to provide the basic necessities and better infrastructure that make life in cities safe and healthy. Thus, the paper argues that if no major realistic steps are taken with immediate effect, the citizens will face a huge burden of health hazards. Aim: This paper attempts to analyze the current infrastructure, government planning, and its future policy, it also discusses the challenges and outcomes of urbanization on health and its impact on it and it will also predict the future trend with regard to disease burden in the urban areas. Methods: The paper analyzes on the basis of the secondary data by taking into consideration the connection between the Rapid Urbanization and Public Health Challenges, health and health care system and its services delivery to the citizens especially to the urban poor. Extensive analyses of government census reports, health information and policy, the government health-related schemes, urban development and based on the past trends, the future status of urban infrastructure and health outcomes are predicted. 
Socio-economic and political dimensions are also considered from regional, national and global perspectives, and are incorporated in the paper to make realistic predictions for the future. Findings and Conclusion: The findings show that India suffers from a double burden: a rapid increase in diseases, together with growing health inequalities and disparities in health outcomes. Existing tools for the governance of urban health fall short of providing better health care services. Collaboration and communication among the state, national and local governments, and with non-governmental partners, need to be strengthened. Based on the findings, policy implications are described and areas for future research are defined.
Keywords: health care, urbanization, urban health, service delivery
Procedia PDF Downloads 211
391 Physical and Mechanical Behavior of Compressed Earth Blocks Stabilized with Ca(OH)2 on Sub-Humid Warm Weather
Authors: D. Castillo T., Luis F. Jimenez
Abstract:
The compressed earth blocks (CEBs) constitute an alternative as a constructive element for building homes in regions with high levels of poverty and marginalization. Such is the case of Southeastern Mexico, where the population, predominantly indigene, build their houses with feeble materials like wood and palm, vulnerable to extreme weather in the area, because they do not have the financial resources to acquire concrete blocks. There are several advantages that can provide BTCs compared to traditional vibro-compressed concrete blocks, such as the availability of materials, low manufacturing cost and reduced CO2 emissions to the atmosphere for not be subjected to a burning process. However, to improve its mechanical properties and resistance to adverse weather conditions in terms of humidity and temperature of the sub-humid climate zones, it requires the use of a chemical stabilizer; in this case we chose Ca(OH)2. The stabilization method Eades-Grim was employed, according to ASTM C977-03. This method measures the optimum amount of lime required to stabilize the soil, increasing the pH to 12.4 or higher. The minimum amount of lime required in this experiment was 1% and the maximum was 10%. The employed material was clay unconsolidated low to medium plasticity (CL type according to the Unified Soil Classification System). Based on these results, the CEBs manufacturing process was determined. The obtained blocks were from 10x15x30 cm using a mixture of soil, water and lime in different proportions. Later these blocks were put to dry outdoors and subjected to several physical and mechanical tests, such as compressive strength, absorption and drying shrinkage. The results were compared with the limits established by the Mexican Standard NMX-C-404-ONNCCE-2005 for the construction of housing walls. 
In this manner, an alternative and sustainable material was obtained for the construction of rural households in the region, with better security conditions, comfort and cost.
Keywords: calcium hydroxide, chemical stabilization, compressed earth blocks, sub-humid warm weather
Procedia PDF Downloads 402
390 Associated Factors of Hypertension, Hypercholesterolemia and Double Burden Hypertension-Hypercholesterolemia in Patients With Congestive Heart Failure: Hospital Based Study
Authors: Pierre Mintom, William Djeukeu Asongni, Michelle Moni, William Dakam, Christine Fernande Nyangono Biyegue.
Abstract:
Background: In order to prevent congestive heart failure, control of hypertension and hypercholesterolemia is necessary because those risk factors frequently occur in combination. Objective: The aim of the study is to determine the prevalence and risk factors of hypertension, hypercholesterolemia and double burden HTA-Hypercholesterolemia in patients with congestive heart failure. Methodology: A database of 98 patients suffering from congestive heart failure was used. The latter were recruited from August 15, 2017, to March 5, 2018, in the Cardiology department of Deido District Hospital of Douala. This database provides information on sociodemographic parameters, biochemical examinations, characteristics of heart failure and food consumption. ESC/ESH and NCEP-ATPIII definitions were used to define Hypercholesterolemia (total cholesterol ≥200mg/dl), Hypertension (SBP≥140mmHg and/or DBP≥90mmHg). Double burden hypertension-hypercholesterolemia was defined as follows: total cholesterol (CT)≥200mg/dl, SBP≥140mmHg and DBP≥90mmHg. Results: The prevalence of hypertension (HTA), hypercholesterolemia (hyperchol) and double burden HTA-Hyperchol were 61.2%, 66.3% and 45.9%, respectively. No sociodemographic factor was associated with hypertension, hypercholesterolemia and double burden, but Male gender was significantly associated (p<0.05) with hypercholesterolemia. HypoHDLemia significantly increased hypercholesterolemia and the double burden by 19.664 times (p=0.001) and 14.968 times (p=0.021), respectively. Regarding dietary habits, the consumption of rice, peanuts and derivatives and cottonseed oil respectively significantly (p<0.05) exposed to the occurrence of hypertension. The consumption of tomatoes, green bananas, corn and derivatives, peanuts and derivatives and cottonseed oil significantly exposed (p<0.05) to the occurrence of hypercholesterolemia. 
The consumption of palm oil and cottonseed oil exposed patients to the occurrence of the double burden of hypertension-hypercholesterolemia. Consumption of eggs protects against hypercholesterolemia, and consumption of peanuts and tomatoes protects against the double burden. Conclusion: Hypercholesterolemia associated with hypertension appears as a complicating factor of congestive heart failure. Key risk factors are mainly diet-based, suggesting the importance of nutritional education for patients. New management protocols emphasizing diet should be considered.
Keywords: risk factors, hypertension, hypercholesterolemia, congestive heart failure
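The case definitions given in this abstract's methodology can be written out as a short sketch. The thresholds are the ones quoted in the abstract; the function names `classify_patient` and `prevalence` and the record layout are hypothetical. Note that, as worded in the abstract, the double-burden criterion requires both the SBP and the DBP threshold to be met, which is stricter than the "and/or" used for hypertension alone; the sketch follows that wording literally.

```python
from typing import Dict, List


def classify_patient(total_cholesterol_mg_dl: float,
                     sbp_mmhg: float,
                     dbp_mmhg: float) -> Dict[str, bool]:
    """Apply the abstract's thresholds to one patient record."""
    return {
        # Hypertension: SBP >= 140 mmHg and/or DBP >= 90 mmHg.
        "hypertension": sbp_mmhg >= 140 or dbp_mmhg >= 90,
        # Hypercholesterolemia: total cholesterol >= 200 mg/dl.
        "hypercholesterolemia": total_cholesterol_mg_dl >= 200,
        # Double burden, as worded in the abstract: all three thresholds met.
        "double_burden": (total_cholesterol_mg_dl >= 200
                          and sbp_mmhg >= 140
                          and dbp_mmhg >= 90),
    }


def prevalence(flags: List[Dict[str, bool]], condition: str) -> float:
    """Percentage of patients for whom the given condition flag is True."""
    return 100.0 * sum(1 for f in flags if f[condition]) / len(flags)
```

Applying `classify_patient` to every record and passing the results to `prevalence` gives percentages of the kind reported in the results.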
Procedia PDF Downloads 69
389 Predicting Relative Performance of Sector Exchange Traded Funds Using Machine Learning
Abstract:
Machine learning has been used in many areas today. It thrives at reviewing large volumes of data and identifying patterns and trends that might not be apparent to a human. Given the huge potential benefit and the amount of data available in the financial market, it is not surprising to see machine learning applied to various financial products. While future prices of financial securities are extremely difficult to forecast, we study them from a different angle. Instead of trying to forecast future prices, we apply machine learning algorithms to predict the direction of future price movement, in particular, whether a sector Exchange Traded Fund (ETF) would outperform or underperform the market in the next week or in the next month. We apply several machine learning algorithms for this prediction. The algorithms are Linear Discriminant Analysis (LDA), k-Nearest Neighbors (KNN), Decision Tree (DT), Gaussian Naive Bayes (GNB), and Neural Networks (NN). We show that these machine learning algorithms, most notably GNB and NN, have some predictive power in forecasting out-performance and under-performance out of sample. We also try to explore whether it is possible to utilize the predictions from these algorithms to outperform the buy-and-hold strategy of the S&P 500 index. The trading strategy to explore out-performance predictions does not perform very well, but the trading strategy to explore under-performance predictions can earn higher returns than simply holding the S&P 500 index out of sample.Keywords: machine learning, ETF prediction, dynamic trading, asset allocation
Procedia PDF Downloads 100
388 Effect of Antioxidant-Rich Nutraceutical on Serum Glucose, Lipid Profile and Oxidative Stress Markers of Salt-Induced Metabolic Syndrome in Rats
Authors: Nura Lawal, Lawal Suleiman Bilbis, Rabiu Aliyu Umar, Anas A. Sabir
Abstract:
Metabolic syndrome (MS) a high-risk condition involving obesity, dyslipidemia, hypertension, and diabetes mellitus is prevalent in Nigeria. The study aims to formulate an antioxidant-rich nutraceutical from locally available foodstuff (onion, garlic, ginger, tomato, lemon, palm oil, watermelon seeds) and investigate their effects on blood pressure, body weight, serum glucose, lipid profile, insulin and oxidative stress markers in salt-induced rats. The rats were placed on 8% salt diet for 6 weeks and then supplementation and treatment with nutraceutical and nifedipine in the presence of salt diet for additional 4 weeks. Feeding rats with salt diet for 6 weeks increased blood pressure and body weight of the salt-loaded rats relative to control. Significant (P < 0.001) increase in serum blood glucose and lipid profile, and the decrease in high-density lipoprotein-cholesterol (HDL-C) was observed in salt-loaded rats as compared with control. Both supplementation and treatment (nifedipine) lowered the blood pressure but the only supplementation lowered the body weight. Supplementation with nutraceutical resulted in significant (P < 0.001) decrease in the serum blood glucose, lipid profile, malonyldialdehyde (MDA), insulin levels, insulin resistance, and increased HDL-C and antioxidant indices. The percentage protection against atherogenesis was 76.5±2.13%. There is strong positive correlation between blood pressure, body weight and serum blood glucose, lipid profile, markers of oxidative stress and strong negative correlation with HDL-C and antioxidant status. The results suggest that the nutraceuticals are useful in reversing most of the component of metabolic syndrome and might be beneficial in the treatment of patients with metabolic syndrome.Keywords: metabolic syndrome, hypertension, diabetes mallitus, obesity
Procedia PDF Downloads 250
387 Assessment the Impact of Changes in Cultivation Pattern from Grape to Apple on Drying up of Urmia Lake
Authors: Nasser Karami
Abstract:
Urmia grapes have been famous for centuries and have been among the most desirable for wine production. Interestingly, evidence shows that the Urmia region was the first place in the world where wine was produced and consumed. In fact, the grapes known as “Shiraz”, made popular by “Shiraz Wine”, are cultivated as a local species, especially in the West Azerbaijan watershed basin, and exported to Europe. After the Islamic Revolution, however, because the production, use, and sale of wine were unlawful under Islamic rule, it was decided to cultivate apples instead of grapes. Before the Islamic Revolution, about 50 percent of the gardens produced grapes, while apple groves took up less than 1.5 percent (100 hectares). Three years after the revolution, in 1982, people were swept up in the revolutionary excitement and grape cultivation decreased to less than 10 percent of the garden area. Importantly, an apple tree needs 12 times more water than a grapevine, so in terms of water usage it is as if the agricultural area had increased not by 2 or 4 times but by 12 times. This study showed that, contrary to official reports, climate change is not the major cause of the drying up of Urmia Lake: 65 percent of this environmental crisis is due to the spread of unsustainable agriculture in the lake's basin.
Keywords: cultivation pattern, unsustainable agriculture, Urmia Lake drying, water management
Procedia PDF Downloads 344
386 Assessment of Hygroscopic Characteristics of Hevea brasiliensis Wood
Authors: John Tosin Aladejana
Abstract:
Wood behaves differently under different environmental conditions. Knowledge of the hygroscopic nature of wood is therefore a key factor in selecting wood for use and in choosing the required treatment. This study assessed the hygroscopic behaviour of Hevea brasiliensis (rubber) wood. Void volume, volumetric swelling in the tangential, radial and longitudinal directions, and volumetric shrinkage were used to assess the response of the wood when losing or taking up moisture. Hevea brasiliensis wood samples of 20 × 20 × 60 mm, taken longitudinally and transversely, were used for the study and dried in the oven at 103 ± 2 °C. The mean moisture content of green Hevea brasiliensis wood was 49.74%, 51.14% and 54.36% for the top, middle and bottom portions, respectively, while 51.77%, 50.02% and 53.45% were recorded for the outer, middle and inner portions of the tree, respectively. The values obtained for volumetric shrinkage and swelling indicated that both were higher at the top part of H. brasiliensis. Longitudinal shrinkage was negligible, while the tangential direction showed the highest shrinkage among the wood directions. The void volume was 43.0%, 39.0% and 38.0% at the top, middle and bottom, respectively. The results clarify the wood density of Hevea brasiliensis according to the position and portion of the wood, and also reveal the variation in moisture content, void volume, volumetric shrinkage and swelling. This will inform the drying of Hevea brasiliensis wood to ensure better wood quality devoid of defects.
Keywords: moisture content, shrinkage, swelling, void volume
Procedia PDF Downloads 275
385 Water Supply and Demand Analysis for Ranchi City under Climate Change Using Water Evaluation and Planning System Model
Authors: Pappu Kumar, Ajai Singh, Anshuman Singh
Abstract:
There are different water user sectors such as rural, urban, mining, subsistence and commercial irrigated agriculture, commercial forestry, industry, power generation which are present in the catchment in Subarnarekha River Basin and Ranchi city. There is an inequity issue in the access to water. The development of the rural area, construction of new power generation plants, along with the population growth, the requirement of unmet water demand and the consideration of environmental flows, the revitalization of small-scale irrigation schemes is going to increase the water demands in almost all the water-stressed catchment. The WEAP Model was developed by the Stockholm Environment Institute (SEI) to enable evaluation of planning and management issues associated with water resources development. The WEAP model can be used for both urban and rural areas and can address a wide range of issues including sectoral demand analyses, water conservation, water rights and allocation priorities, river flow simulation, reservoir operation, ecosystem requirements and project cost-benefit analyses. This model is a tool for integrated water resource management and planning like, forecasting water demand, supply, inflows, outflows, water use, reuse, water quality, priority areas and Hydropower generation, In the present study, efforts have been made to access the utility of the WEAP model for water supply and demand analysis for Ranchi city. A detailed works have been carried out and it was tried to ascertain that the WEAP model used for generating different scenario of water requirement, which could help for the future planning of water. The water supplied to Ranchi city was mostly contributed by our study river, Hatiya reservoir and ground water. Data was collected from various agencies like PHE Ranchi, census data of 2011, Doranda reservoir and meteorology department etc. This collected and generated data was given as input to the WEAP model. 
The model generated trends for the discharge of the study river up to 2050 and, at the same time, generated scenarios of future demand and supply. The model outputs predict a water requirement of 12 million litres. The results will help in drafting future policies on water supply and demand under changing climatic scenarios.
Keywords: WEAP model, water demand analysis, Ranchi, scenarios
Procedia PDF Downloads 419
384 Detecting Venomous Files in IDS Using an Approach Based on Data Mining Algorithm
Authors: Sukhleen Kaur
Abstract:
The Intrusion Detection System (IDS) has become an important component of security infrastructure and has received increasing attention in recent years. An IDS is an effective way to detect different kinds of attacks and malicious code in a network and helps to secure the network. Data mining techniques can be applied to an IDS to analyse large amounts of data and give better results. Data mining can also improve intrusion detection by adding a level of focus to anomaly detection. Previous studies have concentrated on finding attacks, whereas this paper detects malicious files. Some intruders do not attack directly; instead they hide harmful code inside files, or corrupt those files, and attack the system that way. These files are detected according to defined parameters, which produce two lists of files: normal files and harmful files. Data mining is then performed. In this paper, a hybrid classifier based on the Naive Bayes and Ripper classification methods is used. The results show how a file uploaded to the database is tested against the parameters and characterised as either a normal or a harmful file, after which the mining is performed. Moreover, when a user tries to mine a harmful file, the system raises an exception stating that mining cannot be performed on corrupted or harmful files.
Keywords: data mining, association, classification, clustering, decision tree, intrusion detection system, misuse detection, anomaly detection, naive Bayes, ripper
Procedia PDF Downloads 414
383 A Study of Life Expectancy in an Urban Set up of North-Eastern India under Dynamic Consideration Incorporating Cause Specific Mortality
Authors: Mompi Sharma, Labananda Choudhury, Anjana M. Saikia
Abstract:
Background: The period life table is entirely based on the assumption that the mortality patterns of the population existing in the given period will persist throughout their lives. However, it has been observed that the mortality rate continues to decline. As such, if the rates of change of probabilities of death are considered in a life table then we get a dynamic life table. Although, mortality has been declining in all parts of India, one may be interested to know whether these declines had appeared more in an urban area of underdeveloped regions like North-Eastern India. So, attempt has been made to know the mortality pattern and the life expectancy under dynamic scenario in Guwahati, the biggest city of North Eastern India. Further, if the probabilities of death changes then there is a possibility that its different constituent probabilities will also change. Since cardiovascular disease (CVD) is the leading cause of death in Guwahati. Therefore, an attempt has also been made to formulate dynamic cause specific death ratio and probabilities of death due to CVD. Objectives: To construct dynamic life table for Guwahati for the year 2011 based on the rates of change of probabilities of death over the previous 10 and 25 years (i.e.,2001 and 1986) and to compute corresponding dynamic cause specific death ratio and probabilities of death due to CVD. Methodology and Data: The study uses the method proposed by Denton and Spencer (2011) to construct dynamic life table for Guwahati. So, the data from the Office of the Birth and Death, Guwahati Municipal Corporation for the years 1986, 2001 and 2011 are taken. The population based data are taken from 2001 and 2011 census (India). However, the population data for 1986 has been estimated. 
Also, the cause of death ratio and probabilities of death due to CVD are computed for the aforementioned years and then extended to dynamic set up for the year 2011 by considering the rates of change of those probabilities over the previous 10 and 25 years. Findings: The dynamic life expectancy at birth (LEB) for Guwahati is found to be higher than the corresponding values in the period table by 3.28 (5.65) years for males and 8.30 (6.37) years for females during the period of 10 (25) years. The life expectancies under dynamic consideration in all the other age groups are also seen higher than the usual life expectancies, which may be possible due to gradual decline in probabilities of death since 1986-2011. Further, a continuous decline has also been observed in death ratio due to CVD along with cause specific probabilities of death for both sexes. As a consequence, dynamic cause of death probability due to CVD is found to be less in comparison to usual procedure. Conclusion: Since incorporation of changing mortality rates in period life table for Guwahati resulted in higher life expectancies and lower probabilities of death due to CVD, this would possibly bring out the real situation of deaths prevailing in the city.Keywords: cause specific death ratio, cause specific probabilities of death, dynamic, life expectancy
Procedia PDF Downloads 232
382 Evaluation of Genetic Diversity Through RAPD Markers Among Melia azedarach L (Chinabery)
Authors: Nadir Ali Rind, Özlem Aksoy, Muhammad Umar Dahot, Salih Dikilitaş, Muhammad Rafiq, Burçak Tütünoğlu
Abstract:
Melia azedarach L. is freshly fruited small to medium sized tree native to China and North western India. It is growing in Pakistan and Turkey in various areas facing great environmental changes to maintain its survival. The species is valued for its high quality wood, medicinal, ornamental and shade purposes. The present work was aimed to estimate the genetic variation among the populations of Melia azedarach L. leaf samples that were collected from five different locations of Turkey and three different areas of Pakistan. These populations were chosen on the random bases by applying RAPD primers in order to construct a dendogram using UPGMA method to show genetic diversity. After that appropriate conservation strategies were suggested. 14 primers producing polymorphic and monomorphic bands were analyzed. Genetic distances were calculated for all the species studied by RAPD-PCR methods. According to the results the lowest genetic identity values and the highest genetic polymorphic values were determined. It is observed that there was a clear split among populations from different areas in Turkey and Pakistan. These differences may be due to eco-geographical association with genetic variation and should be conserved to retain the genetic variation of the species.Keywords: melia azedarach L., genetic diversity, conservation, RAPD-PCR, medicinal plant
Procedia PDF Downloads 465
381 Viral Metagenomics Revealed a Novel Cardiovirus in Feces of Wild Rats
Authors: Asif Mahmood, Shama Shama, Hao Ni, Hao Wang, Yu Ling, Hui Xu, Shixing Yang, Qais Ahmad Naseer, Wen Zhang
Abstract:
Cardiovirus is a genus of viruses belonging to the family Picornaviridae. Here, we used viral metagenomic techniques to detect the viral nucleic acid in the fecal samples from wild rats in Zhenjiang city in China. Fecal samples were collected from 20 wild rats and pooled into four sample pools and then subjected to libraries construction which were then sequenced on Illumina MiSeq platform. The sequenced reads were analyzed using viral metagenomic analysis pipeline. A novel cardiovirus from feces of a wild rat was identified, named amzj-2018, of which the complete genome was acquired. Phylogenetic analysis based on the complete amino acid sequence of polyprotein revealed that amzj-2018 formed a separate branch located between clusters of Saffold virus and Rat Theilovirus 1 (RTV-1). Phylogenetic analysis based on different regions of the polyproteins, including P1, P2, P3, and P2+P3, respectively, showed discordant trees, where the tree based on P3 region indicated that amzj-2018 clustered separately between Theiler's murine encephalomyelitis virus and RTV-1. The complete genome of a cardiovirus was determined from the feces of wild rats which belonged to a novel type of cardiovirus based on phylogenetic analysis. Whether it is associated with disease needs further investigation.Keywords: cardiovirus, viral metagenomics, genomic organization, phylogenetic analysis
Procedia PDF Downloads 23
380 Classification of Forest Types Using Remote Sensing and Self-Organizing Maps
Authors: Wanderson Goncalves e Goncalves, José Alberto Silva de Sá
Abstract:
Human actions are a threat to the balance and conservation of the Amazon forest. Therefore the environmental monitoring services play an important role as the preservation and maintenance of this environment. This study classified forest types using data from a forest inventory provided by the 'Florestal e da Biodiversidade do Estado do Pará' (IDEFLOR-BIO), located between the municipalities of Santarém, Juruti and Aveiro, in the state of Pará, Brazil, covering an area approximately of 600,000 hectares, Bands 3, 4 and 5 of the TM-Landsat satellite image, and Self - Organizing Maps. The information from the satellite images was extracted using QGIS software 2.8.1 Wien and was used as a database for training the neural network. The midpoints of each sample of forest inventory have been linked to images. Later the Digital Numbers of the pixels have been extracted, composing the database that fed the training process and testing of the classifier. The neural network was trained to classify two forest types: Rain Forest of Lowland Emerging Canopy (Dbe) and Rain Forest of Lowland Emerging Canopy plus Open with palm trees (Dbe + Abp) in the Mamuru Arapiuns glebes of Pará State, and the number of examples in the training data set was 400, 200 examples for each class (Dbe and Dbe + Abp), and the size of the test data set was 100, with 50 examples for each class (Dbe and Dbe + Abp). Therefore, total mass of data consisted of 500 examples. The classifier was compiled in Orange Data Mining 2.7 Software and was evaluated in terms of the confusion matrix indicators. The results of the classifier were considered satisfactory, and being obtained values of the global accuracy equal to 89% and Kappa coefficient equal to 78% and F1 score equal to 0,88. 
The efficiency of the classifier was also evaluated with the receiver operating characteristic (ROC) plot, which gave results close to the ideal, showing it to be a very good classifier and demonstrating the potential of this methodology to provide ecosystem services, particularly in anthropogenic areas in the Amazon.
Keywords: artificial neural network, computational intelligence, pattern recognition, unsupervised learning
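The global accuracy and Kappa coefficient reported in this abstract are both derived from the confusion matrix, and a minimal sketch shows how they relate. The function name `accuracy_and_kappa` and the example matrix are hypothetical: the abstract does not publish its confusion matrix, so the counts below are only one possible matrix consistent with a balanced test set of 50 examples per class and 89% accuracy.

```python
from typing import List, Tuple


def accuracy_and_kappa(matrix: List[List[int]]) -> Tuple[float, float]:
    """Return (global accuracy, Cohen's kappa) for a square confusion matrix.

    Rows are true classes and columns are predicted classes.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    # Observed agreement: share of examples on the diagonal.
    observed = sum(matrix[i][i] for i in range(n)) / total
    # Chance agreement: sum over classes of (row total * column total) / total^2.
    expected = sum(
        sum(matrix[i]) * sum(row[i] for row in matrix) for i in range(n)
    ) / (total * total)
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa


# Hypothetical counts (assumption): rows = true Dbe, true Dbe + Abp.
example_matrix = [[45, 5],
                  [6, 44]]
accuracy, kappa = accuracy_and_kappa(example_matrix)
```

With two equally sized true classes the chance agreement is 0.5 whatever the split of predictions, so an accuracy of 0.89 gives a kappa of (0.89 - 0.5) / 0.5 = 0.78, which agrees with the two figures quoted in the abstract.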
Procedia PDF Downloads 362
379 Recommender Systems Using Ensemble Techniques
Authors: Yeonjeong Lee, Kyoung-jae Kim, Youngtae Kim
Abstract:
This study proposes a novel recommender system that uses data mining and multi-model ensemble techniques to enhance the recommendation performance through reflecting the precise user’s preference. The proposed model consists of two steps. In the first step, this study uses logistic regression, decision trees, and artificial neural networks to predict customers who have high likelihood to purchase products in each product group. Then, this study combines the results of each predictor using the multi-model ensemble techniques such as bagging and bumping. In the second step, this study uses the market basket analysis to extract association rules for co-purchased products. Finally, the system selects customers who have high likelihood to purchase products in each product group and recommends proper products from same or different product groups to them through above two steps. We test the usability of the proposed system by using prototype and real-world transaction and profile data. In addition, we survey about user satisfaction for the recommended product list from the proposed system and the randomly selected product lists. The results also show that the proposed system may be useful in real-world online shopping store.Keywords: product recommender system, ensemble technique, association rules, decision tree, artificial neural networks
Procedia PDF Downloads 295
378 Comparison of Seismic Response for Two RC Curved Bridges with Different Column Shapes
Authors: Nina N. Serdar, Jelena R. Pejović
Abstract:
This paper presents a seismic risk assessment of two bridge structures, based on the probabilistic performance-based seismic assessment methodology. Both investigated bridges are three-span continuous RC curved bridges that differ in column shape. The first bridge (type A) has a wall-type pier and the second (type B) has a two-column bent with circular columns. The bridges are designed according to European standards EN 1991-2, EN 1992-1-1 and EN 1998-2. The aim of the analysis is to compare the seismic behavior of these two structures and to detect the influence of column shape on the seismic response. Seismic risk assessment is carried out by obtaining demand fragility curves. A non-linear model was constructed and time-history analysis was performed using thirty-five pairs of horizontal ground motions selected to match the site-specific hazard. In the performance-based analysis, peak column drift ratio (CDR) was selected as the engineering demand parameter (EDP), and spectral displacement was selected as the seismic intensity measure (IM). Demand fragility curves, which give the probability of exceeding a certain value of the chosen EDP, were constructed, and conclusions were drawn from them.
Keywords: RC curved bridge, demand fragility curve, wall type column, nonlinear time-history analysis, circular column
Procedia PDF Downloads 342
377 Measuring the Biomechanical Effects of Worker Skill Level and Joystick Crane Speed on Forestry Harvesting Performance Using a Simulator
Authors: Victoria L. Chester, Usha Kuruganti
Abstract:
The forest industry is a major economic sector of Canada and also one of the most dangerous industries for workers. The use of mechanized mobile forestry harvesting machines has successfully reduced the incidence of injuries related to manual labor among forest workers. However, these machines have also created additional concerns, including a steep machine operation learning curve, a longer workday, repetitive strain injury, cognitive load, physical and mental fatigue, and increased postural loads due to sitting in a confined space. It is critical to obtain objective performance data so that employers can develop appropriate work practices for this industry; however, ergonomic field studies of this industry are lacking, mainly because of the difficulty of obtaining comprehensive data while operators are cutting trees in the woods. The purpose of this study was to establish a measurement and experimental protocol to examine the effects of worker skill level and movement training speed (joystick crane speed) on harvesting performance using a forestry simulator. A custom wrist angle measurement device was developed as part of the study to monitor Euler angles during operation of the simulator. The device consisted of two accelerometers, a Bluetooth module, three 3V coin cells, a microcontroller, a voltage regulator and application software. Harvesting performance and crane data were provided by the simulator software and included tree to frame collisions, crane to tree collisions, boom tip distance, number of trees cut, etc. A pilot study of 3 operators with various skill levels was conducted to identify factors that distinguish highly skilled operators from novice or intermediate operators. Dependent variables such as reaction time, math skill, past work experience, training movement speed (e.g. joystick control speeds), harvesting experience level, muscle activity, and wrist biomechanics were measured and analyzed. A 10-channel wireless surface EMG system was used to monitor the amplitude and mean frequency of 10 upper extremity muscles before and after performance on the forestry harvesting simulator. The results of the pilot study showed inconsistent changes in median frequency pre- and post-operation, but there was an increase in the activity of the flexor carpi radialis, anterior deltoid and upper trapezius of both arms. The wrist sensor results indicated that wrist supination and pronation occurred more than flexion and extension, with radial-ulnar rotation demonstrating the least movement. Overall, wrist angular motion increased as the crane speed increased from slow to fast. Further data collection is needed and will help industry partners determine the factors that separate the skill levels of operators, identify optimal training speeds, and determine the length of training required to bring new operators to an efficient skill level effectively. In addition to effective employment training programs, the results of this work will be used for selective employee recruitment strategies to improve employee retention after training. Further, improved training procedures and knowledge of the physical and mental demands on workers will lead to highly trained and efficient personnel, reduced risk of injury, and optimal work protocols.
Keywords: EMG, forestry, human factors, wrist biomechanics
Procedia PDF Downloads 147
376 Ethnic-Racial Breakdown in Psychological Research among Latinx Populations in the U.S.
Authors: Madeline Phillips, Luis Mendez
Abstract:
The 21st century has seen an increase in the amount and variety of psychological research on Latinx, the largest minority group in the U.S., a population with great variability from the individual's cultural origin (e.g., ethnicity) to region (e.g., nationality). We were interested in exploring how scientists recruit, conduct and report research on Latinx samples. Ethnicity and race are important components of individuals and should be addressed to capture a broader and deeper understanding of psychological research findings. In order to explore Latinx/Hispanic work, the Journal of Latinx Psychology (JLP) and Hispanic Journal of Behavioral Sciences (HJBS) were analyzed for 1) measures of ethnicity and race in empirical studies, 2) nationalities represented, and 3) how researchers reported ethnic-racial demographics. The analysis included publications from 2013-2018 and revealed two common themes in the reporting of ethnicity and race: overrepresentation/underrepresentation and overgeneralization. There is currently no systematic way of reporting ethnicity and race in Latinx/Hispanic research, which leaves only a vague sense of what role ethnicity/race plays in the lives of participants and how. In addition, studies used the terms Hispanic and Latinx interchangeably and inconsistently across publications. For the purpose of this project, we were only interested in publications with Latinx samples in the U.S. Therefore, studies outside of the U.S. and non-empirical studies were excluded. JLP went from N = 118 articles to N = 94 and HJBS went from N = 174 to N = 154. For this project, we developed a coding rubric for ethnicity/race that reflected the different ways researchers reported ethnicity and race and was compatible with the U.S. census. We coded which ethnicity/race was identified as the largest ethnic group in each sample. We used the ethnic-racial breakdown numbers or percentages if provided. There were also studies that simply did not report the ethnic composition beyond Hispanic or Latinx. We found that in 80% of the samples, Mexicans are overrepresented compared to the population statistics of Latinx in the U.S. We examined all the ethnic-racial breakdowns, which demonstrated the overrepresentation of Mexican samples and the underrepresentation and/or lack of representation of certain ethnicities (e.g., Chilean, Guatemalan). Our results showed overgeneralization in studies that cluster their participants as Latinx/Hispanic: 23 in JLP and 63 in HJBS. The authors discuss the importance of transparency from researchers in reporting the context of the sample, including country, state, neighborhood, and demographic variables that are relevant to the goals of the project, except when there may be an issue of privacy and/or confidentiality involved. In addition, the authors discuss the importance of recognizing the variability within the Latinx population and how it is reflected in the scientific discourse.
Keywords: Latinx, Hispanic, race and ethnicity, diversity
Procedia PDF Downloads 114
375 Influences of Island Characteristics on Plant Community Structure of Farasan Archipelago, Saudi Arabia: Island Biogeography and Nested Pattern
Authors: Khalid Al Mutairi, Mashhor Mansor, Magdy El-Bana, Saud L. Al-Rowaily, Asyraf Mansor
Abstract:
The present study was carried out on 20 islands of the Farasan Archipelago in Saudi Arabia to describe the biogeographic patterns of plants. A total of 191 species belonging to 129 genera and 53 families were identified. In line with island biogeography theory, total plant species richness and the richness of ecological groups were positively influenced by island size, number of habitats, and elevation, and were not affected by isolation. The high level of nestedness, the strong effect of area on total plant species richness and ecological groups, and the similarity of vegetation composition on the islands have several implications for conservation. In conclusion, the large and richest islands in the Farasan Archipelago, such as Farasan Alkbir, would conserve higher diversity than several smaller islands. This island also includes rare habitats, such as coral rocks, and rare species. The invasion of unique habitats on this island, such as wadi channels and water catchments, by the exotic tree Prosopis juliflora should be managed to conserve the native biodiversity. The protection of such critical habitats is also very important on the other large islands (e.g. Zufaf), due to their limited distribution in the country.
Keywords: island biogeography, conservation, Farasan Archipelago, Saudi Arabia, plant diversity
Procedia PDF Downloads 349
374 The Role of Risk Attitudes and Networks on the Migration Decision: Empirical Evidence from the United States
Authors: Tamanna Rimi
Abstract:
A large body of literature has discussed the determinants of the migration decision. However, the potential role of individual risk attitudes in the migration decision has so far been overlooked. The migration literature has studied how the expected income differential influences migration flows for a risk-neutral individual. However, migration also takes place when there is no expected income differential, or even when the variability of income appears lower than in the current location. This migration puzzle motivates a recent trend in the literature that analyzes how attitudes towards risk influence the decision to migrate. However, the significance of risk attitudes for the migration decision has been addressed mostly from a theoretical perspective in the mainstream migration literature. The efficiency of the labor market and of the overall economy is largely influenced by migration in many countries. Therefore, attitudes towards risk as a determinant of migration should receive more attention in empirical studies. To the author's best knowledge, this is the first study to examine the relationship between relative risk aversion and the migration decision in the U.S. market. This paper considers movement across the United States as a means of migration. In addition, this paper explores the network effect on the migration decision due to the increasing size of one's own ethnic group in a source location, and how attitudes towards risk vary with the network effect. Two ethnic groups (i.e. Asian and Hispanic) have been considered in this regard. For the empirical estimation, this paper uses two sources of data: 1) U.S. census data for social, economic, and health research, 2010 (IPUMS) and 2) the University of Michigan Health and Retirement Study, 2010 (HRS). In order to measure relative risk aversion, this study uses the 'Two Sample Two-Stage Instrumental Variable (TS2SIV)' technique. This method is similar to the 'Two Sample Instrumental Variable (TSIV)' technique of Angrist (1990) and Angrist and Krueger (1992). Using a probit model, the empirical investigation yields the following results: (i) risk attitude has a significantly large impact on the migration decision, with more risk-averse people less likely to migrate; (ii) the impact of risk attitude on migration varies by other demographic characteristics such as age and sex; (iii) people with a higher concentration of same-ethnicity households living in a particular place are expected to migrate less from their current place; (iv) the effect of risk attitudes on migration varies with the network effect. The overall findings of this paper relating risk attitude, migration decision and network effect can be a significant contribution towards addressing the gap between migration theory and empirical study in the migration literature.
Keywords: migration, network effect, risk attitude, U.S. market
Procedia PDF Downloads 164
373 Short Answer Grading Using Multi-Context Features
Authors: S. Sharan Sundar, Nithish B. Moudhgalya, Nidhi Bhandari, Vineeth Vijayaraghavan
Abstract:
Automatic Short Answer Grading is one of the prime applications of artificial intelligence in education. Several approaches involving selective handcrafted features, graphical matching techniques, concept identification and mapping, complex deep frameworks, sentence embeddings, etc. have been explored over the years. However, keeping in mind the real-world application of the task, these solutions incur a slight overhead in computation and resources to achieve high performance. In this work, a simple and effective solution is proposed that uses elemental features based on statistical and linguistic properties and word-based similarity measures, in conjunction with tree-based classifiers and regressors. The results for the classification tasks show improvements ranging from 1% to 30%, while the regression task shows a stark improvement of 35%. The authors attribute these improvements to the addition of multiple similarity scores, which provide an ensemble of scoring criteria to the models. The authors also believe the work could reaffirm that classical natural language processing techniques and simple machine learning models can be used to achieve high results for short answer grading.
Keywords: artificial intelligence, intelligent systems, natural language processing, text mining
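As a rough illustration of what word-based similarity features between a reference answer and a student answer might look like, the sketch below computes three simple scores that could be fed to a tree-based model. The specific features (Jaccard overlap, bag-of-words cosine, length ratio), the function names, and the example sentences are assumptions made for illustration; the abstract does not list the features the authors actually used.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def similarity_features(reference, answer):
    """Return a dict of simple word-overlap features for one answer pair."""
    ref_tokens, ans_tokens = tokenize(reference), tokenize(answer)
    ref_set, ans_set = set(ref_tokens), set(ans_tokens)
    shared = ref_set & ans_set
    union = ref_set | ans_set

    # Jaccard overlap of the two vocabularies.
    jaccard = len(shared) / len(union) if union else 0.0

    # Cosine similarity of the raw term-count vectors.
    ref_counts, ans_counts = Counter(ref_tokens), Counter(ans_tokens)
    dot = sum(ref_counts[t] * ans_counts[t] for t in shared)
    norm = math.sqrt(sum(c * c for c in ref_counts.values())) * math.sqrt(
        sum(c * c for c in ans_counts.values())
    )
    cosine = dot / norm if norm else 0.0

    # Answer length relative to the reference, a crude statistical feature.
    length_ratio = len(ans_tokens) / len(ref_tokens) if ref_tokens else 0.0
    return {"jaccard": jaccard, "cosine": cosine, "length_ratio": length_ratio}


features = similarity_features(
    "Plants make food by photosynthesis",
    "Photosynthesis lets plants make food",
)
print(features)
```

Each answer pair yields one feature row; stacking several such scores is one way to give a classifier or regressor multiple scoring criteria at once.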
Procedia PDF Downloads 133
372 Development of a French to Yorùbá Machine Translation System
Authors: Benjamen Nathaniel, Eludiora Safiriyu Ijiyemi, Egume Oneme Lucky
Abstract:
A review of machine translation systems shows that many computational artefacts have been developed to translate written or spoken texts from a source language into Yorùbá. However, there is no work on a French to Yorùbá machine translation system; hence, this study investigated the process involved in translating French into its Yorùbá equivalent, with a view to adopting a rule-based MT approach to build a machine translation framework from simple sentences administered through a questionnaire. Articles and relevant textbooks were reviewed, and key speakers of both languages were interviewed, to find out the processes involved in translating simple French sentences into their Yorùbá equivalents using home domain terminologies. To achieve this, a model was formulated using phrase grammar structure, re-write rules, parse trees, and automata theory-based techniques; it was designed with the unified modeling language (UML) and implemented in the Python programming language. Analysis of the results showed that the machine translation system performed 18.45% above the Experimental Subject Respondent and 2.7% below the Linguistics Expert when assessed on word orthography, sentence syntax and semantic correctness of the sentences. When compared with the Google machine translation system, the developed system performed better on lexicons of the target language.
Keywords: machine translation (MT), rule-based, French language, Yorùbá language
Procedia PDF Downloads 78
371 Optrix: Energy Aware Cross Layer Routing Using Convex Optimization in Wireless Sensor Networks
Authors: Ali Shareef, Aliha Shareef, Yifeng Zhu
Abstract:
Energy minimization is of great importance in wireless sensor networks (WSNs) for extending battery lifetime. One of the key activities of nodes in a WSN is communication and the routing of their data to a centralized base-station or sink. Routing along the shortest path to the sink is not the best solution, since it causes nodes along this path to fail prematurely. We propose Optrix, a cross-layer energy-efficient routing protocol that utilizes a convex formulation to maximize the lifetime of the network as a whole. We further propose Optrix-BW, a novel convex formulation with a bandwidth constraint that allows channel conditions to be accounted for in routing. By considering this key channel parameter, we demonstrate that Optrix-BW is capable of congestion control. Optrix is implemented in TinyOS, and we demonstrate that a relatively large topology of 40 nodes can converge to within 91% of the optimal routing solution. We describe the pitfalls and issues related to using a continuous-form technique such as convex optimization with the discrete packet-based communication systems found in WSNs. We propose a routing controller mechanism that allows for this transformation. We compare Optrix against the Collection Tree Protocol (CTP) and find that Optrix performs better than CTP in terms of convergence to an optimal routing solution, load balancing, and network lifetime maximization.
Keywords: wireless sensor network, Energy Efficient Routing
Procedia PDF Downloads 393
370 Profiling, Antibacterial and Antioxidant Activity of Acacia decurrens (Willd), an Invasive South African Tree
Authors: Joe Modise, Bamidel Joseph Okoli, Nas Molefe, Imelda Ledwaba
Abstract:
The present study describes the chemical profile and antioxidant potential of the stem bark of Acacia decurrens. The methanol fraction of A. decurrens stem bark gave the highest yield (20%), while the hexane fraction had the lowest yield (0.2%). The GC-MS spectra of the hexane, chloroform and ethyl acetate fractions confirm the presence of fifty-two major compounds, and ICP-OES analysis showed that the stem bark contains Co (0.41), Zn (1.75), Mn (3.69), Ca (8.67), Ni (10.54), Mg (12.98), Cr (24.38), K (47.88) and Fe (154.62) ppm, which is an indication of hyper-accumulation capacity. The UV-Visible spectra showed four absorption maxima for the hexane fraction at 665 (0.028), 410 (0.116), 335 (0.278) and 250 (0.007) nm; three for the chloroform fraction at 665 (0.028), 335 (0.278) and 250 (0.007) nm; three for the ethyl acetate fraction at 665 (0.070), 390 (0.648) and 345 (0.663) nm; and three for the methanol fraction at 385 (0.508), 310 (0.886) and 295 (0.899) nm. Quantitative phytochemical screening indicated that the alkaloid (0.6-3.3%) and saponin (5.1-8.6%) contents of the various fractions were significantly lower than the tannin (30.9-55.8 mg TAE/g), steroid (13.92-41.2%), phenol (40.6-65.5 mg GAE/g) and flavonoid (210.2-284.9 mg RUE/g) contents. The antioxidant activity of the fractions was analysed by different methods and revealed good to moderate antioxidant potential, with IC50 values of 42.2-49.6 mg/mL for ABTS and 37.8-75.0 μg/mL for DPPH, compared to standard antioxidants. Based on the results obtained, the A. decurrens stem bark fractions can be a source of safe, sustainable natural antioxidant drugs and can be exploited as a controlled green heavy-metal cleaner.
Keywords: Acacia decurrens, antioxidant, DPPH, ABTS, hyperaccumulation, Menstruum, ICP-OES, GC-MS, UV/visible
Procedia PDF Downloads 327
369 Use of Carica papaya as a Bio-Sorbent for Removal of Heavy Metals in Wastewater
Authors: W. E. Igwegbe, B. C. Okoro, J. C. Osuagwu
Abstract:
The study was aimed at assessing the effectiveness of reducing the concentrations of heavy metals in wastewater using pawpaw (Carica papaya) wood as a bio-sorbent. The heavy metals considered include zinc, cadmium, lead, copper, iron, selenium, nickel, and manganese. The physicochemical properties of the Carica papaya stem were studied. The experimental sample was obtained from the felled trunk of a mature pawpaw tree. Wastewater for experimental use was prepared by dissolving in water soil samples collected from a dump site at Owerri, Imo State. The concentration of each metal remaining in solution as residual metal after bio-sorption was determined using an atomic absorption spectrometer. The effects of pH, contact time and initial heavy metal concentration were studied in a batch reactor. The results of the spectrometer test showed that different functional groups were detected in the Carica papaya stem biomass. Optimum bio-sorption occurred at pH 5.9 with 5 g/100 mL solution of bio-sorbent. The results of the study showed that the treated wastewater is fit for irrigation purposes based on the Canadian wastewater quality guideline standard for the protection of agriculture. This approach thus provides a cost-effective and environmentally friendly option for treating wastewater.
Keywords: biomass, bio-sorption, Carica papaya, heavy metal, wastewater
Procedia PDF Downloads 372
368 Comparison of Phenolic and Urushiol Contents of Different Parts of Rhus verniciflua and Their Antimicrobial Activity
Authors: Jae Young Jang, Jong Hoon Ahn, Jae-Woong Lim, So Young Kang, Mi Kyeong Lee
Abstract:
Rhus verniciflua is commonly known as the lacquer tree in Korea. Stem barks of R. verniciflua have been used as an immunostimulator in traditional medicine. They contain phenolic compounds and are known for diverse biological activities such as antioxidant and antimicrobial activity. However, they also cause allergic dermatitis due to urushiol derivatives. For the development of active natural resources with less toxicity, the contents of phenolic compounds and urushiols in different parts of R. verniciflua (stem barks, lignum and leaves) were quantitated by colorimetric assay and HPLC analysis. The urushiol content was highest in stem barks, followed by leaves. The lignum contained only a trace amount of urushiols. The phenolic content, however, was most abundant in lignum, followed by leaves and stem barks. These results clearly showed that the contents of urushiols and phenolics differ depending on the part of R. verniciflua. The antimicrobial activity of different parts of R. verniciflua against fish pathogenic bacteria was also investigated using Edwardsiella tarda. Lignum of R. verniciflua was the most effective against E. tarda, and phenolic constituents are suggested to be the active constituents. Taken together, phenolic compounds are responsible for the antimicrobial activity of R. verniciflua. The lignum of R. verniciflua has a high content of phenolic compounds and few urushiols, which suggests efficient antimicrobial activity with less toxicity. Therefore, lignum of R. verniciflua is suggested as a good source of antimicrobial activity against fish bacterial diseases.
Keywords: different parts, phenolic compounds, Rhus verniciflua, urushiols
Procedia PDF Downloads 320
367 3D GIS Participatory Mapping and Conflict LADM: Comparative Analysis of Land Policies and Survey Procedures Applied by the Igorots, NCIP, and DENR to Itogon Ancestral Domain Boundaries
Authors: Deniz A. Apostol, Denyl A. Apostol, Oliver T. Macapinlac, George S. Katigbak
Abstract:
Ang lupa ay buhay at ang buhay ay lupa (land is life and life is land). Based on the 2015 census, the Indigenous Peoples (IPs) population in the Philippines is estimated to be 11.3-20.2 million. They hail from various regions and possess distinct cultures, but encounter shared struggles in territorial disputes. Itogon, the largest Benguet municipality, is home to the Ibaloi, Kankanaey, and other Igorot tribes. Despite having three (3) Ancestral Domains (ADs), Itogon is predominantly labeled as timberland or forest. These overlapping land classifications highlight the presence of inconsistencies in national laws and jurisdictions. This study aims to analyze the surveying procedures used by the Igorots, NCIP, and DENR in mapping the Itogon AD boundaries, show land boundary delineation conflicts, propose surveying guidelines, and recommend 3D Participatory Mapping as a geomatics solution for updated AD reference maps. Interpretative Phenomenological Analysis (IPA), Comparative Legal Analysis (CLA), and Map Overlay Analysis (MOA) were utilized to examine the interviews, compare land policies and surveying procedures, and identify differences and overlaps in conflicting land boundaries. In the IPA, the master themes identified as considerations for mapping ADs were AD Definition (rights, responsibilities, restrictions), AD Overlaps (land classifications, political boundaries, ancestral domains, land laws/policies), and Other Conflicts (with other agencies, misinterpretations, suggestions). CLA focused on conflicting surveying procedures: AD Definitions, Surveying Equipment, Surveying Methods, Map Projections, Order of Accuracy, Monuments, Survey Parties, Pre-survey, Survey Proper, and Post-survey procedures. MOA emphasized the land area percentage of conflicting areas, showcasing the impact of misaligned surveying procedures. The findings are summarized through a Land Administration Domain Model (LADM) Conflict, for AD versus AD and Political Boundaries. The products of this study are the identification of land conflict factors, recommendations for survey guidelines, and contested land area computations. These can serve as references for revising survey manuals, updating AD Sustainable Development and Protection Plans, and making amendments to laws.
Keywords: ancestral domain, GIS, indigenous people, land policies, participatory mapping, surveying, survey procedures
Procedia PDF Downloads 95
366 Indicators of Value of Life in Children with Colorectal Illness
Authors: Enkelejda Shkurti, Diamant Shtiza
Abstract:
Background: Health-related quality of life (HRQoL) is an important outcome in health care. The objective of our study was to identify features related to lower HRQoL scores in children with anorectal malformation (ARM) and Hirschsprung disease (HD). Methods: Children younger than 18 years with HD or ARM who were assessed at our private clinic in Tirana, Albania, from December 2018 to October 2019 were identified. The outcomes of broad questionnaires concerning diagnosis, symptoms, and preceding health/surgical history, together with validated tools to measure urinary status, stooling grade, and HRQoL, were appraised. Results: In patients aged 0-6 years, vomiting and abdominal enlargement were associated with a substantial decrease in total HRQoL scores. In children > 6 years of age, vomiting, abdominal swelling, and abdominal discomfort were also linked to a considerably lower HRQoL. The main indicator of lower HRQoL scores on regression tree analysis in all age groups was the occurrence of psychosomatic, behavioral, or progressive comorbidity. Conclusion: Children with either HD or ARM who have a psychosomatic, behavioral, or growing problem experience considerably lower HRQoL than patients without such problems, suggesting that providing behavioral/growing support as part of the care of these patients may have a considerable influence on their HRQoL.
Keywords: anorectal malformation, Hirschsprung disease, quality of life, Albania
Procedia PDF Downloads 175
365 Determining Optimal Number of Trees in Random Forests
Authors: Songul Cinaroglu
Abstract:
Background: Random Forest is an efficient, multi-class machine learning method used for classification, regression and other tasks. The method operates by constructing each tree from a different bootstrap sample of the data. Determining the number of trees in a random forest is an open question in the literature on improving the classification performance of random forests. Aim: The aim of this study is to analyze whether there is an optimal number of trees in Random Forests and how the performance of Random Forests changes as the number of trees increases, using sample health data sets in the R programme. Method: In this study we analyzed the performance of Random Forests as the number of trees grows, doubling the number of trees at every iteration, using the “random forest” package in the R programme. To determine the minimum and optimal numbers of trees we used the McNemar test and the Area Under the ROC Curve, respectively. Results: The analysis found that a growing number of trees does not always mean that the forest performs better than forests with fewer trees. In other words, a larger number of trees only increases computational cost and does not necessarily improve performance. Conclusion: Although the general practice with random forests is to generate a large number of trees to obtain high performance, this study shows that increasing the number of trees does not always improve performance. Future studies can compare different kinds of data sets and different performance measures to test whether Random Forest performance changes as the number of trees increases.
Keywords: classification methods, decision trees, number of trees, random forest
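The McNemar comparison mentioned in the Method can be sketched as a test on the paired predictions of two forests evaluated on the same test set (for example, one forest and another with twice as many trees). The study itself worked in R; the Python function below is only an illustration, its name and the toy predictions are hypothetical, and the continuity-corrected chi-square form of the test is an assumption, since the abstract does not say which variant was used.

```python
import math


def mcnemar_test(labels, preds_a, preds_b):
    """Continuity-corrected McNemar test on paired predictions.

    Returns (chi-square statistic, p-value) with 1 degree of freedom.
    """
    # Discordant pairs: exactly one of the two models is correct.
    only_a_correct = sum(
        1 for y, a, b in zip(labels, preds_a, preds_b) if a == y and b != y
    )
    only_b_correct = sum(
        1 for y, a, b in zip(labels, preds_a, preds_b) if a != y and b == y
    )
    discordant = only_a_correct + only_b_correct
    if discordant == 0:
        return 0.0, 1.0  # the models never disagree on correctness

    # Clip at zero so that equal discordant counts give a statistic of 0.
    corrected_gap = max(abs(only_a_correct - only_b_correct) - 1, 0)
    statistic = corrected_gap ** 2 / discordant
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# Toy example: forest A is uniquely correct on 6 cases, forest B on 2.
labels = [1] * 10
preds_small_forest = [1] * 8 + [0] * 2
preds_large_forest = [1] * 2 + [0] * 6 + [1] * 2
statistic, p_value = mcnemar_test(labels, preds_small_forest, preds_large_forest)
print(f"statistic={statistic:.3f}, p={p_value:.3f}")
```

A large p-value means the two forests' error patterns are statistically indistinguishable, which is the sense in which adding trees may stop paying off.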
Procedia PDF Downloads 396