Search results for: big data ecosystem
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 24758

24398 Application of a Generalized Additive Model to Reveal the Relations between the Density of Zooplankton with Other Variables in the West Daya Bay, China

Authors: Weiwen Li, Hao Huang, Chengmao You, Jianji Liao, Lei Wang, Lina An

Abstract:

Zooplankton are central to ecology and contribute greatly to maintaining the balance of an ecosystem. They are critical in promoting the material cycle and energy flow within ecosystems. A generalized additive model (GAM) was applied to analyze the relationships between the density (individuals per m³) of zooplankton and other variables in West Daya Bay. All data used in this analysis (the survey month, survey station (longitude and latitude), the depth of the water column, the superficial concentration of chlorophyll a, the benthonic concentration of chlorophyll a, the number of zooplankton species and the number of zooplankton individuals) were collected through monthly scientific surveys from January to December 2016. A generalized linear model (GLM) was used to select the variables with a significant effect on the density of zooplankton, and the GAM was employed to analyze the relationship between the density of zooplankton and those significant variables. The results showed that the density of zooplankton increased with an increase of the benthonic concentration of chlorophyll a, but decreased with a decrease in the depth of the water column. Both high numbers of zooplankton species and the overall total number of zooplankton individuals led to a higher density of zooplankton.

Keywords: density, generalized linear model, generalized additive model, the West Daya Bay, zooplankton

Procedia PDF Downloads 117
24397 Big Brain: A Single Database System for a Federated Data Warehouse Architecture

Authors: X. Gumara Rigol, I. Martínez de Apellaniz Anzuola, A. Garcia Serrano, A. Franzi Cros, O. Vidal Calbet, A. Al Maruf

Abstract:

Traditional federated architectures for data warehousing work well when corporations have existing regional data warehouses and there is a need to aggregate data at a global level. Schibsted Media Group has been maturing from a decentralised organisation into a more globalised one and needed to build some of the regional data warehouses for some brands at the same time as the global one. In this paper, we present the architectural alternatives studied and explain why a custom federated approach was the recommended option for implementation. Although the data warehouses are logically federated, the implementation uses a single database system, which offered several advantages: reduced cost and improved data access for global users, giving consumers of the data a common data model for detailed analysis across different geographies and a flexible layer for local needs in the same place.

Keywords: data integration, data warehousing, federated architecture, Online Analytical Processing (OLAP)

Procedia PDF Downloads 214
24396 A System For A Sustainable Electronic Waste Marketplace

Authors: Arya Sarukkai

Abstract:

Due to increased technological advances and the high use of phones, tablets, computers, and other electronics, we continue to see rapid growth in the volume of e-waste. There are millions just throwing out their old devices, millions who have many devices and don’t know what to do with them, and there are millions who would benefit from receiving those devices. The thesis of this paper is that by creating an ecosystem of donors and recipients and providing the right incentives, we can reduce e-waste. We discuss a system for sustainable e-waste by building a marketplace between donors and recipients. We also summarize experimental results comparing different incentives and present a live web service that allows for e-waste supplies to reach schools and nonprofit institutions.

Keywords: E-waste ecosystems, marketplaces, e-waste web app, online services

Procedia PDF Downloads 158
24395 A Review Paper on Data Mining and Genetic Algorithm

Authors: Sikander Singh Cheema, Jasmeen Kaur

Abstract:

In this paper, the concept of data mining is summarized, along with one of its important processes, knowledge discovery in databases (KDD). Data mining based on genetic algorithms is examined, and ways to implement it are surveyed. This paper also conducts a formal review of data mining tasks and genetic algorithms in various fields.

Keywords: data mining, KDD, genetic algorithm, descriptive mining, predictive mining

Procedia PDF Downloads 565
24394 Data-Mining Approach to Analyzing Industrial Process Information for Real-Time Monitoring

Authors: Seung-Lock Seo

Abstract:

This work presents a data-mining empirical monitoring scheme for industrial processes with partially unbalanced data. Measurement data of good operations are relatively easy to gather, but during unusual special events or faults it is generally difficult to collect process information, and it can be almost impossible to analyze some of the noisy data from industrial processes. In such cases, noise filtering techniques can be used to enhance process monitoring performance on a real-time basis. In addition, pre-processing of raw process data helps to eliminate unwanted variation in industrial process data. In this work, the performance of various monitoring schemes was tested and demonstrated on discrete batch process data. The results showed that monitoring performance improved significantly in terms of the monitoring success rate for the given process faults.

Keywords: data mining, process data, monitoring, safety, industrial processes

Procedia PDF Downloads 368
24393 Studies on Population and Management of Melon Fruit Fly Bactrocera cucurbitae (Coquillett) in Vegetables Agro-Ecosystem in District Hyderabad

Authors: Abro Zain-Ul-Aabdin, Naheed Baloch, Khuhro Niaz Hussain, Waseem Akbar, Noor Abid Saeed

Abstract:

The melon fruit fly Bactrocera cucurbitae (Coq.) belongs to the family Tephritidae, order Diptera, and is distributed throughout the vegetable-growing areas of Pakistan. B. cucurbitae is an injurious pest of more than 125 vegetable species throughout the world. In the present study, we investigated the population of this important pest in cucurbit crops and the influence of abiotic parameters such as temperature, relative humidity and rainfall. The study was carried out at two locations in District Hyderabad, Jeay Shah and Dehli farm, where three cucurbit vegetable crops, bottle gourd (Lagenaria siceraria), bitter gourd (Momordica charantia) and ridge gourd (Luffa acutangula), were grown. The traps were baited with Cue-lure and deployed at a height of three meters at all locations from 01.01.2015 to 30.06.2015. Results revealed that overall a significantly higher (P < 0.05) population was recorded on L. acutangula, M. charantia and L. siceraria (130.64, 127.21, and 122.91, respectively). However, a significantly higher (P < 0.05) population was observed on L. acutangula (339.4±22.59) during the 4th week of May 2015, followed by M. charantia (334.6±22.76) and L. siceraria (333.2±20.13). The lowest population was recorded on L. siceraria (5.8±1.39), followed by L. acutangula and M. charantia (6.8±0.80 and 8.0±1.30, respectively), during the 4th week of January. The population of B. cucurbitae was significantly correlated with temperature and negatively correlated with relative humidity. In the parasitism preference experiment, the pupal parasitoid Dirhinus giffardii showed significantly higher (P < 0.05) parasitization when the pupae of B. cucurbitae were reared on cucumber (Cucumis sativus) (24.8±0.48), and females were also obtained from pupae reared on C. sativus in the no-choice experiment. Similarly, higher parasitization was observed and females were recovered when pupae were supplied with C. sativus in the free-choice experiment.
Results of the present investigation would be useful in developing a sustainable pest management strategy in the vegetable agro-ecosystem.

Keywords: Dirhinus giffardii, Bactrocera cucurbitae, Cucumis sativus, diptera, free choice, parasitization

Procedia PDF Downloads 333
24392 Assessing the Impact of Renewable Energy on Regional Sustainability: A Comparative Study of Suwon and Seoul

Authors: Jongsoo Jurng

Abstract:

The drive to expand renewable energies is often in direct conflict with sustainable development goals. Thus, it is important that energy policies account for potential trade-offs. We assess the interlinkages between energy, food, water, and land, for two case studies, Suwon and Seoul. We apply a range of assessment methods and study their usefulness as tools to identify trade-offs and to compare the sustainability performance. We calculate cross-sectoral footprints, self-sufficiency ratios and perform a simplified Energy-Water-Food nexus analysis. We use the latter for assessing scenarios to increase energy and food self-sufficiency in Suwon, while we use ecosystem service (ESS) accounting for Seoul. For Suwon, we find that constraints on the energy, food and water sectors urgently call for integrated approaches to energy policy; for Seoul, the further expansion of renewables comes at the expense of cultural and supporting ESS, which could outweigh gains from increased energy exports. We recommend a general upgrade to indicators and visualization methods that look beyond averages and a fostering of infrastructure for data on sustainable development based on harmonized international protocols. We warn against rankings of countries or regions based on benchmarks that are neither theory-driven nor location-specific.

Keywords: ESS, renewable energy, energy-water-food nexus, assessment

Procedia PDF Downloads 113
24391 A Survey of Semantic Integration Approaches in Bioinformatics

Authors: Chaimaa Messaoudi, Rachida Fissoune, Hassan Badir

Abstract:

Technological advances in computer science and data analysis continuously produce huge volumes of biological data, which are available on the web. Such advances require powerful data integration techniques to extract pertinent knowledge and information for a specific question. Biomedical exploration of these big data often requires complex queries across multiple autonomous, heterogeneous and distributed data sources. Semantic integration is an active area of research in several disciplines, such as databases, information integration, and ontology. We provide a survey of some approaches and techniques for integrating biological data, focusing on those developed in the ontology community.

Keywords: biological ontology, linked data, semantic data integration, semantic web

Procedia PDF Downloads 415
24390 Classification of Generative Adversarial Network Generated Multivariate Time Series Data Featuring Transformer-Based Deep Learning Architecture

Authors: Thrivikraman Aswathi, S. Advaith

Abstract:

Because the use of real data is sometimes limited, for example when it is hard to get access to a large volume of real data, we need to turn to synthetic data generation, which can produce high-quality synthetic data while maintaining the statistical properties of a specific dataset. In the present work, a generative adversarial network (GAN) is trained to produce multivariate time series (MTS) data, since MTS data are now being gathered more often in various real-world systems. The GAN-generated MTS data is then fed into a transformer-based deep learning architecture that classifies the data into predefined classes. Further, the model is evaluated across various distinct domains by generating corresponding MTS data.

Keywords: GAN, transformer, classification, multivariate time series

Procedia PDF Downloads 90
24389 Generative AI: A Comparison of Conditional Tabular Generative Adversarial Networks and Conditional Tabular Generative Adversarial Networks with Gaussian Copula in Generating Synthetic Data with Synthetic Data Vault

Authors: Lakshmi Prayaga, Chandra Prayaga, Aaron Wade, Gopi Shankar Mallu, Harsha Satya Pola

Abstract:

Synthetic data generated by Generative Adversarial Networks and Autoencoders is becoming more common as a way to combat the problem of insufficient data for research purposes. However, generating synthetic data is a tedious task requiring an extensive mathematical and programming background. Open-source platforms such as the Synthetic Data Vault (SDV) and Mostly AI offer a user-friendly way, accessible to non-technical professionals, to generate synthetic data to augment existing data for further analysis. The SDV also provides for additions to the generic GAN, such as the Gaussian copula. We present the results from two synthetic data sets (CTGAN data and CTGAN with Gaussian Copula) generated by the SDV and report the findings. The results indicate that the ROC curves and AUC values for the data generated with the added Gaussian copula layer are much higher than those for the data generated by the CTGAN.

Keywords: synthetic data generation, generative adversarial networks, conditional tabular GAN, Gaussian copula

Procedia PDF Downloads 32
24388 Multi-Temporal Analysis of Vegetation Change within High Contaminated Watersheds by Superfund Sites in Wisconsin

Authors: Punwath Prum

Abstract:

Superfund sites are publicly recognized as a severe environmental problem for surrounding communities and biodiversity because of the hazardous chemical waste left by industrial activities. They contaminate soil and water and are also a leading potential source of point-source pollution, with chemical substances affecting ecosystems in watershed areas. The risks that Superfund sites pose to watersheds can be measured effectively using publicly available data and geospatial analysis with free and open-source applications. This study analyzed vegetation change within high-risk contaminated watersheds in Wisconsin. High-risk watersheds were identified as those containing a high number of Superfund sites. The study identified two potentially at-risk watersheds in Lafayette and analyzed the temporal changes of vegetation within these areas based on normalized difference vegetation index (NDVI) analysis. Raster statistics were used to compare the change in NDVI value over the period. The results showed that NDVI values within the Superfund sites' boundaries were significantly lower than in the nearby surroundings, which provides an indication of the environmental hazard caused by chemical contamination at Superfund sites.
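
The NDVI comparison described in this abstract can be illustrated with a minimal sketch. It uses the standard definition NDVI = (NIR - Red) / (NIR + Red); the reflectance values, the function names, and the rule of returning 0 for empty pixels are assumptions made for illustration and are not taken from the study.

```python
def ndvi(nir, red):
    """Normalized difference vegetation index for a single pixel."""
    total = nir + red
    if total == 0:
        return 0.0  # assumption: treat pixels with no signal as 0
    return (nir - red) / total


def mean_ndvi(nir_band, red_band):
    """Average NDVI over paired near-infrared and red reflectance values."""
    values = [ndvi(n, r) for n, r in zip(nir_band, red_band)]
    return sum(values) / len(values)


# Hypothetical reflectance samples (not data from the paper).
site_nir, site_red = [0.30, 0.28, 0.35], [0.20, 0.22, 0.21]
surround_nir, surround_red = [0.50, 0.55, 0.48], [0.10, 0.08, 0.12]

site_mean = mean_ndvi(site_nir, site_red)
surround_mean = mean_ndvi(surround_nir, surround_red)
print(f"site: {site_mean:.3f}, surroundings: {surround_mean:.3f}")
```

A real workflow would read the bands from raster files and compute zonal statistics per watershed; the sketch only shows the index itself and a mean comparison.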

Keywords: soil contamination, spatial analysis, watershed

Procedia PDF Downloads 103
24387 Heavy Metal Pollution in Soils of Yelagiri Hills, Tamilnadu by EDXRF Technique

Authors: Chandrasekaran, Ravisankar N. Harikrishnan, Rajalakshmi, K. K. Satapathy, M. V. R. Prasad, K. V. Kanagasabapathy

Abstract:

Heavy metals are considered highly toxic environmental pollutants for soil ecosystems and human health. In the present study, 12 heavy metals (Mg, Al, K, Ca, Ti, Fe, V, Cr, Mn, Co, Ni and Zn) are determined in soils of the Yelagiri hills, Tamilnadu, by the energy dispersive X-ray fluorescence (EDXRF) technique. Metal concentrations were used to quantify pollution contamination factors: the enrichment factor (EF), geo-accumulation index (Igeo) and contamination factor (CF) are calculated and reported.
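
The three indices named in this abstract have widely used textbook definitions, sketched below. The abstract does not state which formulas, background values, or reference element the authors used, so the concentrations and the choice of Fe as reference element here are hypothetical.

```python
import math


def contamination_factor(c_sample, c_background):
    """CF: measured concentration divided by the background concentration."""
    return c_sample / c_background


def geoaccumulation_index(c_sample, c_background):
    """Igeo = log2(C / (1.5 * B)); 1.5 allows for natural background variation."""
    return math.log2(c_sample / (1.5 * c_background))


def enrichment_factor(c_sample, ref_sample, c_background, ref_background):
    """EF: metal-to-reference ratio in the sample over the same ratio in the background."""
    return (c_sample / ref_sample) / (c_background / ref_background)


# Hypothetical concentrations in mg/kg (not values from the paper).
zn_sample, zn_background = 140.0, 70.0
fe_sample, fe_background = 40000.0, 35000.0  # Fe as an assumed reference element

cf = contamination_factor(zn_sample, zn_background)
igeo = geoaccumulation_index(zn_sample, zn_background)
ef = enrichment_factor(zn_sample, fe_sample, zn_background, fe_background)
print(f"CF={cf:.2f}, Igeo={igeo:.3f}, EF={ef:.2f}")
```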

Keywords: soil, heavy metals, EDXRF, pollution contamination factors

Procedia PDF Downloads 310
24386 A Privacy Protection Scheme Supporting Fuzzy Search for NDN Routing Cache Data Name

Authors: Feng Tao, Ma Jing, Guo Xian, Wang Jing

Abstract:

Named Data Networking (NDN) replaces the IP address of traditional networks with a data name and adopts a dynamic cache mechanism. In the existing mechanism, however, only one-to-one search can be achieved, because every piece of data has a unique name. There is a certain mapping relationship between data content and data name, so if the data name is intercepted by an adversary, the privacy of the data content and of the user's interest can hardly be guaranteed. To solve this problem, this paper proposes a one-to-many fuzzy search scheme based on order-preserving encryption, which reduces the query overhead by optimizing the caching strategy. In this scheme, we use hash values to keep the user's query safe from each node during the search, and likewise to protect the privacy of the requested data content.

Keywords: NDN, order-preserving encryption, fuzzy search, privacy

Procedia PDF Downloads 448
24385 Data Disorders in Healthcare Organizations: Symptoms, Diagnoses, and Treatments

Authors: Zakieh Piri, Shahla Damanabi, Peyman Rezaii Hachesoo

Abstract:

Introduction: Healthcare organizations, like other organizations, suffer from a number of disorders such as Business Sponsor Disorder, Business Acceptance Disorder, Cultural/Political Disorder, Data Disorder, etc. As quality in healthcare mostly depends on the quality of data, we aimed to identify data disorders and their symptoms in two teaching hospitals. Methods: Using a self-constructed questionnaire, we asked 20 questions related to the quality and usability of patient data stored in patient records. The research population consisted of 150 managers, physicians, nurses and medical record staff who were working at the time of the study. We also asked for their views on the symptoms of and treatments for any data disorders they mentioned in the questionnaire. We analyzed the answers using qualitative methods. Results: After classifying the answers, we found six main data disorders: incomplete data, missed data, late data, blurred data, manipulated data and illegible data. The majority of participants believed they had important roles in the treatment of data disorders, while others attributed the disorders to health system problems. Discussion: As clinicians have important roles in producing data, they can easily identify symptoms and disorders of patient data. Health information managers can also play important roles in the early detection of data disorders through proactive monitoring and periodic check-ups of data.

Keywords: data disorders, quality, healthcare, treatment

Procedia PDF Downloads 406
24384 Variation of Litter Chemistry under Intensified Drought: Consequences on Flammability

Authors: E. Ormeno, C. Gutigny, J. Ruffault, J. Madrigal, M. Guijarro, C. Lecareux, C. Ballini

Abstract:

Mediterranean plant species feature numerous metabolic and morpho-physiological responses crucial to surviving under both typical Mediterranean drought conditions and the aggravated drought expected under climate change. Whether these adaptive responses will, in turn, increase ecosystem perturbation in terms of fire hazard is an issue that needs to be addressed. The aim of this study was to test whether recurrent and aggravated drought in the Mediterranean area favors the accumulation of waxes in leaf litter, with an eventual increase of litter flammability. The study was conducted in 2017 in a garrigue in Southern France dominated by Quercus coccifera, where two drought treatments were used: a recurrent aggravated drought treatment consisting of ten rain exclusion structures, which have withdrawn part of the annual precipitation since January 2012, and a natural drought treatment in which Q. coccifera stands are free of such structures and thus grow under natural precipitation. Waxes were extracted with organic solvent and analyzed by GC-MS, and litter flammability was assessed through measurements of the ignition delay, flame residence time and flame intensity (flame height) using an epiradiator, as well as the heat of combustion using an oxygen bomb calorimeter. Results show that after 5 years of rain restriction, wax content in the cuticle of leaf litter increases significantly compared to shrubs growing under natural precipitation, in accordance with theory, which predicts increases of cuticular waxes in green leaves in order to limit evapotranspiration. Wax concentrations were also linearly and positively correlated with litter flammability, a correlation that rests on the high flammability of the long-chain alkanes (C25-C31) found in leaf litter waxes. This innovative investigation shows that climate change is likely to favor ecosystem fire hazard through accumulation of highly flammable waxes in litter.
It also adds valuable information about the types of metabolites that are associated with increasing litter flammability, since so far, within the leaf metabolic profile, only terpene-like compounds had been related to plant flammability.

Keywords: cuticular waxes, drought, flammability, litter

Procedia PDF Downloads 143
24383 Big Data and Analytics in Higher Education: An Assessment of Its Status, Relevance and Future in the Republic of the Philippines

Authors: Byron Joseph A. Hallar, Annjeannette Alain D. Galang, Maria Visitacion N. Gumabay

Abstract:

One of the unique challenges posed by the twenty-first century to Philippine higher education is the utilization of Big Data. The higher education system in the Philippines is generating burgeoning amounts of data, including data that can be used to generate the information and knowledge needed for accurate data-driven decision making. This study examines the status, relevance and future of Big Data and Analytics in Philippine higher education. The insights gained from the study may be relevant to other developing nations in a situation similar to that of the Philippines.

Keywords: big data, data analytics, higher education, republic of the philippines, assessment

Procedia PDF Downloads 308
24382 Using Biofunctool® Index to Assess Soil Quality after Eight Years of Conservation Agriculture in New Caledonia

Authors: Remy Kulagowski, Tobias Sturm, Audrey Leopold, Aurelie Metay, Josephine Peigne, Alexis Thoumazeau, Alain Brauman, Bruno Fogliani, Florent Tivet

Abstract:

A major challenge for agriculture is to enhance productivity while limiting the impact on the environment. Conservation agriculture (CA) is one strategy whereby both sustainability and productivity can be achieved by preserving and improving soil quality. Soils provide and regulate a large number of ecosystem services (ES) such as agricultural productivity and climate change adaptation and mitigation. The aim of this study is to assess the impacts of contrasting CA crop management on soil functions for maize (Zea mays L.) cultivation in an eight-year field experiment (2010-2018). The study included two CA practices, direct seeding in dead mulch (DM) and in living mulch (LM), and conventional plough-based tillage (CT) practices on a fluvisol in New Caledonia (French archipelago in the South Pacific). In 2018, the soil quality of the cropping systems was evaluated with the Biofunctool® set of indicators, which consists of twelve integrative, in-field, low-tech indicators assessing the biological, physical and chemical properties of soils. The main soil functions evaluated were (i) carbon transformation, (ii) structure maintenance, and (iii) nutrient cycling in the top ten centimeters of soil. The results showed significantly higher scores for the soil structure maintenance function (e.g., aggregate stability, water infiltration) and the carbon transformation function (e.g., soil respiration, labile carbon) under CA in DM and LM compared with CT. The carbon transformation index score was higher in DM than in LM. However, no significant effect of cropping system was observed on nutrient cycling (i.e., nitrogen and phosphorus). In conclusion, the aggregated synthetic scores of soil multi-functions evaluated with Biofunctool® demonstrate that CA cropping systems lead to better soil functioning. Further analysis of the results together with the agronomic performance of the soil-crop systems would allow a better understanding of the links between soil functioning and the production ES of CA.

Keywords: conservation agriculture, cropping systems, ecosystem services, soil functions

Procedia PDF Downloads 115
24381 Data Management and Analytics for Intelligent Grid

Authors: G. Julius P. Roy, Prateek Saxena, Sanjeev Singh

Abstract:

Two decades ago, power distribution utilities collected data from their customers no more often than once a month. The advent of SmartGrid and AMI has since increased the sampling frequency, leading to a 1,000- to 10,000-fold increase in data quantity. This increase is notable and led to the coining of the term Big Data in utilities. The power distribution industry is one of the largest handlers of huge and complex data, both for keeping history and for turning the data into something significant. The majority of utilities around the globe are adopting SmartGrid technologies as a mass implementation and are primarily focusing on the strategic interdependence and synergies of the big data coming from new information sources like AMI and intelligent SCADA. There is a rising need for new models of data management and a renewed focus on analytics to dissect data into descriptive, predictive and dictatorial subsets. The goal of this paper is to bring load disaggregation into the smart energy toolkit for commercial usage.

Keywords: data management, analytics, energy data analytics, smart grid, smart utilities

Procedia PDF Downloads 753
24380 Privacy Preserving Data Publishing Based on Sensitivity in Context of Big Data Using Hive

Authors: P. Srinivasa Rao, K. Venkatesh Sharma, G. Sadhya Devi, V. Nagesh

Abstract:

Privacy Preserving Data Publication is a main concern at present because the amount of data being published through the internet is increasing day by day. This huge amount of data has been named Big Data because of its size. This project deals with privacy preservation in the context of Big Data using a data warehousing solution called Hive. We implemented Nearest Similarity Based Clustering (NSB) with bottom-up generalization to achieve (v,l)-anonymity. (v,l)-Anonymity deals with sensitivity vulnerabilities and ensures individual privacy. We also calculate the sensitivity levels by a simple comparison method using the index values, classifying the different levels of sensitivity. The experiments were carried out in the Hive environment to verify the efficiency of the algorithms with Big Data. This framework also supports the execution of existing algorithms without any changes. The model in the paper outperforms existing models.

Keywords: sensitivity, sensitive level, clustering, Privacy Preserving Data Publication (PPDP), bottom-up generalization, Big Data

Procedia PDF Downloads 263
24379 Microbial Activity and Greenhouse Gas (GHG) Emissions in Recovery Process in a Grassland of China

Authors: Qiushi Ning

Abstract:

Nitrogen (N) is an important limiting factor in various ecosystems, and the N deposition rate is increasing at an unprecedented pace due to anthropogenic activities. N deposition alters microbial growth and activity, and microbially mediated N cycling, by changing soil pH and the availability of N and carbon (C). CO2, CH4 and N2O are important greenhouse gases that threaten the sustainability and function of the ecosystem. With prolonged and increasing N enrichment, soil acidification and C limitation will be aggravated, and microbial biomass will decline further. Soil acidification and lack of C induced by N addition are argued to be two important factors regulating microbial activity and growth, yet studies combining the effects of soil acidification and lack of C on the microbial community are scarce. In order to restore ecosystems affected by chronic N loading, we determined the responses of microbial activity and GHG emissions to lime and glucose addition (control, 1‰ lime, 2‰ lime, glucose, 1‰ lime×glucose and 2‰ lime×glucose), used to alleviate soil acidification and supply a C resource, in soils with N addition rates of 0-50 g N m–2yr–1. The results showed no significant responses of soil respiration and microbial biomass (MBC and MBN) to lime addition; glucose, however, substantially improved soil respiration and microbial biomass (MBC and MBN), and the cumulative CO2 emission and microbial biomass of the lime×glucose treatments were not significantly higher than those of the glucose-only treatment. The glucose and lime×glucose treatments reduced the net mineralization and nitrification rates, because microbial growth stimulated by the C supply incorporated more inorganic N into biomass, and mineralization of organic N was relatively reduced. Glucose addition also increased CH4 and N2O emissions; CH4 emissions were regulated mainly by the C resource as a substrate for methanogens.
However, the N2O emissions were regulated by both C resources and soil pH: C was an important energy source, and the increased soil pH could benefit the nitrifiers and denitrifiers, which were the primary producers of N2O. Soil respiration and N2O emissions increased with increasing N addition rates in all glucose treatments, as the external C resource improved microbial N utilization. Compared with alleviating soil acidification, improving the availability of C substantially increased microbial activity; therefore, C should be the main limiting factor in soils under long-term N loading. Most importantly, when organic C fertilization is used to improve ecosystem production, GHG emissions and the consequent warming potential should be carefully considered.

Keywords: acidification and C limitation, greenhouse gas emission, microbial activity, N deposition

Procedia PDF Downloads 272
24378 A Fuzzy Kernel K-Medoids Algorithm for Clustering Uncertain Data Objects

Authors: Behnam Tavakkol

Abstract:

Uncertain data mining algorithms use different ways to consider uncertainty in data such as by representing a data object as a sample of points or a probability distribution. Fuzzy methods have long been used for clustering traditional (certain) data objects. They are used to produce non-crisp cluster labels. For uncertain data, however, besides some uncertain fuzzy k-medoids algorithms, not many other fuzzy clustering methods have been developed. In this work, we develop a fuzzy kernel k-medoids algorithm for clustering uncertain data objects. The developed fuzzy kernel k-medoids algorithm is superior to existing fuzzy k-medoids algorithms in clustering data sets with non-linearly separable clusters.
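
To make the terms "kernel" and "non-crisp cluster labels" concrete, here is a minimal sketch of two generic building blocks: a kernel-induced squared distance and the standard fuzzy membership update. It works on ordinary (certain) points with an RBF kernel, which is an assumption; it is not the author's algorithm for uncertain data objects, and the function names, `gamma`, and the fuzzifier `m` are illustrative choices.

```python
import math


def rbf_kernel(x, y, gamma=1.0):
    """Gaussian (RBF) kernel between two points given as tuples."""
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq)


def kernel_distance_sq(x, y, kernel=rbf_kernel):
    """Squared distance in the kernel feature space: k(x,x) - 2k(x,y) + k(y,y)."""
    return kernel(x, x) - 2.0 * kernel(x, y) + kernel(y, y)


def fuzzy_memberships(point, medoids, m=2.0, kernel=rbf_kernel):
    """Membership of `point` in each cluster; the values sum to 1.

    Uses the usual fuzzy rule u_j proportional to (1 / d_j^2)^(1 / (m - 1)).
    """
    d2 = [kernel_distance_sq(point, med, kernel) for med in medoids]
    for j, d in enumerate(d2):
        if d <= 1e-12:  # the point coincides with a medoid
            return [1.0 if i == j else 0.0 for i in range(len(medoids))]
    exponent = 1.0 / (m - 1.0)
    inverse = [(1.0 / d) ** exponent for d in d2]
    total = sum(inverse)
    return [v / total for v in inverse]


# Hypothetical two-cluster example.
u = fuzzy_memberships((0.0, 0.0), [(0.1, 0.0), (3.0, 3.0)])
print([round(v, 3) for v in u])
```

A full k-medoids loop would alternate this membership step with re-selecting each medoid as the object that minimizes the membership-weighted kernel distance to the other objects.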

Keywords: clustering algorithm, fuzzy methods, kernel k-medoids, uncertain data

Procedia PDF Downloads 177
24377 Land Lots and Shannon-Wiener Index in Sarpolzahab Agro Ecosystems-Western Iran

Authors: Ashkan Asgari, Korous Khoshbakht, Saeid Soufizadeh

Abstract:

Various factors, including the number of land lots, can affect biodiversity indices in agricultural systems. A field study was conducted in 2012 to evaluate factors affecting crop diversity in Sarpolzahab. The required data were collected through direct observation of farms and questionnaires. A total of 140 questionnaires were completed; SAS software was used to analyse the data, and the Ecological Methodology program was then applied to calculate the Shannon-Wiener index. The results indicated that the average number of land lots per farmer was 2.78, varying from 2.2 in Rikhak Olia Village to 4.31 in Golam Kaboud Olia Village, which reflects the small size of land lots resulting from the division of larger lots among the children of deceased farmers. The correlations between the number of land lots and species biodiversity (0.308**) and the Shannon-Wiener index (0.262**) were significant. Therefore, one can assume that an increase in the number of land lots improves the target index. Multiple land lots allow farmers to cultivate various crops, which increases crop biodiversity in the agro-ecosystem. In turn, this increase supports the economic sustainability of farmers and the distribution of the work force in the region throughout the year. The correlations of the number of seasonal workers with crop species biodiversity (0.256**) and with the Shannon-Wiener index (0.286**) were statistically significant; an increasing number of seasonal workers was associated with improved crop biodiversity and fewer dominant-species or single-crop farming systems. Vegetable farms, which have considerable diversity, require a large work force, which explains the correlation between the number of workers and species diversity.
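
For readers unfamiliar with the Shannon diversity index used in this abstract, the sketch below computes H' = -Σ p_i ln p_i from per-crop abundances. The natural logarithm and the example cultivated areas are assumptions; the abstract does not say which log base or abundance measure the authors used.

```python
import math


def shannon_index(counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over non-zero abundances."""
    total = sum(counts)
    h = 0.0
    for c in counts:
        if c > 0:
            p = c / total
            h -= p * math.log(p)
    return h


# Hypothetical cultivated area (ha) per crop on two farms.
single_crop_farm = [40]
diversified_farm = [10, 10, 10, 10]

print(shannon_index(single_crop_farm))   # a monoculture has zero diversity
print(round(shannon_index(diversified_farm), 4))
```

Four equally abundant crops give ln 4, the maximum for four species, which matches the abstract's reasoning that spreading cultivation across more crops raises the index.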

Keywords: agricultural systems, biodiversity indices, Shannon-Winner index, sustainability, rural

Procedia PDF Downloads 502
24376 Democracy Bytes: Interrogating the Exploitation of Data Democracy by Radical Terrorist Organizations

Authors: Nirmala Gopal, Sheetal Bhoola, Audecious Mugwagwa

Abstract:

This paper discusses the continued infringement and exploitation of data by non-state actors for destructive purposes, with an emphasis on radical terrorist organizations. It discusses how terrorist organizations access and use data to foster their nefarious agendas. It further examines how cybersecurity, designed as a tool to curb data exploitation, is ineffective in raising global citizens' concerns about how their data can be kept safe and used for its acquired purpose. The study interrogates several policies and data protection instruments, such as the Data Protection Act, Cyber Security Policies, Protection of Personal Information (PPI) and the General Data Protection Regulation (GDPR), to understand data use and storage in democratic states. The study outcomes indicate that international cybersecurity and cybercrime legislation, policies, and conventions have not curbed violations of data access and use by radical terrorist groups. The study recommends ways to enhance cybersecurity and reduce cyber risks using democratic principles.

Keywords: cybersecurity, data exploitation, terrorist organizations, data democracy

Procedia PDF Downloads 169
24375 Healthcare Data Mining Innovations

Authors: Eugenia Jilinguirian

Abstract:

In the healthcare industry, data mining is essential because it transforms the field by extracting useful information from large datasets. Data mining is the process of applying advanced analytical methods to large patient records and medical histories in order to identify patterns, correlations, and trends. Healthcare professionals can improve diagnostic accuracy, uncover hidden linkages, and predict disease outcomes by carefully examining these data. Additionally, data mining supports personalized medicine by tailoring treatment to the unique attributes of each patient. This proactive strategy helps allocate resources more efficiently, enhances patient care, and streamlines operations. However, to apply data mining effectively and ensure the appropriate use of private healthcare information, issues like data privacy and security must be carefully considered. Data mining continues to be vital in the search for more effective, efficient, and individualized healthcare solutions as technology evolves.

Keywords: data mining, healthcare, big data, individualised healthcare, healthcare solutions, database

Procedia PDF Downloads 38
24374 Summarizing Data Sets for Data Mining by Using Statistical Methods in Coastal Engineering

Authors: Yunus Doğan, Ahmet Durap

Abstract:

Coastal regions are among the most intensively used places, owing to both their natural balance and the growing population. In coastal engineering, the most valuable data concern wave behavior. The volume of these data becomes very large because observations take place over periods of hours, days and months. In this study, statistical methods such as wave spectrum analysis methods and standard statistical methods have been used. The goal of this study is to discover profiles of different coastal areas using these statistical methods, and thus to obtain an instance-based data set from the big data for analysis with data mining algorithms. In the experimental studies, six sample data sets on wave behavior, obtained from 20-minute observations in Mersin Bay, Turkey, were converted to an instance-based form, and different clustering techniques from data mining were used to discover similar coastal places. Moreover, this study discusses how this summarization approach can be used in other fields that collect big data, such as medicine.

Keywords: clustering algorithms, coastal engineering, data mining, data summarization, statistical methods

Procedia PDF Downloads 337
24373 Diversity of Large Mammals in Awash National Park and its Ecosystem Role and Biodiversity Conservation, Ethiopia

Authors: Sintayehu W. Dejene

Abstract:

An ecological and biodiversity conservation study on the species composition, population status and habitat association of large mammals, and on the impact of human interference on their distribution, was carried out in Awash National Park (ANP), Ethiopia, from October 2012 to July 2013. A total of 25 species of large mammals were recorded from the study area. Representative sample sites were taken from each habitat type and surveyed using the random line transect method. For the medium and large mammal survey, indirect methods (footprints and dung) and direct observations were used. Twenty-three species of medium to large-sized mammals were identified and recorded from ANP. A total of 25 species of medium and large-sized mammals were recorded from the study area. Of these, 20 species were rodents of three families and five species were insectivores of two families. Beisa Oryx (Oryx beisa beisa), Soemmerring's gazelle (Gazella soemmeringi), Defassa waterbuck (Kobus defassa), Lesser Kudu (Strepsiceros imberbis), Greater Kudu (Strepsiceros strepsiceros), Warthog (Phacochoerus aethiopicus), Baboon (Papio anubis baboon) and Salt's dikdik (Madoqua saltiana) were the most commonly seen medium and large mammals in the study area. Beisa Oryx (Oryx beisa beisa) and Soemmerring's gazelles (Gazella soemmeringi) are commonly found in the open areas, whereas Greater Kudus (Strepsiceros strepsiceros) and Lesser Kudus (Strepsiceros imberbis) were seen in the bushed areas. Defassa waterbuck (Kobus defassa) was observed in the bushy river area in the northern part of the park. Anubis baboon (Papio anubis baboon) was seen near the riverside. Hamadryas baboons were found in semi-desert areas of Awash National Park, particularly in the Filwoha area. The area is a key biodiversity conservation area and provides clean water, air, food, grazing land and carbon storage.

Keywords: awash national park, biodiversity, ecosystem value, habitat association, large mammals, population status, species composition

Procedia PDF Downloads 354
24372 Access to Health Data in Medical Records in Indonesia in Terms of Personal Data Protection Principles: The Limitation and Its Implication

Authors: Anny Retnowati, Elisabeth Sundari

Abstract:

This research aims to elaborate the meaning of personal data protection principles for patient access to health data in medical records in Indonesia and its implications. The method is normative legal research, examining health law in Indonesia regarding the patient's right to access their health data in medical records. The data are analysed qualitatively using the interpretation method to elaborate on the limitation of the meaning of personal data protection principles for patients' access to their data in medical records. The results show that patients only have the right to obtain copies of their health data in medical records. There is no right to inspect the records directly at any time. Indonesian health law limits the principle of patients' right to broad access to their health data in medical records. This restriction has implications for the reduction of personal data protection as part of human rights. This research contributes by showing that a limitation of personal data protection may infringe human rights.

Keywords: access, health data, medical records, personal data, protection

Procedia PDF Downloads 57
24371 Conceptualizing the Knowledge to Manage and Utilize Data Assets in the Context of Digitization: Case Studies of Multinational Industrial Enterprises

Authors: Martin Böhmer, Agatha Dabrowski, Boris Otto

Abstract:

The trend of digitization significantly changes the role of data for enterprises. Data turn from an enabler into an intangible organizational asset that requires management and qualifies as a tradeable good. The idea of a networked economy has gained momentum in the data domain as collaborative approaches to data management emerge. Traditional organizational knowledge consequently needs to be extended by comprehensive knowledge about data. This knowledge about data is vital for organizations to ensure that data quality requirements are met and that data can be effectively utilized and sovereignly governed. As academics have so far paid little attention to this specific knowledge, the aim of the research presented in this paper is to conceptualize it by proposing a “data knowledge model”. Relevant model entities have been identified using a design science research (DSR) approach that iteratively integrates insights from various industry case studies and literature research.

Keywords: data management, digitization, industry 4.0, knowledge engineering, metamodel

Procedia PDF Downloads 326
24370 A Comprehensive Planning Model for Amalgamation of Intensification and Green Infrastructure

Authors: Sara Saboonian, Pierre Filion

Abstract:

The dispersed-suburban model has been the dominant one across North America for the past seventy years, characterized by automobile reliance, low density, and land-use specialization. Two planning models have emerged as possible alternatives to address the ills inflicted by this development pattern. First, there is intensification, which promotes efficient infrastructure by connecting high-density, multi-functional, and walkable nodes with public transit services within the suburban landscape. Second is green infrastructure, which provides environmental health and human well-being by preserving and restoring ecosystem services. This research studies the incompatibilities between the two alternatives and the possibility of amalgamating them, in an attempt to develop a comprehensive alternative to the suburban model that advocates density, multi-functionality and transit- and pedestrian-conduciveness, with measures capable of mitigating the adverse environmental impacts of compactness. The research investigates three Canadian urban growth centers, where intensification is the current planning practice and awareness of green infrastructure benefits is on the rise. However, these three centers are contrasted by their development stage, the presence or absence of protected natural land, their environmental approach, and their adverse environmental consequences according to the planning canons of different periods. The methods include reviewing the literature on green infrastructure planning, critiquing the Ontario provincial plans for intensification, surveying residents’ preferences for alternative models, and interviewing officials who deal with the local planning for the centers. Moreover, the research draws on debates between New Urbanism and Landscape/Ecological Urbanism. The case studies expose the difficulties in creating urban growth centres that accommodate green infrastructure while adhering to intensification principles.
First, the dominant status of intensification and the obstacles confronting intensification have monopolized the planners’ concerns. Second, the tension between green infrastructure and intensification explains the absence of green infrastructure typologies that correspond to intensification-compatible forms and dynamics. Finally, the lack of emphasis on the socio-economic benefits of green infrastructure reduces residents’ participation. Moreover, the results from the research provide insight into the predominant urbanization theories, New Urbanism and Landscape/Ecological Urbanism. In order to understand the political, planning, and ecological dynamics of such blending, dexterous context-specific planning is required. Findings suggest that the following factors influence the amalgamation of intensification and green infrastructure. First, producing ecosystem services-based justifications for green infrastructure development in the intensification context provides an expert-driven backbone for the implementation programs. This knowledge base should be translated effectively for different urban stakeholders. Moreover, due to the limited greenfields in intensified areas, the spatial distribution and development of multi-level corridors, such as pedestrian-hospitable settings and transportation networks along green infrastructure measures, are required. Finally, to ensure the long-term integrity of implemented green infrastructure measures, significant investment in public engagement and education, as well as clarification of management responsibilities, is essential.

Keywords: ecosystem services, green infrastructure, intensification, planning

Procedia PDF Downloads 321
24369 Analysis and Forecasting of Bitcoin Price Using Exogenous Data

Authors: J-C. Leneveu, A. Chereau, L. Mansart, T. Mesbah, M. Wyka

Abstract:

Extracting and interpreting information from Big Data will remain a key issue for years to come in several sectors, such as finance. Currently, numerous methods (such as Technical Analysis) are used to try to understand and anticipate market behavior, with mixed results, because it still seems impossible to predict a financial trend exactly. The growth and diversity of data available on the Internet represent a great opportunity for the financial world. Indeed, it is possible, alongside standard financial data, to focus on exogenous data in order to take more macroeconomic factors into account. Coupling the interpretation of these data with standard methods could yield more precise trend predictions. In this paper, in order to observe the influence of exogenous data on price independently of the other usual effects occurring in classical markets, the behaviors of Bitcoin users are introduced into a model reconstituting the Bitcoin value, which is elaborated and tested for prediction purposes.

Keywords: big data, bitcoin, data mining, social network, financial trends, exogenous data, global economy, behavioral finance

Procedia PDF Downloads 332