Search results for: text extraction
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 3078

Search results for: text extraction

2958 Text Analysis to Support Structuring and Modelling a Public Policy Problem-Outline of an Algorithm to Extract Inferences from Textual Data

Authors: Claudia Ehrentraut, Osama Ibrahim, Hercules Dalianis

Abstract:

Policy making situations are real-world problems that exhibit complexity in that they are composed of many interrelated problems and issues. To be effective, policies must holistically address the complexity of the situation rather than propose solutions to single problems. Formulating and understanding the situation and its complex dynamics, therefore, is a key to finding holistic solutions. Analysis of text based information on the policy problem, using Natural Language Processing (NLP) and Text analysis techniques, can support modelling of public policy problem situations in a more objective way based on domain experts knowledge and scientific evidence. The objective behind this study is to support modelling of public policy problem situations, using text analysis of verbal descriptions of the problem. We propose a formal methodology for analysis of qualitative data from multiple information sources on a policy problem to construct a causal diagram of the problem. The analysis process aims at identifying key variables, linking them by cause-effect relationships and mapping that structure into a graphical representation that is adequate for designing action alternatives, i.e., policy options. This study describes the outline of an algorithm used to automate the initial step of a larger methodological approach, which is so far done manually. In this initial step, inferences about key variables and their interrelationships are extracted from textual data to support a better problem structuring. A small prototype for this step is also presented.

Keywords: public policy, problem structuring, qualitative analysis, natural language processing, algorithm, inference extraction

Procedia PDF Downloads 560
2957 Interactive, Topic-Oriented Search Support by a Centroid-Based Text Categorisation

Authors: Mario Kubek, Herwig Unger

Abstract:

Centroid terms are single words that semantically and topically characterise text documents and so may serve as their very compact representation in automatic text processing. In the present paper, centroids are used to measure the relevance of text documents with respect to a given search query. Thus, a new graphbased paradigm for searching texts in large corpora is proposed and evaluated against keyword-based methods. The first, promising experimental results demonstrate the usefulness of the centroid-based search procedure. It is shown that especially the routing of search queries in interactive and decentralised search systems can be greatly improved by applying this approach. A detailed discussion on further fields of its application completes this contribution.

Keywords: search algorithm, centroid, query, keyword, co-occurrence, categorisation

Procedia PDF Downloads 253
2956 Binarization and Recognition of Characters from Historical Degraded Documents

Authors: Bency Jacob, S.B. Waykar

Abstract:

Degradations in historical document images appear due to aging of the documents. It is very difficult to understand and retrieve text from badly degraded documents as there is variation between the document foreground and background. Thresholding of such document images either result in broken characters or detection of false texts. Numerous algorithms exist that can separate text and background efficiently in the textual regions of the document; but portions of background are mistaken as text in areas that hardly contain any text. This paper presents a way to overcome these problems by a robust binarization technique that recovers the text from a severely degraded document images and thereby increases the accuracy of optical character recognition systems. The proposed document recovery algorithm efficiently removes degradations from document images. Here we are using the ostus method ,local thresholding and global thresholding and after the binarization training and recognizing the characters in the degraded documents.

Keywords: binarization, denoising, global thresholding, local thresholding, thresholding

Procedia PDF Downloads 311
2955 First Approach on Lycopene Extraction Using Limonene

Authors: M. A. Ferhat, M. N. Boukhatem, F. Chemat

Abstract:

Lycopene extraction with petroleum derivatives as solvents has caused safety, health, and environmental concerns everywhere. Thus, finding a safe alternative solvent will have a strong and positive impact on environments and general health of the world population. d-limonene from the orange peel was extracted through a steam distillation procedure followed by a deterpenation process and combining this achievement by using it as a solvent for extracting lycopene from tomato fruit as a substitute of dichloromethane. Lycopene content of fresh tomatoes was determined by high-performance liquid chromatography after extraction. Yields obtained for both extractions showed that yields of d-limonene’s extracts were almost equivalent to those obtained using dichloromethane. The proposed approach using a green solvent to perform extraction is useful and can be considered as a nice alternative to conventional petroleum solvent where toxicity for both operator and environment is reduced.

Keywords: alternative solvent, d-limonene, extraction, lycopene

Procedia PDF Downloads 383
2954 Extraction of Natural Colorant from the Flowers of Flame of Forest Using Ultrasound

Authors: Sunny Arora, Meghal A. Desai

Abstract:

An impetus towards green consumerism and implementation of sustainable techniques, consumption of natural products and utilization of environment friendly techniques have gained accelerated acceptance. Butein, a natural colorant, has many medicinal properties apart from its use in dyeing industries. Extraction of butein from the flowers of flame of forest was carried out using ultrasonication bath. Solid loading (2-6 g), extraction time (30-50 min), volume of solvent (30-50 mL) and types of solvent (methanol, ethanol and water) have been studied to maximize the yield of butein using the Taguchi method. The highest yield of butein 4.67% (w/w) was obtained using 4 g of plant material, 40 min of extraction time and 30 mL volume of methanol as a solvent. The present method provided a greater reduction in extraction time compared to the conventional method of extraction. Hence, the outcome of the present investigation could further be utilized to develop the method at a higher scale.

Keywords: butein, flowers of Flame of the Forest, Taguchi method, ultrasonic bath

Procedia PDF Downloads 442
2953 Adaptation of Projection Profile Algorithm for Skewed Handwritten Text Line Detection

Authors: Kayode A. Olaniyi, Tola. M. Osifeko, Adeola A. Ogunleye

Abstract:

Text line segmentation is an important step in document image processing. It represents a labeling process that assigns the same label using distance metric probability to spatially aligned units. Text line detection techniques have successfully been implemented mainly in printed documents. However, processing of the handwritten texts especially unconstrained documents has remained a key problem. This is because the unconstrained hand-written text lines are often not uniformly skewed. The spaces between text lines may not be obvious, complicated by the nature of handwriting and, overlapping ascenders and/or descenders of some characters. Hence, text lines detection and segmentation represents a leading challenge in handwritten document image processing. Text line detection methods that rely on the traditional global projection profile of the text document cannot efficiently confront with the problem of variable skew angles between different text lines. Hence, the formulation of a horizontal line as a separator is often not efficient. This paper presents a technique to segment a handwritten document into distinct lines of text. The proposed algorithm starts, by partitioning the initial text image into columns, across its width into chunks of about 5% each. At each vertical strip of 5%, the histogram of horizontal runs is projected. We have worked with the assumption that text appearing in a single strip is almost parallel to each other. The algorithm developed provides a sliding window through the first vertical strip on the left side of the page. It runs through to identify the new minimum corresponding to a valley in the projection profile. Each valley would represent the starting point of the orientation line and the ending point is the minimum point on the projection profile of the next vertical strip. The derived text-lines traverse around any obstructing handwritten vertical strips of connected component by associating it to either the line above or below. A decision of associating such connected component is made by the probability obtained from a distance metric decision. The technique outperforms the global projection profile for text line segmentation and it is robust to handle skewed documents and those with lines running into each other.

Keywords: connected-component, projection-profile, segmentation, text-line

Procedia PDF Downloads 93
2952 Glossematics and Textual Structure

Authors: Abdelhadi Nadjer

Abstract:

The structure of the text to the systemic school -(glossématique-Helmslev). At the beginning of the note we have a cursory look around the concepts of general linguistics The science that studies scientific study of human language based on the description and preview the facts away from the trend of education than we gave a detailed overview the founder of systemic school and most important customers and more methods and curriculum theory and analysis they extend to all humanities, practical action each offset by a theoretical and the procedure can be analyzed through the elements that pose as another method we talked to its links with other language schools where they are based on the sharp criticism of the language before and deflected into consideration for the field of language and its erection has outside or language network and its participation in the actions (non-linguistic) and after that we started our Valglosamatik analytical structure of the text is ejected text terminal or all of the words to was put for expression. This text Negotiable divided into types in turn are divided into classes and class should not be carrying a contradiction and be inclusive. It is on the same materials as described relationships that combine language and seeks to describe their relations and identified.

Keywords: text, language schools, linguistics, human language

Procedia PDF Downloads 427
2951 We Wonder If They Mind: An Empirical Inquiry into the Narratological Function of Mind Wandering in Readers of Literary Texts

Authors: Tina Ternes, Florian Kleinau

Abstract:

The study investigates the content and triggers of mind wandering (MW) in readers of fictional texts. It asks whether readers’ MW is productive (text-related) or unproductive (text-unrelated). Methodologically, it bridges the gap between narratological and data-driven approaches by utilizing a sentence-by-sentence self-paced reading paradigm combined with thought probes in the reading of an excerpt of A. L. Kennedy’s “Baby Blue”. Results show that the contents of MW can be linked to text properties. We validated the role of self-reference in MW and found prediction errors to be triggers of MW. Results also indicate that the content of MW often travels along the lines of the text at hand and can thus be viewed as productive and integral to interpretation.

Keywords: narratology, mind wandering, reading fiction, meta cognition

Procedia PDF Downloads 55
2950 Incorporating Information Gain in Regular Expressions Based Classifiers

Authors: Rosa L. Figueroa, Christopher A. Flores, Qing Zeng-Treitler

Abstract:

A regular expression consists of sequence characters which allow describing a text path. Usually, in clinical research, regular expressions are manually created by programmers together with domain experts. Lately, there have been several efforts to investigate how to generate them automatically. This article presents a text classification algorithm based on regexes. The algorithm named REX was designed, and then, implemented as a simplified method to create regexes to classify Spanish text automatically. In order to classify ambiguous cases, such as, when multiple labels are assigned to a testing example, REX includes an information gain method Two sets of data were used to evaluate the algorithm’s effectiveness in clinical text classification tasks. The results indicate that the regular expression based classifier proposed in this work performs statically better regarding accuracy and F-measure than Support Vector Machine and Naïve Bayes for both datasets.

Keywords: information gain, regular expressions, smith-waterman algorithm, text classification

Procedia PDF Downloads 292
2949 Microwave and Ultrasound Assisted Extraction of Pectin from Mandarin and Lemon Peel: Comparisons between Sources and Methods

Authors: Pınar Karbuz, A. Seyhun Kıpcak, Mehmet B. Piskin, Emek Derun, Nurcan Tugrul

Abstract:

Pectin is a complex colloidal polysaccharide, found on the cell walls of all young plants such as fruit and vegetables. It acts as a thickening, stabilizing and gelling agent in foods. Pectin was extracted from mandarin and lemon peels using ultrasound and microwave assisted extraction methods to compare with these two different sources and methods of pectin production. In this work, the effect of microwave power (360, 600 W) and irradiation time (1, 2, 3 min) on the yield of extracted pectin from mandarin and lemon peels for microwave assisted extraction (MAE) were investigated. For ultrasound assisted extraction (UAE), parameters were determined as temperature (60, 75 °C) and sonication time (15, 30, 45 min) and hydrochloric acid (HCl) was used as an extracting agent for both extraction methods. The highest yields of extracted pectin from lemon peels were found to be 8.16 % (w/w) for 75 °C, 45 min by UAE and 8.58 % (w/w) for 360 W, 1 min by MAE. Additionally, the highest yields of extracted pectin from mandarin peels were found to be 11.29 % (w/w) for 75 °C, 45 min by UAE and 16.44 % (w/w) for 600 W, 1 min by MAE. The results showed that the use of microwave assisted extraction promoted a better yield when compared to the two extraction methods. On the other hand, according to the results of experiments, mandarin peels contain more pectin than lemon peels when the compared to the pectin product values of two sources. Therefore, these results suggested that MAE could be used as an efficient and rapid method for extraction of pectin and mandarin peels should be preferred as sources of pectin production compared to lemon peels.

Keywords: mandarin peel, lemon peel, pectin, ultrasound, microwave, extraction

Procedia PDF Downloads 214
2948 Arabic Light Stemmer for Better Search Accuracy

Authors: Sahar Khedr, Dina Sayed, Ayman Hanafy

Abstract:

Arabic is one of the most ancient and critical languages in the world. It has over than 250 million Arabic native speakers and more than twenty countries having Arabic as one of its official languages. In the past decade, we have witnessed a rapid evolution in smart devices, social network and technology sector which led to the need to provide tools and libraries that properly tackle the Arabic language in different domains. Stemming is one of the most crucial linguistic fundamentals. It is used in many applications especially in information extraction and text mining fields. The motivation behind this work is to enhance the Arabic light stemmer to serve the data mining industry and leverage it in an open source community. The presented implementation works on enhancing the Arabic light stemmer by utilizing and enhancing an algorithm that provides an extension for a new set of rules and patterns accompanied by adjusted procedure. This study has proven a significant enhancement for better search accuracy with an average 10% improvement in comparison with previous works.

Keywords: Arabic data mining, Arabic Information extraction, Arabic Light stemmer, Arabic stemmer

Procedia PDF Downloads 280
2947 Physical Parameters Influencing the Yield of Nigella Sativa Oil Extracted by Hydraulic Pressing

Authors: Hadjadj Naima, K. Mahdi, D. Belhachat, F. S. Ait Chaouche, A. Ferradji

Abstract:

The Nigella Sativa oil yield extracted by hydraulic pressing is influenced by the pressure temperature and size particles. The optimization of oil extraction is investigated. The rate of extraction of the whole seeds is very weak, a crushing of seeds is necessary to facilitate the extraction. This rate augments with the rise of the temperature and the pressure, and decrease of size particles. The best output (66%) is obtained for a granulometry lower than 1mm, a temperature of 50°C and a pressure of 120 bars.

Keywords: oil, Nigella sativa, extraction, optimization, temperature, pressure

Procedia PDF Downloads 449
2946 A Unique Exact Approach to Handle a Time-Delayed State-Space System: The Extraction of Juice Process

Authors: Mohamed T. Faheem Saidahmed, Ahmed M. Attiya Ibrahim, Basma GH. Elkilany

Abstract:

This paper discusses the application of Time Delay Control (TDC) compensation technique in the juice extraction process in a sugar mill. The objective is to improve the control performance of the process and increase extraction efficiency. The paper presents the mathematical model of the juice extraction process and the design of the TDC compensation controller. Simulation results show that the TDC compensation technique can effectively suppress the time delay effect in the process and improve control performance. The extraction efficiency is also significantly increased with the application of the TDC compensation technique. The proposed approach provides a practical solution for improving the juice extraction process in sugar mills using MATLAB Processes.

Keywords: time delay control (TDC), exact and unique state space model, delay compensation, Smith predictor.

Procedia PDF Downloads 52
2945 Multi-Class Text Classification Using Ensembles of Classifiers

Authors: Syed Basit Ali Shah Bukhari, Yan Qiang, Saad Abdul Rauf, Syed Saqlaina Bukhari

Abstract:

Text Classification is the methodology to classify any given text into the respective category from a given set of categories. It is highly important and vital to use proper set of pre-processing , feature selection and classification techniques to achieve this purpose. In this paper we have used different ensemble techniques along with variance in feature selection parameters to see the change in overall accuracy of the result and also on some other individual class based features which include precision value of each individual category of the text. After subjecting our data through pre-processing and feature selection techniques , different individual classifiers were tested first and after that classifiers were combined to form ensembles to increase their accuracy. Later we also studied the impact of decreasing the classification categories on over all accuracy of data. Text classification is highly used in sentiment analysis on social media sites such as twitter for realizing people’s opinions about any cause or it is also used to analyze customer’s reviews about certain products or services. Opinion mining is a vital task in data mining and text categorization is a back-bone to opinion mining.

Keywords: Natural Language Processing, Ensemble Classifier, Bagging Classifier, AdaBoost

Procedia PDF Downloads 205
2944 Recovery of Essential Oil from Zingiber Officinale Var. Bentong Using Ultrasound Assisted-Supercritical Carbon Dioxide Extraction

Authors: Norhidayah Suleiman, Afza Zulfaka

Abstract:

Zingiber officinale var. Bentong has been identified as the source of high added value compound specifically gingerol-related compounds. The extraction of the high-value compound using conventional method resulted in low yield and time consumption. Hence, the motivation for this work is to investigate the effect of the extraction technique on the essential oil from Zingiber officinale var. Bentong rhizome for commercialization purpose in many industries namely, functional food, pharmaceutical, and cosmeceutical. The investigation begins with a pre-treatment using ultrasound assisted in order to enhance the recovery of essential oil. It was conducted at a fixed frequency (20 kHz) of ultrasound with various time (10, 20, 40 min). The extraction using supercritical carbon dioxide (scCO2) were carried out afterward at a specific condition of temperature (50 °C) and pressure (30 MPa). scCO2 extraction seems to be a promising sustainable green method for the extraction of essential oil due to the benefits that CO2 possesses. The expected results demonstrated the ultrasound-assisted-scCO2 produces a higher yield of essential oil compared to solely scCO2 extraction. This research will provide important features for its application in food supplements or phytochemical preparations.

Keywords: essential oil, scCO2, ultrasound assisted, Zingiber officinale Var. Bentong

Procedia PDF Downloads 107
2943 Resource Creation Using Natural Language Processing Techniques for Malay Translated Qur'an

Authors: Nor Diana Ahmad, Eric Atwell, Brandon Bennett

Abstract:

Text processing techniques for English have been developed for several decades. But for the Malay language, text processing methods are still far behind. Moreover, there are limited resources, tools for computational linguistic analysis available for the Malay language. Therefore, this research presents the use of natural language processing (NLP) in processing Malay translated Qur’an text. As the result, a new language resource for Malay translated Qur’an was created. This resource will help other researchers to build the necessary processing tools for the Malay language. This research also develops a simple question-answer prototype to demonstrate the use of the Malay Qur’an resource for text processing. This prototype has been developed using Python. The prototype pre-processes the Malay Qur’an and an input query using a stemming algorithm and then searches for occurrences of the query word stem. The result produced shows improved matching likelihood between user query and its answer. A POS-tagging algorithm has also been produced. The stemming and tagging algorithms can be used as tools for research related to other Malay texts and can be used to support applications such as information retrieval, question answering systems, ontology-based search and other text analysis tasks.

Keywords: language resource, Malay translated Qur'an, natural language processing (NLP), text processing

Procedia PDF Downloads 285
2942 Method of Complex Estimation of Text Perusal and Indicators of Reading Quality in Different Types of Commercials

Authors: Victor N. Anisimov, Lyubov A. Boyko, Yazgul R. Almukhametova, Natalia V. Galkina, Alexander V. Latanov

Abstract:

Modern commercials presented on billboards, TV and on the Internet contain a lot of information about the product or service in text form. However, this information cannot always be perceived and understood by consumers. Typical sociological focus group studies often cannot reveal important features of the interpretation and understanding information that has been read in text messages. In addition, there is no reliable method to determine the degree of understanding of the information contained in a text. Only the fact of viewing a text does not mean that consumer has perceived and understood the meaning of this text. At the same time, the tools based on marketing analysis allow only to indirectly estimate the process of reading and understanding a text. Therefore, the aim of this work is to develop a valid method of recording objective indicators in real time for assessing the fact of reading and the degree of text comprehension. Psychophysiological parameters recorded during text reading can form the basis for this objective method. We studied the relationship between multimodal psychophysiological parameters and the process of text comprehension during reading using the method of correlation analysis. We used eye-tracking technology to record eye movements parameters to estimate visual attention, electroencephalography (EEG) to assess cognitive load and polygraphic indicators (skin-galvanic reaction, SGR) that reflect the emotional state of the respondent during text reading. We revealed reliable interrelations between perceiving the information and the dynamics of psychophysiological parameters during reading the text in commercials. Eye movement parameters reflected the difficulties arising in respondents during perceiving ambiguous parts of text. EEG dynamics in rate of alpha band were related with cumulative effect of cognitive load. SGR dynamics were related with emotional state of the respondent and with the meaning of text and type of commercial. EEG and polygraph parameters together also reflected the mental difficulties of respondents in understanding text and showed significant differences in cases of low and high text comprehension. We also revealed differences in psychophysiological parameters for different type of commercials (static vs. video, financial vs. cinema vs. pharmaceutics vs. mobile communication, etc.). Conclusions: Our methodology allows to perform multimodal evaluation of text perusal and the quality of text reading in commercials. In general, our results indicate the possibility of designing an integral model to estimate the comprehension of reading the commercial text in percent scale based on all noticed markers.

Keywords: reading, commercials, eye movements, EEG, polygraphic indicators

Procedia PDF Downloads 139
2941 Extraction of Strontium Ions through Ligand Assisted Ionic Liquids

Authors: Pradeep Kumar, Abhishek Kumar Chandra, Ashok Khanna

Abstract:

Extraction of Strontium by crown ether (DCH18C6) hasbeen investigated in [BMIM][TF2N] Ionic Liquid (IL) giving higher extraction ~98% and distribution ratio as compared to other organic solvents (Dodecane, Hexane, & Isodecyl alcohol + Dodecane). Distribution ratio of Sr in IL at 0.15M DCH18C6 indicates an enhancement of 20000, 2000, 500 times over Dodecane, Hexane and 5% Isodecyl Alcohol + 95 % Dodecane at 0.01M aqueous acidity respectively. In presence of IL, Sr extraction decreases with increase in HNO3 concentration in aqueous phase whereas opposite trend was observed with organic solvents.Extraction of Sr initially increases with increase in DCH18C6 concentration in IL, finally reaching an asymptotic constant.

Keywords: distribution ratio, ionic liquid, ligand, organic solvent, stripping

Procedia PDF Downloads 412
2940 Text Based Shuffling Algorithm on Graphics Processing Unit for Digital Watermarking

Authors: Zayar Phyo, Ei Chaw Htoon

Abstract:

In a New-LSB based Steganography method, the Fisher-Yates algorithm is used to permute an existing array randomly. However, that algorithm performance became slower and occurred memory overflow problem while processing the large dimension of images. Therefore, the Text-Based Shuffling algorithm aimed to select only necessary pixels as hiding characters at the specific position of an image according to the length of the input text. In this paper, the enhanced text-based shuffling algorithm is presented with the powered of GPU to improve more excellent performance. The proposed algorithm employs the OpenCL Aparapi framework, along with XORShift Kernel including the Pseudo-Random Number Generator (PRNG) Kernel. PRNG is applied to produce random numbers inside the kernel of OpenCL. The experiment of the proposed algorithm is carried out by practicing GPU that it can perform faster-processing speed and better efficiency without getting the disruption of unnecessary operating system tasks.

Keywords: LSB based steganography, Fisher-Yates algorithm, text-based shuffling algorithm, OpenCL, XORShiftKernel

Procedia PDF Downloads 121
2939 Response Surface Methodology for the Optimization of Sugar Extraction from Phoenix dactylifera L.

Authors: Lila Boulekbache-Makhlouf, Kahina Djaoud, Myriam Tazarourte, Samir Hadjal, Khodir Madani

Abstract:

In Algeria, important quantities of secondary date variety (Phoenix dactylifera L.) are generated in each campaign; their chemical composition is similar to that of commercial dates. The present work aims to valorize this common date variety (Degla-Beida) which is often poorly exploited. In this context, we tried to prepare syrup from the secondary date variety and to evaluate the effect of conventional extraction (CE) or water bath extraction (WBE) and alternative extraction (microwaves assisted extraction (MAE), and ultrasounds assisted extraction (UAE)) on its total sugar content (TSC), using response surface methodology (RSM). Then, the analysis of individual sugars was performed by high-performance liquid chromatography (HPLC). Maximum predicted TSC recoveries under the optimized conditions for MAE, UAE and CE were 233.248 ± 3.594 g/l, 202.889 ± 5.797 g/l, and 233.535 ± 5.412 g/l, respectively, which were close to the experimental values: 233.796 ± 1.898 g/l; 202.037 ± 3.401 g/l and 234.380 ± 2.425 g/l. HPLC analysis revealed high similarity in the sugar composition of date juices obtained by MAE (60.11% sucrose, 16.64% glucose and 23.25% fructose) and CE (50.78% sucrose, 20.67% glucose and 28.55% fructose), although a large difference was detected for that obtained by UAE (0.00% sucrose, 46.94% glucose and 53.06% fructose). Microwave-assisted extraction was the best method for the preparation of date syrup with an optimal recovery of total sugar content. However, ultrasound-assisted extraction was the best one for the preparation of date syrup with high content of reducing sugars.

Keywords: dates, extraction, RSM, sugars, syrup

Procedia PDF Downloads 129
2938 The Effect of Ionic Strength on the Extraction of Copper(II) from Perchlorate Solutions by Capric Acid in Chloroform

Authors: A. Bara, D. Barkat

Abstract:

The liquid-liquid extraction of copper (II) from aqueous solution by capric acid (HL) in chloroform at 25°C has been studied. The ionic strength effect of the aqueous phase shows that the extraction of copper(II) increases with the increase in ionic strength. with different ionic strengths 1, 0.5, 0.25, 0.125 and 0.1M in the aqueous phase. Cu (II) is extracted as the complex CuL2(ClO4).

Keywords: liquid-liquid extraction, ionic strength, copper (II), capric acid

Procedia PDF Downloads 506
2937 Text Mining of Twitter Data Using a Latent Dirichlet Allocation Topic Model and Sentiment Analysis

Authors: Sidi Yang, Haiyi Zhang

Abstract:

Twitter is a microblogging platform, where millions of users daily share their attitudes, views, and opinions. Using a probabilistic Latent Dirichlet Allocation (LDA) topic model to discern the most popular topics in the Twitter data is an effective way to analyze a large set of tweets to find a set of topics in a computationally efficient manner. Sentiment analysis provides an effective method to show the emotions and sentiments found in each tweet and an efficient way to summarize the results in a manner that is clearly understood. The primary goal of this paper is to explore text mining, extract and analyze useful information from unstructured text using two approaches: LDA topic modelling and sentiment analysis by examining Twitter plain text data in English. These two methods allow people to dig data more effectively and efficiently. LDA topic model and sentiment analysis can also be applied to provide insight views in business and scientific fields.

Keywords: text mining, Twitter, topic model, sentiment analysis

Procedia PDF Downloads 150
2936 The Effect of Different Extraction Techniques on the Yield and the Composition of Oil (Laurus Nobilis L.) Fruits Widespread in Syria

Authors: Khaled Mawardi

Abstract:

Bay laurel (Laurus nobilis L.) is an evergreen of the Laurus genus of the Lauraceae Family. It is a plant native to the southern Mediterranean and widespread in Syria. It is a plant with enormous industrial applications. For instance, they are used as platform chemicals in food, pharmaceutical and cosmetic applications. Herein, we report an efficient extraction of Bay laurel oil from Bay laurel fruits via a comparative investigation of boiled water conventional extraction technique and microwave-assisted extraction (MAE) by microwave heating at atmospheric pressure. In order to optimize the extraction efficiency, we investigated several extraction parameters, such as extraction time and microwave power. In addition, to demonstrate the feasibility of the method, oil obtained under optimal conditions by method (MAE) was compared quantitatively and qualitatively with that obtained by the conventional method. After 1h of microwave-assisted extraction (power of 600W), an oil yield of 9.8% with identified lauric acid content of 22.7%. In comparison, an extended extraction of up to 4h was required to obtain a 9.7% yield of oil extraction with 21.2% of lauric acid content. The change in microwave power impacts the fatty acids profile and also the quality parameters of Laurel Oil. It was found that the profile of fatty acids changed with the power, where the lauric acid content increased from 22.7% at 600W to 30.5% at 1200W owing to a decrease of oleic acid content from 32.8% at 600W to 28.3% at 1200W and linoleic acid content from 22.3% at 600W to 20.6% at 1200W. In addition, we observed a decrease in oil yield from 9.8% at 600W to 5.1% at 1200W. Summarily, the overall results indicated that the extraction of laurel fruit oils could be successfully performed using (MAE) at a short extraction time and lower energy compared with the fixed oil obtained by conventional processes of extraction. Microwave heating exerted more aggressive effects on the oil. Indeed, microwave heating inflicted changes in the fatty acids profile of oil; the most affected fraction was the unsaturated fatty acids, with higher susceptibility to oxidation.

Keywords: microwaves, extraction, Laurel oil, solvent-free

Procedia PDF Downloads 42
2935 Text Localization in Fixed-Layout Documents Using Convolutional Networks in a Coarse-to-Fine Manner

Authors: Beier Zhu, Rui Zhang, Qi Song

Abstract:

Text contained within fixed-layout documents can be of great semantic value and so requires a high localization accuracy, such as ID cards, invoices, cheques, and passports. Recently, algorithms based on deep convolutional networks achieve high performance on text detection tasks. However, for text localization in fixed-layout documents, such algorithms detect word bounding boxes individually, which ignores the layout information. This paper presents a novel architecture built on convolutional neural networks (CNNs). A global text localization network and a regional bounding-box regression network are introduced to tackle the problem in a coarse-to-fine manner. The text localization network simultaneously locates word bounding points, which takes the layout information into account. The bounding-box regression network inputs the features pooled from arbitrarily sized RoIs and refine the localizations. These two networks share their convolutional features and are trained jointly. A typical type of fixed-layout documents: ID cards, is selected to evaluate the effectiveness of the proposed system. These networks are trained on data cropped from nature scene images, and synthetic data produced by a synthetic text generation engine. Experiments show that our approach locates high accuracy word bounding boxes and achieves state-of-the-art performance.

Keywords: bounding box regression, convolutional networks, fixed-layout documents, text localization

Procedia PDF Downloads 165
2934 Effect of Ultrasound on Carotenoids Extraction from Pepper and Process Optimization Using Response Surface Methodology (RSM)

Authors: Elham Mahdian, Reza Karazhian, Rahele Dehghan Tanha

Abstract:

Pepper (Capsicum annum L.) which belong to the family Solananceae, are known for their versatility as a vegetable crop and are consumed both as fresh vegetables or dehydrated for spices. Pepper is considered an excellent source of bioactive nutrients. Ascorbic acid, carotenoids and phenolic compounds are its main antioxidant constituents. Ultrasound assisted extraction is an inexpensive, simple and efficient alternative to conventional extraction techniques. The mechanism of action for ultrasound-assisted extraction are attributed to cavitations, mechanical forces and thermal impact, which result in disruption of cells walls, reduce particle size, and enhance mass transfer across cell membranes. In this study, response surface methodology was used to optimize experimental conditions for ultrasonic assisted extraction of carotenoid compounds from Chili peppers. Variables were included extraction temperatures at 3 levels (30, 40 and 50 °C), extraction times at 3 levels (10, 25 and 40 minutes) and power at 3 levels (30, 60 and 90 %). It was observed that ultrasound waves applied at temperature of 49°C, time of 10 minutes and power 89 % resulted to the highest carotenoids contents (lycopene and β-carotene), while the lowest value was recorded in the control. Thus, results showed that ultrasound waves have strong impact on extraction of carotenoids from pepper.

Keywords: carotenoids, optimization, pepper, response surface methodology

Procedia PDF Downloads 436
2933 Oil Extraction from Sunflower Seed Using Green Solvent 2-Methyltetrahydrofuran and Isoamyl Alcohol

Authors: Sergio S. De Jesus, Aline Santana, Rubens Maciel Filho

Abstract:

The objective of this study was to choose and determine a green solvent system with similar extraction efficiencies as the traditional Bligh and Dyer method. Sunflower seed oil was extracted using Bligh and Dyer method with 2-methyltetrahydrofuran and isoamyl using alcohol ratios of 1:1; 2:1; 3:1; 1:2; 3:1. At the same time comparative experiments was performed with chloroform and methanol ratios of 1:1; 2:1; 3:1; 1:2; 3:1. Comparison study was done using 5 replicates (n=5). Statistical analysis was performed using Microsoft Office Excel (Microsoft, USA) to determine means and Tukey’s Honestly Significant Difference test for comparison between treatments (α = 0.05). The results showed that using classic method with methanol and chloroform presented the extraction oil yield with the values of 31-44% (w/w) and values of 36-45% (w/w) using green solvents for extractions. Among the two extraction methods, 2 methyltetrahydrofuran and isoamyl alcohol ratio 2:1 provided the best results (45% w/w), while the classic method using chloroform and methanol with ratio of 3:1 presented a extraction oil yield of 44% (w/w). It was concluded that the proposed extraction method using 2-methyltetrahydrofuran and isoamyl alcohol in this work allowed the same efficiency level as chloroform and methanol.

Keywords: extraction, green solvent, lipids, sugarcane

Procedia PDF Downloads 348
2932 Oil Extraction from Microalgae Dunalliela sp. by Polar and Non-Polar Solvents

Authors: A. Zonouzi, M. Auli, M. Javanmard Dakheli, M. A. Hejazi

Abstract:

Microalgae are tiny photosynthetic plants. Nowadays, microalgae are being used as nutrient-dense foods and sources of fine chemicals. They have significant amounts of lipid, carotenoids, vitamins, protein, minerals, chlorophyll, and pigments. Oil extraction from algae is a hotly debated topic currently because introducing an efficient method could decrease the process cost. This can determine the sustainability of algae-based foods. Scientific research works show that solvent extraction using chloroform/methanol (2:1) mixture is one of the efficient methods for oil extraction from algal cells, but both methanol and chloroform are toxic solvents, and therefore, the extracted oil will not be suitable for food application. In this paper, the effect of two food grade solvents (hexane and hexane/ isopropanol) on oil extraction yield from microalgae Dunaliella sp. was investigated and the results were compared with chloroform/methanol (2:1) extraction yield. It was observed that the oil extraction yield using hexane, hexane/isopropanol (3:2) and chloroform/methanol (2:1) mixture were 5.4, 13.93, and 17.5 (% w/w, dry basis), respectively. The fatty acid profile derived from GC illustrated that the palmitic (36.62%), oleic (18.62%), and stearic acids (19.08%) form the main portion of fatty acid composition of microalgae Dunalliela sp. oil. It was concluded that, the addition of isopropanol as polar solvent could increase the extraction yield significantly. Isopropanol solves cell wall phospholipids and enhances the release of intercellular lipids, which improves accessing of hexane to fatty acids.

Keywords: fatty acid profile‎, microalgae‎, oil extraction‎, polar solvent‎

Procedia PDF Downloads 343
2931 Green Extraction of Patchoulol from Patchouli Leaves Using Ultrasound-Assisted Ionic Liquids

Authors: G. C. Jadeja, M. A. Desai, D. R. Bhatt, J. K. Parikh

Abstract:

Green extraction techniques are fast paving ways into various industrial sectors due to the stringent governmental regulations leading to the banning of toxic chemicals’ usage and also due to the increasing health/environmental awareness. The present work describes the ionic liquids based sonication method for selectively extracting patchoulol from the leaves of patchouli. 1-Butyl-3-methylimidazolium tetrafluoroborate ([Bmim]BF4) and N,N,N,N’,N’,N’-Hexaethyl-butane-1,4-diammonium dibromide (dicationic ionic liquid - DIL) were selected for extraction. Ultrasound assisted ionic liquid extraction was employed considering concentration of ionic liquid (4–8 %, w/w), ultrasound power (50–150 W for [Bmim]BF4 and 20–80 W for DIL), temperature (30–50 oC) and extraction time (30–50 min) as major parameters influencing the yield of patchoulol. Using the Taguchi method, the parameters were optimized and analysis of variance (ANOVA) was performed to find the most influential factor in the selected extraction method. In case of [Bmim]BF4, the optimum conditions were found to be: 4 % (w/w) ionic liquid concentration, 50 W power, 30 oC temperature and extraction time of 30 min. The yield obtained under the optimum conditions was 3.99 mg/g. In case of DIL, the optimum conditions were obtained as 6 % (w/w) ionic liquid concentration, 80 W power, 30 oC temperature and extraction time of 40 min, for which the yield obtained was 4.03 mg/g. Temperature was found to be the most significant factor in both the cases. Extraction time was the insignificant parameter while extracting the product using [Bmim]BF4 and in case of DIL, power was found to be the least significant factor affecting the process. Thus, a green method of recovering patchoulol is proposed.

Keywords: green extraction, ultrasound, patchoulol, ionic liquids

Procedia PDF Downloads 331
2930 Literature Review: Microalgae as Functional Foods with Solvent Free Extraction

Authors: Angela Justina Kumalaputri

Abstract:

Indonesia, as a maritime country, has abundant marine living resources yet has not been optimally utilized. So far, we only focusing on fisheries. In the other hand, Indonesia, as the country with the fourth longest coastline, is a very good cultivation place for microalgae. Microalgae can be diversified to many important products, such as food, fuel, pharmaceutical products, functional food, and cosmetics.This research is focusing on the literature study about types of microalgae as sources for functional foods (such as antioxidants), including the contents and the separation methods. The research methods which we use are: (1) Literature study about various microalgaes (2) Literature study about extractions using supercritical fluid of CO₂, which are free from toxic organic solvents, environmentally friendly, and safe for food products. Supercritical fluid extraction using CO₂ (low critical points: temperature at 31.1 oC and pressure at 72.9 bars) could be done at a low temperature which are suitable for temperature labile compounds, low energy, and faster extraction time compared with conventional method of extraction.

Keywords: antioxidants, supercritical fluid extraction, solvent-free extraction, microalgae

Procedia PDF Downloads 45
2929 An Automatic Feature Extraction Technique for 2D Punch Shapes

Authors: Awais Ahmad Khan, Emad Abouel Nasr, H. M. A. Hussein, Abdulrahman Al-Ahmari

Abstract:

Sheet-metal parts have been widely applied in electronics, communication and mechanical industries in recent decades; but the advancement in sheet-metal part design and manufacturing is still behind in comparison with the increasing importance of sheet-metal parts in modern industry. This paper presents a methodology for automatic extraction of some common 2D internal sheet metal features. The features used in this study are taken from Unipunch ™ catalogue. The extraction process starts with the data extraction from STEP file using an object oriented approach and with the application of suitable algorithms and rules, all features contained in the catalogue are automatically extracted. Since the extracted features include geometry and engineering information, they will be effective for downstream application such as feature rebuilding and process planning.

Keywords: feature extraction, internal features, punch shapes, sheet metal

Procedia PDF Downloads 587