Web Content Mining: A Solution to Consumer's Product Hunt

Syed Salman Ahmed; Zahid Halim; Rauf Baig; Shariq Bashir

Abstract:

With the rapid growth in business size, today's businesses orient towards electronic technologies. Amazon.com and eBay.com are some of the major stakeholders in this regard. Unfortunately, the enormous volume of largely unstructured data on the web, even for a single commodity, has become a source of confusion for consumers. Extracting valuable information from such ever-increasing data is an extremely tedious task and is fast becoming critical to the success of businesses. Web content mining can play a major role in solving these issues. It involves using efficient algorithmic techniques to search and retrieve the desired information from seemingly unsearchable unstructured data on the Internet. The application of web content mining is promising in the areas of Customer Relations Modeling, billing records, logistics investigations, product cataloguing and quality management. In this paper we review some interesting, efficient yet implementable techniques from the field of web content mining and study their impact on business user needs, focusing on both the customer and the producer. The techniques reviewed are: mining by developing a knowledge-base repository of the domain, iterative refinement of user queries for personalized search, a graph-based approach to developing a web crawler, and filtering information for personalized search using website captions. These techniques are analyzed and compared on the basis of their execution time and the relevance of the results they produce for a particular search.

Keywords: Data mining, web mining, search engines, knowledge discovery.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1057755
