Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 31106
Granularity Analysis for Spatio-Temporal Web Sensors

Authors: Shun Hattori


In recent years, many researches to mine the exploding Web world, especially User Generated Content (UGC) such as weblogs, for knowledge about various phenomena and events in the physical world have been done actively, and also Web services with the Web-mined knowledge have begun to be developed for the public. However, there are few detailed investigations on how accurately Web-mined data reflect physical-world data. It must be problematic to idolatrously utilize the Web-mined data in public Web services without ensuring their accuracy sufficiently. Therefore, this paper introduces the simplest Web Sensor and spatiotemporallynormalized Web Sensor to extract spatiotemporal data about a target phenomenon from weblogs searched by keyword(s) representing the target phenomenon, and tries to validate the potential and reliability of the Web-sensed spatiotemporal data by four kinds of granularity analyses of coefficient correlation with temperature, rainfall, snowfall, and earthquake statistics per day by region of Japan Meteorological Agency as physical-world data: spatial granularity (region-s population density), temporal granularity (time period, e.g., per day vs. per week), representation granularity (e.g., “rain" vs. “heavy rain"), and media granularity (weblogs vs. microblogs such as Tweets).

Keywords: Web mining, knowledge extraction, Granularity analysis, spatiotemporal data mining, Web credibility, Web sensor

Digital Object Identifier (DOI):

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1562


[1] K. Dave, S. Lawrence, and D. M. Pennock, "Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews," in Proc. 12th International World Wide Web Conference (WWW-03), Hungary, pp. 519-528, 2003.
[2] S. Fujimura, M. Toyoda, and M. Kitsuregawa, "A Reputation Extraction Method Considering Structure of Sentence," in Proc. 16th IEICE Data Engineering Workshop (DEWS-05), Japan, 6C-i8, 2005.
[3] T. Tezuka, T. Kurashima, and K. Tanaka, "Toward Tighter Integration of Web Search with a Geographic Information System," in Proc. 15th Int-l World Wide Web Conference (WWW-06), Scotland, pp. 277-286, 2006.
[4] K. Inui, S. Abe, H. Morita, M. Eguchi, A. Sumida, C. Sao, K. Hara, K. Murakami, and S. Matsuyoshi, "Experience Mining: Building a Large-Scale Database of Personal Experiences and Opinions from Web Documents," in Proc. 7th IEEE/WIC/ACM International Conference on Web Intelligence (WI-08), Australia, pp. 314-321, 2008.
[5] M. A. Hearst, "Automatic Acquisition of Hyponyms from Large Text Corpora," in Proc. 14th International Conference on Computational Linguistics (COLING-92), France, vol. 2, pp. 539-545, 1992.
[6] M. Ruiz-Casado, E. Alfonseca, and P. Castells, "Automatising the Learning of Lexical Patterns: An Application to the Enrichment of WordNet by Extracting Semantic Relationships from Wikipedia," Data & Knowledge Engineering, vol. 61, no. 3, pp. 484-499, June 2007.
[7] S. Hattori, H. Ohshima, S. Oyama, and K. Tanaka, "Mining the Web for Hyponymy Relations based on Property Inheritance," in Proc. 10th Asia-Pacific Web Conf. (APWeb-08), LNCS vol. 4976, pp. 99-110, 2008.
[8] S. Hattori and K. Tanaka, "Extracting Concept Hierarchy Knowledge from the Web based on Property Inheritance and Aggregation," in Proc. 7th IEEE/WIC/ACM International Conference on Web Intelligence (WI-08), Australia, pp. 432-437, 2008.
[9] S. Hattori, "Object-oriented Semantic and Sensory Knowledge Extraction from the Web," in Web Intelligence and Intelligent Agents, In-Tech, ch. 18, pp. 365-390, 2010.
[10] S. Hattori, "Hyponym Extraction from the Web based on Property Inheritance of Text and Image Features," in Proc. 6th International Conference on Advances in Semantic Processing (SEMAPRO-12), Spain, pp. 109-114, 2012.
[11] T. Tezuka and K. Tanaka, "Visual Description Conversion for Enhancing Search Engines and Navigational Systems," in Proc. 8th Asia-Pacific Web Conference (APWeb-06), China, LNCS vol. 3841, pp. 955-960, 2006.
[12] S. Hattori, T. Tezuka, and K. Tanaka, "Mining the Web for Appearance Description," in Proc. 18th International Conference on Database and Expert Systems Applications (DEXA-07), Germany, LNCS vol. 4653, pp. 790-800, 2007.
[13] S. Hattori, "Peculiar Image Retrieval by Cross-Language Web-extracted Appearance Descriptions," Int-l Journal of Computer Information Systems and Industrial Management, MIR Labs, vol. 4, pp. 486-495, Dec. 2011.
[14] S. Hattori, "Hyponymy-Based Peculiar Image Retrieval," International Journal of Computer Information Systems and Industrial Management (IJCISIM), MIR Labs, vol. 5, pp. 79-88, June 2012.
[15] S. Hattori and K. Tanaka, "Mining the Web for Access Decision-Making in Secure Spaces," in Proc. Joint 4th Int-l Conference on Soft Computing and Intelligent Systems and 9th International Symposium on advanced Intelligent Systems (SCIS&ISIS-08), Japan, TH-G3-4, pp. 370-375, 2008.
[16] S. Hattori, "Secure Spaces and Spatio-Temporal Weblog Sensors with Temporal Shift and Propagation," in Proc. 1st IRAST International Conference on Data Engineering and Internet Technology (DEIT-11), Indonesia, LNEE vol. 157, pp. 343-349, 2011.
[17] S. Hattori, "Linearly-Combined Web Sensors for Spatio-Temporal Data Extraction from the Web," in Proc. 6th Int-l Workshop on Spatial and Spatiotemporal Data Mining (SSTDM-11), Canada, pp. 897-904, 2011.
[18] S. Hattori, "Spatio-Temporal Web Sensors by Social Network Analysis," in Proc. 3rd International Workshop on Business Applications of Social Network Analysis (BASNA-12), Turkey, pp. 1020-1027, 2012.
[19] Japan Meteorological Agency,
[20] S. Hattori and K. Tanaka, "Towards Building Secure Smart Spaces for Information Security in the Physical World," Journal of Advanced Computational Intelligence and Intelligent Informatics (JACIII), Fuji Technology Press, vol. 11, no. 8, pp. 1023-1029, September 2007.
[21] S. Hattori and K. Tanaka, "Secure Spaces: Protecting Freedom of Information Access in Public Places," in Proc. 5th International Conference on Smart Homes and Health Telematics (ICOST-07), Japan, LNCS vol. 4541, pp. 99-109, 2007.
[22] S. Hattori, "Context-Aware Query Control for Secure Spaces," Journal of Computer Technology and Application (JCTA), David Publishing, vol. 3, no. 2, pp. 130-139, February 2012.
[23] S. Hattori, "Ability-Based Expression Control for Secure Spaces," Proc. Joint 6th International Conference on Soft Computing and Intelligent Systems and 13th International Symposium on advanced Intelligent Systems (SCIS&ISIS-12), Japan, F1-54-3, pp. 1298-1303, 2012.
[24] Google Web Search,