Localization of Geospatial Events and Hoax Prediction in the UFO Database
Unidentified Flying Objects (UFOs) are a topic of enduring interest to enthusiasts, and people across the United States report sightings online to the National UFO Reporting Center (NUFORC). Some of these reports are hoaxes. For those that appear legitimate, our task is not to establish that they involve flying objects of extraterrestrial origin; rather, we aim to identify whether a report is a hoax, as labeled by the UFO database team under their existing curation criteria. The database also provides a wealth of information that supports further analyses and insights, such as social reporting patterns and the identification of real-time spatial events. We localize these time-series geospatial events and correlate them with known real-time events. This paper does not confirm the legitimacy of any alien activity; it attempts to gather information from likely legitimate UFO reports by studying the online reports. The events occur in geospatial clusters and are also time-based. We use cluster density and data visualization to search the space of cluster realizations and select the most probable clusters, which provide information about the proximity of such activity. We also present a random forest classifier that distinguishes true events from hoax events using the best available features, such as region, week, time period, and duration. Lastly, we show the performance of the scheme on various days and correlate the results with real-time events; one of the UFO reports strongly correlates with a missile test conducted in the United States.
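The abstract does not name the clustering algorithm used, so the following is only a minimal sketch of one plausible way to find dense geospatial clusters of reports. It assumes a DBSCAN-style density clustering over great-circle (haversine) distance. The function names, the `eps_km` and `min_pts` parameters, and the sample coordinates are all hypothetical illustrations, not values taken from the paper or from the NUFORC data. The time dimension is ignored here for brevity.

```python
import math

def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points given in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * math.asin(math.sqrt(h))

def dbscan(points, eps_km, min_pts):
    """Assumed DBSCAN-style clustering: returns a cluster id per point, -1 for noise."""
    labels = [None] * len(points)

    def neighbors(i):
        # All points (including i itself) within eps_km of point i.
        return [j for j in range(len(points))
                if haversine_km(points[i], points[j]) <= eps_km]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = -1          # provisionally noise; may become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == -1:
                labels[j] = cluster  # noise reached from a core point becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_pts:
                queue.extend(reachable)  # j is a core point, so keep expanding
    return labels

# Hypothetical report coordinates: two tight groups and one isolated report.
reports = [
    (34.05, -118.24), (34.06, -118.25), (34.04, -118.23),
    (33.45, -112.07), (33.46, -112.08), (33.44, -112.06),
    (47.60, -122.30),
]
labels = dbscan(reports, eps_km=10.0, min_pts=3)
print(labels)
```

Varying `eps_km` and `min_pts` produces different cluster realizations, which is one way to explore the kind of search over candidate clusterings that the abstract describes.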
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.2571831