AudioMine: Medical Data Mining in Heterogeneous Audiology Records

Shaun Cox; Michael Oakes; Stefan Wermter; Maurice Hawthorne

Commenced in January 2007

Frequency: Monthly

Edition: International

Paper Count: 33093

AudioMine: Medical Data Mining in Heterogeneous Audiology Records

Authors: Shaun Cox, Michael Oakes, Stefan Wermter, Maurice Hawthorne

Abstract:

We report on the results of a pilot study in which a data-mining tool was developed for mining audiology records. The records were heterogeneous in that they contained numeric, category and textual data. The tools developed are designed to observe associations between any field in the records and any other field. The techniques employed were the statistical chi-squared test, and the use of self-organizing maps, an unsupervised neural learning approach.

Keywords: Audiology, data mining, chi-squared, self-organizing maps

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1058579

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1670

References:

[1] K. Cios. Medical Data Mining and Knowledge Discovery.Berlin/Heidelberg: Springer Verlag, 2001.
[2] I. Kononenko, I. Bratko and M. Kukar. ''Application of Machine Learning to Medical Diagnosis''. In R. Michalski, I. Bratko and M. Kubat (editors), Machine Learning and DataMining, Chichester: John Wiley, 1998, pp389-408.
[3] B. Porter and E. Bareiss. ''PROTOS: An experiment in knowledge acquisition for heuristic classification tasks''. Proc. First International Meeting on Advances in Learning (IMAL) Les Arcs, France, 1986, pp. 159-174.
[4] G. Holmes and L.Trigg. A diagnostic tool for tree-based supervised classification learning algorithms. URL: www.cs.waikato.ac.nz/~ml/publications/1999/99GH-LT-Diagnostic-Tool.pdf
[5] T. Dietterich. ''An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting and randomization''. Machine Learning 40(2), 2000.
[6] B. Ingrao, ''Portable electronic patient records''. Hearing Review 8(6), 2001.
[7] M. Oakes, Statistics for Corpus Linguistics. Edinburgh: Edinburgh University Press, 1998.
[8] R. Zembowicz and J.Zytkov, ''From contingency tables to various forms of knowledge in databases.'', In U. Fayyad, G. Piatetsky-Shapiro, P. Smyth and R. Uthurusamy (editors), Advances in Knowledge Discovery and Data Mining, , Cambridge Massachusetts: AIII Press/MIT Press, 1996, pp. 329-349.
[9] P. Rayson, G.Leech and M. Hodges. ''Social differentiation in the use of English vocabulary.'' International Journal of Corpus Linguistics 2(1), 1997, pp. 133-152.
[10] M. Oakes, R Gaizauskas, H Fowkes, et al. ''Comparision between a method based on the chi-square test and a support vector machine for document classification'', Proceedings of ACM-SIGIR, New Orleans, 2001.
[11] S. Wermter and V. Weber. ''SCREEN: Learning a flat syntactic and semantic spoken language analysis using artificial neural networks''. Journal of Artificial Intelligence Research, 6(1):35--85, 1997.
[12] T. Kohonen. ''Self organisation of very large documents: State of the art.'' Proc. ICANN98, the 8th International Conference on Artificial Neural Networks, volume 1, 1998. London: Springer 1998. pp. 65-74.
[13] D.W. Patterson. Artificial Neural Networks: Theory and Applications.Singapore: Simon & Schuster/Prentice Hall, 1996.
[14] S. Lundell, 2001. http://www.ida.his.se/ida/kurser/ai-ann/kursmaterial/tutorial/node38.html.