**Commenced**in January 2007

**Frequency:**Monthly

**Edition:**International

**Paper Count:**30526

##### Clustering Mixed Data Using Non-normal Regression Tree for Process Monitoring

**Authors:**
Youngji Yoo,
Jun Seok Kim,
Cheong-Sool Park,
Jun-Geol Baek,
Sung-Shick Kim,
Young-Hak Lee

**Abstract:**

**Keywords:**
Clustering,
Semiconductor,
regression tree,
non-normal mixed process data,
Statistical Quality Control (SQC),
Pearson
distribution system

**Digital Object Identifier (DOI):**
doi.org/10.5281/zenodo.1329428

**References:**

[1] H. Yoon, J.-G. Baek, C.-S. Park, and Y.-H. Lee, "A constrained clustering method for mixed process data using recursive partitioning and regression trees," in Proceedings of the IIE Asian Conference, 2012.

[2] S. Bagchi, R. J. Baseman, A. Davenport, R. Natarajan, N. Slonim, and S. Weiss, "Data analytics and stochastic modeling in a semiconductor fab," Applied Stochastic Models in Business and Industry, vol. 26, no. 1, pp. 1-27, 2010.

[3] C. F. Chien, W. C. Wang, and J. C. Cheng, "Data mining for yield enhancement in semiconductor manufacturing and an empirical study," Expert Systems with Applications, vol. 33, no. 1, pp. 192-198, 2007.

[4] D. J. Hand, "Principles of data mining," Drug Safety, vol. 30, no. 7, pp. 621-622, 2007.

[5] J. Quinlan, "Induction of decision trees," Machine Learning, vol. 1, no. 1, p. 81, 1986.

[6] J. Quinlan, C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, 1993.

[7] L. Breiman, Classification and regression trees. Wadsworth International Group, 1984.

[8] G. V. Kass, "An exploratory technique for investigating large quantities of categorical data," Journal of the Royal Statistical Society. Series C (Applied Statistics), vol. 29, no. 2, pp. 119-127, 1980.

[9] J. N. Morgan, J. A. Sonquist, J. N. Morgan, and J. A. Sonquist, "Problems in the analysis of survey data, and a proposal," Journal of the American Statistical Association, vol. 58, no. 302, p. 415, 1963.

[10] R. E. Walpole and R. H. Myers, Probability and Statistics for Engineers and Scientists, 5th ed. Macmillan Coll Div, 1993.

[11] H. B. Mann and D. R. Whitney, "On a test of whether one of two random variables is stochastically larger than the other," Annals of Mathematical Statistics, vol. 18, no. 1, p. 50, 1947.

[12] F. Wilcoxon, "Individual comparisons by ranking methods," Biometrics Bulletin, vol. 1, no. 6, p. 80, 1945.

[13] G. Casella and R. Berger, Statistical Inference. Duxbury Press, 2001.

[14] N. Henze, "A probabilistic representation of the skew-normal distribution," Scandinavian Journal of Statistics, vol. 13, no. 4, pp. 271-275, 1986.

[15] I. W. Burr, "Cumulative frequency functions," Annals of Mathematical Statistics, vol. 13, pp. 215-232, 1942.

[16] K. Pearson, "Contributions to the mathematical theory of evolution," Philosophical Transactions of the Royal Society of London. A, vol. 185, pp. 71-110, 1894.

[17] K. Pearson, "Contributions to the mathematical theory of evolution. ii. skew variation in homogeneous material," Philosophical Transactions of the Royal Society of London. A, vol. 186, pp. 343-414, 1895.

[18] K. Pearson, "Mathematical contributions to the theory of evolution. x. supplement to a memoir on skew variation." Philosophical Transactions of the Royal Society of London Series a-Containing Papers of a Mathematical or Physical Character, vol. 197, pp. 443-459, 1901.

[19] K. Pearson, "Mathematical contributions to the theory of evolution. - xix. second supplement to a memoir on skew variation." Philosophical Transactions of the Royal Society of London Series a-Containing Papers of a Mathematical or Physical Character, vol. 216, pp. 429-457, 1916.

[20] Y. Nagahara, "Non-gaussian filter and smoother based on the pearson distribution system," Journal of Time Series Analysis, vol. 24, no. 6, pp. 721-738, 2003.