Lecture By Dr.Yiming Liu of Nanyang Technological University
time: 2019-09-29

Speaker: Dr.Yiming Liu(Nanyang Technological University)

TitleHigh dimensional clustering:A two-step method for mixture data

Time: Mon, Sept.30 2019,PM:15:00-16:00

Location: Room 4318, Building No.4, Wushan Campus


Abstract:

     Clustering is an important subject in unsupervised learning.  It is a common technique used in many fields, including machine learning, statistics, bioinformatics, and computer graphics. To classify different samples into a homogeneous group, it is based on different criterions. In this talk, we focus on the clusters that are characterized by the different parameters of means and covariances, and we study the clustering method for the high dimensional mixtures.  According to this setting, we propose a new method, Two-step method, to conduct clustering. Two-step method is investigated from two aspects, i.e., covariances and means, and based on the random matrix theory. Both theoretical and numerical properties of the Two-step method are discussed. Specifically, we propose two separate algorithms and one universal algorithm that are applicable to do the clustering in different settings. In addition, we prove that the misclustering error for all these three algorithms converges to zero with probability tends to one under mild conditions. Simulation studies also demonstrate that the Two-step method outperforms other methods under variety of settings.