京都大学 情報学研究科 知能情報学専攻 2023年8月実施 専門科目 S-3
Author
祭音Myyura, itsuitsuki
Description
Suppose that samples are given in a 2-dimensional feature space as shown in Table 1.
The samples are denoted by
(1) Suppose that a classification of the samples in Table 1 (a) is given by
where
(2) Suppose that another classification of the samples in Table 1 (a) is given by
where
(3) For each class given in (1), give the formula for calculating Mahalanobis distance for
(4) Explain which class given in (1) is suitable for the sample in Table 1 (b). The assumption on the sample distribution required for the discussion must be clearly stated.
Kai
(1)
Let
Let the point
i.e.,
The classifier is
(2)
Let
Since a larger ratio of between-class variance (sum of squared class center-overall center distances) to within-class variance (sum of squared data-class center distances) is considered a better classification, we compare the ratio
Let
Similarly,
Since
(3)
For
where
and by the inversion
(4)
Assume that each sample follows a multivariate normal distribution. Based on the form of the probability density function of the multivariate normal distribution (assuming the samples of each class form a multivariate normal distribution respectively), it is better to classify the sample into the class with the smaller Mahalanobis distance calculated using the method in Q3.
Following the formula given in Q3, the Mahalanobis distances
Hence class