What are PCA and SVM? How do you build a face recognition model?

2020-12-31 10:23

Let's print these variables:

print("Total dataset size:")
print("n_samples: %d" % n_samples)
print("n_features: %d" % n_features)
print("n_classes: %d" % n_classes)

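So we have 1288 samples (images), each with 1850 features (50 × 37 pixels), and 7 classes (people).

Splitting into training and test sets

Next, we use train_test_split from sklearn.model_selection to split the data (X, the features, and y, the labels) into training and test data, with 25% held out for testing and the remaining 75% used to train the model.

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)

This gives us the variables X_train, X_test, y_train and y_test.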
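As a quick check of the sizes this split produces, here is a minimal sketch using random placeholder data with the same shape as the dataset described above (1288 samples, 1850 features, 7 classes). The arrays X_demo and y_demo are assumptions made for illustration; they are not the real face data.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Placeholder data with the same shape as the dataset in the text (not real faces)
rng = np.random.default_rng(0)
X_demo = rng.random((1288, 1850))
y_demo = rng.integers(0, 7, size=1288)

# Same split settings as in the article: 25% test, fixed random seed
X_train_demo, X_test_demo, y_train_demo, y_test_demo = train_test_split(
    X_demo, y_demo, test_size=0.25, random_state=42
)

print(X_train_demo.shape, X_test_demo.shape)
```

With 1288 samples, a 75/25 split leaves 966 samples for training and 322 for testing, each still carrying all 1850 features.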

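PCA-based dimensionality reduction

Now we pick PCA from sklearn.decomposition to train the model. We already imported PCA in the first code block. In our case the training set X_train contains 966 samples with 1850 features each, and we use PCA (dimensionality reduction) to reduce those features to 50 components. In current scikit-learn releases the randomized solver is selected with svd_solver='randomized' (the older RandomizedPCA class has been removed):

n_components = 50
pca = PCA(n_components=n_components, svd_solver='randomized', whiten=True).fit(X_train)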

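This step takes less than a second, which can be verified with a timing function (we will skip that for now). Next we reshape the PCA components to define the eigenfaces, the name given to a set of eigenvectors used in the computer vision problem of face recognition:

eigenfaces = pca.components_.reshape((n_components, h, w))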
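The timing claim can be checked with a sketch like the following. It assumes random placeholder data with the same shape as the training set (966 × 1850) and an image size of 50 × 37; the names X_demo, pca_demo and eigenfaces_demo are hypothetical, and the measured time will depend on the machine.

```python
import time

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X_demo = rng.random((966, 1850))  # placeholder with the training set's shape
h, w = 50, 37                     # image height and width assumed from the text

# Time only the fit call
start = time.perf_counter()
pca_demo = PCA(
    n_components=50, svd_solver="randomized", whiten=True, random_state=0
).fit(X_demo)
elapsed = time.perf_counter() - start
print("PCA fit took %.3f s" % elapsed)

# Each component is a vector of 1850 values that can be viewed as a 50 x 37 image
eigenfaces_demo = pca_demo.components_.reshape((50, h, w))
print(eigenfaces_demo.shape)
```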

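eigenfaces is a NumPy array of shape 50 × 50 × 37: the first 50 is the number of components, and 50 × 37 is the size of each image in pixels. Next we call PCA's transform function on X_train and X_test to reduce their dimensionality.

X_train_pca = pca.transform(X_train)
X_test_pca = pca.transform(X_test)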
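To show what transform does with whiten=True, the sketch below reproduces it by hand on small random data: center the data, project it onto the components, then divide by the square root of each component's variance. The data and the names X_demo, pca_demo and manual are assumptions for illustration only.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X_demo = rng.random((200, 30))  # 200 placeholder samples with 30 features

pca_demo = PCA(n_components=5, whiten=True, svd_solver="full").fit(X_demo)
X_demo_pca = pca_demo.transform(X_demo)

# Manual equivalent: center, project onto the components, then whiten
manual = (X_demo - pca_demo.mean_) @ pca_demo.components_.T
manual /= np.sqrt(pca_demo.explained_variance_)

print(X_demo_pca.shape)                  # 30 features reduced to 5
print(np.allclose(X_demo_pca, manual))   # manual result matches transform
```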

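With PCA applied, both X_train and X_test have been reduced from 1850 features to 50 (as we specified).

Training the SVM classifier

With dimensionality reduction done, we can move on to classification. First we train the SVM classification model. We use GridSearchCV, a library function for hyperparameter tuning: it systematically builds and evaluates a model for every combination of the parameters listed in the grid, then keeps the best estimator. The parameters are given in param_grid:

print("Fitting the classifier to the training set")
param_grid = {
    'C': [1e3, 5e3, 1e4, 5e4, 1e5],
    'gamma': [0.0001, 0.0005, 0.001, 0.005, 0.01, 0.1],
}
clf = GridSearchCV(SVC(kernel='rbf', class_weight='balanced'), param_grid)
clf = clf.fit(X_train_pca, y_train)
print("Best estimator found by grid search:")
print(clf.best_estimator_)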
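To make "every combination" concrete, the following sketch enumerates the candidates that this grid defines, using only the standard library. It illustrates the idea and is not how GridSearchCV is implemented internally; the name candidates is hypothetical.

```python
from itertools import product

param_grid = {
    "C": [1e3, 5e3, 1e4, 5e4, 1e5],
    "gamma": [0.0001, 0.0005, 0.001, 0.005, 0.01, 0.1],
}

# Every (C, gamma) pair that a search over this grid would have to evaluate
candidates = [
    dict(zip(param_grid.keys(), values))
    for values in product(*param_grid.values())
]

print(len(candidates))   # 5 values of C times 6 values of gamma
print(candidates[0])
```

The grid yields 30 candidates. Each one is scored with cross-validation, so the number of model fits is 30 times the number of folds (5 by default in recent scikit-learn versions), plus a final refit of the best candidate.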

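The best classifier for our data is an SVC with the following parameters: SVC(C=1000, class_weight='balanced', gamma=0.01)

Prediction

Now let's predict the names of the people in the test data, using the classifier found by GridSearchCV, which has already been fitted on the training data.

print("Predicting the people names on the testing set")
y_pred = clf.predict(X_test_pca)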

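Classification report and confusion matrix

Once prediction is done, let's print the classification report, which shows the model's precision, recall, F1 score and support (the number of true samples in each class), giving us a better intuition for how the classifier behaves.

print(classification_report(y_test, y_pred, target_names=target_names))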
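For readers who want to see how these four numbers are derived, here is a small sketch that computes them by hand for one class. The labels are toy values invented for illustration, not output from the face model, and class_metrics is a hypothetical helper.

```python
# Toy labels for illustration only (not results from the face model)
y_true = [0, 0, 0, 1, 1, 1, 1, 2, 2, 2]
y_hat = [0, 0, 1, 1, 1, 1, 2, 2, 2, 0]


def class_metrics(y_true, y_hat, label):
    """Return (precision, recall, f1, support) for one class."""
    pairs = list(zip(y_true, y_hat))
    tp = sum(t == label and p == label for t, p in pairs)  # true positives
    fp = sum(t != label and p == label for t, p in pairs)  # false positives
    fn = sum(t == label and p != label for t, p in pairs)  # false negatives
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    support = tp + fn  # number of true samples of this class
    return precision, recall, f1, support


for label in sorted(set(y_true)):
    print(label, class_metrics(y_true, y_hat, label))
```

Precision answers "of the samples predicted as this class, how many were right", recall answers "of the samples that truly belong to this class, how many were found", and F1 is their harmonic mean.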

Let's print the confusion matrix:

print(confusion_matrix(y_test, y_pred, labels=range(n_classes)))
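As a rough illustration of how such a matrix is built, the sketch below counts (true label, predicted label) pairs by hand, with rows for the true class and columns for the predicted class, which is the layout scikit-learn uses. The labels are toy values assumed for the example.

```python
# Toy labels for illustration only (not results from the face model)
y_true = [0, 0, 0, 1, 1, 1, 1, 2, 2, 2]
y_hat = [0, 0, 1, 1, 1, 1, 2, 2, 2, 0]
n_labels = 3

# matrix[i][j] counts samples whose true class is i and predicted class is j
matrix = [[0] * n_labels for _ in range(n_labels)]
for t, p in zip(y_true, y_hat):
    matrix[t][p] += 1

for row in matrix:
    print(row)
```

Correct predictions sit on the diagonal; every off-diagonal entry is a sample of one class mistaken for another.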

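The confusion matrix shows the counts of true positives, false positives and false negatives for each class and gives an overview of how the classifier performs.

Plotting

Finally, we plot the portraits and the eigenfaces! We define two helper functions: title builds the caption (predicted and true name) for a test image, and plot_gallery draws a grid of images so the predictions can be inspected visually:

def title(y_pred, y_test, target_names, i):
    # Keep only the last word of each full name (the surname)
    pred_name = target_names[y_pred[i]].rsplit(' ', 1)[-1]
    true_name = target_names[y_test[i]].rsplit(' ', 1)[-1]
    return 'predicted: %s\ntrue: %s' % (pred_name, true_name)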

def plot_gallery(images, titles, h, w, n_row=3, n_col=4):
    """Helper function to plot a gallery of portraits"""
    plt.figure(figsize=(1.8 * n_col, 2.4 * n_row))
    plt.subplots_adjust(bottom=0, left=.01, right=.99, top=.90, hspace=.35)
    for i in range(n_row * n_col):
        plt.subplot(n_row, n_col, i + 1)
        # Each image is stored as a flat vector; reshape it back to h x w
        plt.imshow(images[i].reshape((h, w)), cmap=plt.cm.gray)
        plt.title(titles[i], size=12)
        plt.xticks(())
        plt.yticks(())
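The caption logic in title relies on rsplit with a space separator to keep only the last word of a name. A minimal sketch of that step follows; the name strings are example values assumed for illustration.

```python
# Example name strings, assumed for illustration
names = ["George W Bush", "Colin Powell", "Tony Blair"]

# rsplit(" ", 1) splits once from the right, so [-1] is the last word
surnames = [name.rsplit(" ", 1)[-1] for name in names]
print(surnames)

# Two-line caption in the same format that title returns
caption = "predicted: %s\ntrue: %s" % (surnames[0], surnames[1])
print(caption)
```

Note that the separator must be a single space: calling rsplit with an empty string raises a ValueError.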

Now let's plot the predictions on a portion of the test set:

prediction_titles = [title(y_pred, y_test, target_names, i)
                     for i in range(y_pred.shape[0])]

plot_gallery(X_test, prediction_titles, h, w)

Now let's plot the eigenfaces, using the eigenfaces variable defined in the earlier code block.

eigenface_titles = ["eigenface %d" % i for i in range(eigenfaces.shape[0])]
plot_gallery(eigenfaces, eigenface_titles, h, w)
plt.show()

Finally, let's compute the accuracy of the PCA + SVM face recognition model:

from sklearn.metrics import accuracy_score
score = accuracy_score(y_test, y_pred)
print(score)
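Accuracy is simply the fraction of predictions that match the true labels. The sketch below computes it by hand on toy labels, which are invented for illustration and unrelated to the model's actual score.

```python
# Toy labels for illustration only (not results from the face model)
y_true = [0, 0, 0, 1, 1, 1, 1, 2, 2, 2]
y_hat = [0, 0, 1, 1, 1, 1, 2, 2, 2, 0]

# Count matching pairs and divide by the number of samples
correct = sum(t == p for t, p in zip(y_true, y_hat))
accuracy = correct / len(y_true)
print(accuracy)
```

Because it is a single number, accuracy can hide weak performance on small classes, which is why the per-class report and the confusion matrix are worth reading alongside it.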

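Our accuracy score is 0.81! This is not a perfect score and there is plenty of room for improvement, but face recognition with PCA and SVM gives us a starting point for more powerful algorithms.

Conclusion

In this article we built a face recognition model with PCA and SVM. Principal component analysis was used to reduce the dimensionality of the data, and a support vector machine then performed the classification, with hyperparameter tuning used to find the best estimator. We classified the portraits with an accuracy score of 0.81.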
