偷偷摘套内射激情视频,久久精品99国产国产精,中文字幕无线乱码人妻,中文在线中文a,性爽19p

<nobr id="gfbpe"></nobr>

AI.x社區(qū)

軟考社區(qū)

企業(yè)培訓(xùn)

鴻蒙開發(fā)者社區(qū)

信創(chuàng)認(rèn)證

公眾號矩陣

移動端

視頻課免費課排行榜短視頻直播課軟考學(xué)堂

全部課程軟考信創(chuàng)認(rèn)證華為認(rèn)證廠商認(rèn)證 IT技術(shù)PMP項目管理免費題庫

在線學(xué)習(xí)

文章資源問答課堂專欄直播

51CTO

鴻蒙開發(fā)者社區(qū)

51CTO技術(shù)棧

51CTO官微

51CTO學(xué)堂

51CTO博客

CTO訓(xùn)練營

鴻蒙開發(fā)者社區(qū)訂閱號

51CTO軟考

51CTO學(xué)堂APP

51CTO學(xué)堂企業(yè)版APP

鴻蒙開發(fā)者社區(qū)視頻號

51CTO軟考題庫

賬號設(shè)置退出

實例 | 使用CNN和Python實施的肺炎檢測

作者：IT老周 2020-10-12 09:22:30

人工智能后端

嘿!幾個小時前我剛剛完成一個深度學(xué)習(xí)項目，現(xiàn)在我想分享一下我所做的事情。這一挑戰(zhàn)的目標(biāo)是確定一個人是否患有肺炎。如果是，則確定是否由細(xì)菌或病毒引起。好吧，我覺得這個項目應(yīng)該叫做分類而不是檢測。

介紹

嘿!幾個小時前我剛剛完成一個深度學(xué)習(xí)項目，現(xiàn)在我想分享一下我所做的事情。這一挑戰(zhàn)的目標(biāo)是確定一個人是否患有肺炎。如果是，則確定是否由細(xì)菌或病毒引起。好吧，我覺得這個項目應(yīng)該叫做分類而不是檢測。

使用CNN和Python實施的肺炎檢測

換句話說，此任務(wù)將是一個多分類問題，其中標(biāo)簽名稱為：normal(正常)，virus(病毒)和bacteria(細(xì)菌)。為了解決這個問題，我將使用CNN(卷積神經(jīng)網(wǎng)絡(luò))，它具有出色的圖像分類能力，。不僅如此，在這里我還實現(xiàn)了圖像增強技術(shù)，以提高模型性能。順便說一句，我獲得了80%的測試數(shù)據(jù)準(zhǔn)確性，這對我來說是非常令人印象深刻的。

整個數(shù)據(jù)集本身的大小約為1 GB，因此下載可能需要一段時間?；蛘?，我們也可以直接創(chuàng)建一個Kaggle Notebook并在那里編碼整個項目，因此我們甚至不需要下載任何內(nèi)容。接下來，如果瀏覽數(shù)據(jù)集文件夾，你將看到有3個子文件夾，即train，test和val。

好吧，我認(rèn)為這些文件夾名稱是不言自明的。此外，train文件夾中的數(shù)據(jù)分別包括正常，病毒和細(xì)菌類別的1341、1345和2530個樣本。我想這就是我介紹的全部內(nèi)容了，現(xiàn)在讓我們進入代碼的編寫!

注意：我在本文結(jié)尾處放置了該項目中使用的全部代碼。

加載模塊和訓(xùn)練圖像

使用計算機視覺項目時，要做的第一件事是加載所有必需的模塊和圖像數(shù)據(jù)本身。我使用tqdm模塊顯示進度條，稍后你將看到它有用的原因。

我最后導(dǎo)入的是來自Keras模塊的ImageDataGenerator。該模塊將幫助我們在訓(xùn)練過程中實施圖像增強技術(shù)。

import os 
import cv2import pickleimport numpy as np 
import matplotlib.pyplot as plt 
import seaborn as sns 
from tqdm import tqdm 
from sklearn.preprocessing import OneHotEncoder 
from sklearn.metrics import confusion_matrix 
from keras.models import Model, load_model 
from keras.layers import Dense, Input, Conv2D, MaxPool2D, Flatten 
from keras.preprocessing.image import ImageDataGeneratornp.random.seed(22)

接下來，我定義兩個函數(shù)以從每個文件夾加載圖像數(shù)據(jù)。乍一看，下面的兩個功能可能看起來完全一樣，但是在使用粗體顯示的行上實際上存在一些差異。這樣做是因為NORMAL和PNEUMONIA文件夾中的文件名結(jié)構(gòu)略有不同。盡管有所不同，但兩個功能執(zhí)行的其他過程基本相同。

首先，將所有圖像調(diào)整為200 x 200像素。

這一點很重要，因為所有文件夾中的圖像都有不同的尺寸，而神經(jīng)網(wǎng)絡(luò)只能接受具有固定數(shù)組大小的數(shù)據(jù)。

接下來，基本上所有圖像都存儲有3個顏色通道，這對X射線圖像來說是多余的。因此，我的想法是將這些彩色圖像都轉(zhuǎn)換為灰度圖像。

# Do not forget to include the last slash 
def load_normal(norm_path):    norm_files = np.array(os.listdir(norm_path))    norm_labels = np.array(['normal']*len(norm_files)) 
    norm_images = []    for image in tqdm(norm_files): 
        image = cv2.imread(norm_path + image)        image = cv2.resize(image, dsize=(200,200)) 
        image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)        norm_images.append(image) 
    norm_images = np.array(norm_images)    return norm_images, norm_labels 
def load_pneumonia(pneu_path):    pneu_files = np.array(os.listdir(pneu_path))    pneu_labels = np.array([pneu_file.split('_')[1] for pneu_file in pneu_files]) 
    pneu_images = []    for image in tqdm(pneu_files): 
        image = cv2.imread(pneu_path + image)        image = cv2.resize(image, dsize=(200,200)) 
        image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)        pneu_images.append(image) 
    pneu_images = np.array(pneu_images)    return pneu_images, pneu_labels

聲明了以上兩個函數(shù)后，現(xiàn)在我們可以使用它來加載訓(xùn)練數(shù)據(jù)了。如果你運行下面的代碼，你還將看到為什么我選擇在該項目中實現(xiàn)tqdm模塊。

norm_images, norm_labels = load_normal('/kaggle/input/chest-xray-pneumonia/chest_xray/train/NORMAL/')pneu_images, pneu_labels = load_pneumonia('/kaggle/input/chest-xray-pneumonia/chest_xray/train/PNEUMONIA/')

到目前為止，我們已經(jīng)獲得了幾個數(shù)組：norm_images，norm_labels，pneu_images和pneu_labels。

帶_images后綴的表示它包含預(yù)處理的圖像，而帶_labels后綴的數(shù)組表示它存儲了所有基本信息(也稱為標(biāo)簽)。換句話說，norm_images和pneu_images都將成為我們的X數(shù)據(jù)，其余的將成為y數(shù)據(jù)。

為了使項目看起來更簡單，我將這些數(shù)組的值連接起來并存儲在X_train和y_train數(shù)組中。

X_train = np.append(norm_images, pneu_images, axis=0) 
y_train = np.append(norm_labels, pneu_labels)

使用CNN和Python實施的肺炎檢測

順便說一句，我使用以下代碼獲取每個類的圖像數(shù)：

使用CNN和Python實施的肺炎檢測

顯示多張圖像

好吧，在這個階段，顯示幾個圖像并不是強制性的。但我想做是為了確保圖片是否已經(jīng)加載和預(yù)處理好。下面的代碼用于顯示14張從X_train陣列隨機拍攝的圖像以及標(biāo)簽。

fig, axes = plt.subplots(ncols=7, nrows=2, figsize=(16, 4)) 
indices = np.random.choice(len(X_train), 14) 
counter = 0 
for i in range(2): 
    for j in range(7): 
        axes[i,j].set_title(y_train[indices[counter]])        axes[i,j].imshow(X_train[indices[counter]], cmap='gray') 
        axes[i,j].get_xaxis().set_visible(False)        axes[i,j].get_yaxis().set_visible(False)        counter += 1 
plt.show()

使用CNN和Python實施的肺炎檢測

我們可以看到上圖，所有圖像現(xiàn)在都具有完全相同的大小，這與我用于本帖子封面圖片的圖像不同。

加載測試圖像

我們已經(jīng)知道所有訓(xùn)練數(shù)據(jù)都已成功加載，現(xiàn)在我們可以使用完全相同的函數(shù)加載測試數(shù)據(jù)。步驟幾乎相同，但是這里我將那些加載的數(shù)據(jù)存儲在X_test和y_test數(shù)組中。用于測試的數(shù)據(jù)本身包含624個樣本。

norm_images_test, norm_labels_test = load_normal('/kaggle/input/chest-xray-pneumonia/chest_xray/test/NORMAL/')pneu_images_test, pneu_labels_test = load_pneumonia('/kaggle/input/chest-xray-pneumonia/chest_xray/test/PNEUMONIA/')X_test = np.append(norm_images_test, pneu_images_test, axis=0) 
y_test = np.append(norm_labels_test, pneu_labels_test)

此外，我注意到僅加載整個數(shù)據(jù)集就需要很長時間。因此，我將使用pickle模塊將X_train，X_test，y_train和y_test保存在單獨的文件中。這樣我下次想再使用這些數(shù)據(jù)的時候，就不需要再次運行這些代碼了。

# Use this to save variables 
with open('pneumonia_data.pickle', 'wb') as f: 
    pickle.dump((X_train, X_test, y_train, y_test), f)# Use this to load variables 
with open('pneumonia_data.pickle', 'rb') as f: 
    (X_train, X_test, y_train, y_test) = pickle.load(f)

由于所有X數(shù)據(jù)都經(jīng)過了很好的預(yù)處理，因此現(xiàn)在使用標(biāo)簽y_train和y_test了。

標(biāo)簽預(yù)處理

此時，兩個y變量都由以字符串?dāng)?shù)據(jù)類型編寫的正常，細(xì)菌或病毒組成。實際上，這樣的標(biāo)簽只是神經(jīng)網(wǎng)絡(luò)所不能接受的。因此，我們需要將其轉(zhuǎn)換為單一格式。

幸運的是，我們從Scikit-Learn模塊獲取了 OneHotEncoder對象，它對完成轉(zhuǎn)換非常有幫助。為此，我們需要先在y_train和y_test上創(chuàng)建一個新軸。(我們創(chuàng)建了這個新軸，因為那是OneHotEncoder期望的形狀)。

y_train = y_train[:, np.newaxis] 
y_test = y_test[:, np.newaxis]

接下來，像這樣初始化one_hot_encoder。請注意，在這里我將False作為稀疏參數(shù)傳遞，以便簡化下一步。但是，如果你想使用稀疏矩陣，則只需使用sparse = True或?qū)?shù)保留為空即可。

one_hot_encoder = OneHotEncoder(sparse=False)

最后，我們將使用one_hot_encoder將這些y數(shù)據(jù)轉(zhuǎn)換為one-hot。然后將編碼后的標(biāo)簽存儲在y_train_one_hot和y_test_one_hot中。這兩個數(shù)組是我們將用于訓(xùn)練的標(biāo)簽。

y_train_one_hot = one_hot_encoder.fit_transform(y_train) 
y_test_one_hot = one_hot_encoder.transform(y_test)

將數(shù)據(jù)X重塑為(None，200，200，1)

現(xiàn)在讓我們回到X_train和X_test。重要的是要知道這兩個數(shù)組的形狀分別為(5216、200、200)和(624、200、200)。

乍一看，這兩個形狀看起來還可以，因為我們可以使用plt.imshow()函數(shù)進行顯示。但是，這種形狀卷積層不可接受，因為它希望將一個顏色通道作為其輸入。

因此，由于該圖像本質(zhì)上是灰度圖像，因此我們需要添加一個1維的新軸，該軸將被卷積層識別為唯一的顏色通道。雖然它的實現(xiàn)并不像我的解釋那么復(fù)雜：

X_train = X_train.reshape(X_train.shape[0], X_train.shape[1], X_train.shape[2], 1) 
X_test = X_test.reshape(X_test.shape[0], X_test.shape[1], X_test.shape[2], 1)

運行上述代碼后，如果我們同時檢查X_train和X_test的形狀，那么我們將看到現(xiàn)在的形狀分別是(5216，200，200，1)和(624，200，200，1)。

數(shù)據(jù)擴充

增加數(shù)據(jù)(或者更具體地說是增加訓(xùn)練數(shù)據(jù))的要點是，我們將通過創(chuàng)建更多的樣本(每個樣本都具有某種隨機性)來增加用于訓(xùn)練的數(shù)據(jù)數(shù)量。這些隨機性可能包括平移、旋轉(zhuǎn)、縮放、剪切和翻轉(zhuǎn)。

這種技術(shù)可以幫助我們的神經(jīng)網(wǎng)絡(luò)分類器減少過擬合，或者說，它可以使模型更好地泛化數(shù)據(jù)樣本。幸運的是，由于存在可以從Keras模塊導(dǎo)入的ImageDataGenerator對象，實現(xiàn)非常簡單。

datagen = ImageDataGenerator( 
        rotation_range = 10,   
        zoom_range = 0.1,  
        width_shift_range = 0.1,  
        height_shift_range = 0.1)

因此，我在上面的代碼中所做的基本上是設(shè)置隨機范圍。

接下來，在初始化datagen對象之后，我們需要做的是使它和我們的X_train相匹配。然后，該過程被隨后施加的flow()的方法，該步驟中是非常有用的，使得所述 train_gen對象現(xiàn)在能夠產(chǎn)生增強數(shù)據(jù)的批次。

datagen.fit(X_train)train_gen = datagen.flow(X_train, y_train_one_hot, batch_size=32)

CNN(卷積神經(jīng)網(wǎng)絡(luò))

現(xiàn)在是時候真正構(gòu)建神經(jīng)網(wǎng)絡(luò)架構(gòu)了。讓我們從輸入層(input1)開始。因此，這一層基本上會獲取X數(shù)據(jù)中的所有圖像樣本。因此，我們需要確保第一層接受與圖像尺寸完全相同的形狀。值得注意的是，我們僅需要定義(寬度，高度，通道)，而不是(樣本，寬度，高度，通道)。

此后，此輸入層連接到幾對卷積池層對，然后最終連接到全連接層。請注意，由于ReLU的計算速度比S型更快，因此模型中的所有隱藏層都使用ReLU激活函數(shù)，因此所需的訓(xùn)練時間更短。最后，要連接的最后一層是output1，它由3個具有softmax激活函數(shù)的神經(jīng)元組成。

這里使用softmax是因為我們希望輸出是每個類別的概率值。

input1 = Input(shape=(X_train.shape[1], X_train.shape[2], 1)) 
cnn = Conv2D(16, (3, 3), activation='relu', strides=(1, 1),  
    padding='same')(input1) 
cnn = Conv2D(32, (3, 3), activation='relu', strides=(1, 1),  
    padding='same')(cnn) 
cnn = MaxPool2D((2, 2))(cnn) 
cnn = Conv2D(16, (2, 2), activation='relu', strides=(1, 1),  
    padding='same')(cnn) 
cnn = Conv2D(32, (2, 2), activation='relu', strides=(1, 1),  
    padding='same')(cnn) 
cnn = MaxPool2D((2, 2))(cnn) 
cnn = Flatten()(cnn)cnn = Dense(100, activation='relu')(cnn) 
cnn = Dense(50, activation='relu')(cnn) 
output1 = Dense(3, activation='softmax')(cnn) 
model = Model(inputs=input1, outputs=output1)

在使用上面的代碼構(gòu)造了神經(jīng)網(wǎng)絡(luò)之后，我們可以通過對model對象應(yīng)用summary()來顯示模型的摘要。下面是我們的CNN模型的詳細(xì)情況。我們可以看到我們總共有800萬個參數(shù)——這確實很多。好吧，這就是為什么我在Kaggle Notebook上運行這個代碼。

使用CNN和Python實施的肺炎檢測

總之，在構(gòu)建模型之后，我們需要使用分類交叉熵?fù)p失函數(shù)和Adam優(yōu)化器來編譯神經(jīng)網(wǎng)絡(luò)。使用這個損失函數(shù)，因為它只是多類分類任務(wù)中常用的函數(shù)。同時，我選擇Adam作為優(yōu)化器，因為它是在大多數(shù)神經(jīng)網(wǎng)絡(luò)任務(wù)中最小化損失的最佳選擇。

model.compile(loss='categorical_crossentropy',  
              optimizer='adam', metrics=['acc'])

現(xiàn)在是時候訓(xùn)練模型了!在這里，我們將使用fit_generator()而不是fit()，因為我們將從train_gen對象獲取訓(xùn)練數(shù)據(jù)。如果你關(guān)注數(shù)據(jù)擴充部分，你會注意到train_gen是使用X_train和y_train_one_hot創(chuàng)建的。因此，我們不需要在fit_generator()方法中顯式定義X-y對。

history = model.fit_generator(train_gen, epochs=30,  
          validation_data=(X_test, y_test_one_hot))

train_gen的特殊之處在于，訓(xùn)練過程中將使用具有一定隨機性的樣本來完成。因此，我們在X_train中擁有的所有訓(xùn)練數(shù)據(jù)都不會直接輸入到神經(jīng)網(wǎng)絡(luò)中。取而代之的是，這些樣本將被用作生成器的基礎(chǔ)，通過一些隨機變換生成一個新圖像。

此外，該生成器在每個時期產(chǎn)生不同的圖像，這對于我們的神經(jīng)網(wǎng)絡(luò)分類器更好地泛化測試集中的樣本非常有利。下面是訓(xùn)練的過程。

Epoch 1/30 
163/163 [==============================] - 19s 114ms/step - loss: 5.7014 - acc: 0.6133 - val_loss: 0.7971 - val_acc: 0.7228 
. 
. 
. 
Epoch 10/30 
163/163 [==============================] - 18s 111ms/step - loss: 0.5575 - acc: 0.7650 - val_loss: 0.8788 - val_acc: 0.7308 
. 
. 
. 
Epoch 20/30 
163/163 [==============================] - 17s 102ms/step - loss: 0.5267 - acc: 0.7784 - val_loss: 0.6668 - val_acc: 0.7917 
. 
. 
. 
Epoch 30/30 
163/163 [==============================] - 17s 104ms/step - loss: 0.4915 - acc: 0.7922 - val_loss: 0.7079 - val_acc: 0.8045

整個訓(xùn)練本身在我的Kaggle Notebook上花費了大約10分鐘。所以要耐心點!經(jīng)過訓(xùn)練后，我們可以繪制出準(zhǔn)確度得分的提高和損失值的降低，如下所示：

plt.figure(figsize=(8,6)) 
plt.title('Accuracy scores') 
plt.plot(history.history['acc']) 
plt.plot(history.history['val_acc']) 
plt.legend(['acc', 'val_acc']) 
plt.show()plt.figure(figsize=(8,6)) 
plt.title('Loss value') 
plt.plot(history.history['loss']) 
plt.plot(history.history['val_loss']) 
plt.legend(['loss', 'val_loss']) 
plt.show()

使用CNN和Python實施的肺炎檢測

使用CNN和Python實施的肺炎檢測

根據(jù)上面的兩個圖，我們可以說，即使在這30個時期內(nèi)測試準(zhǔn)確性和損失值都在波動，模型的性能仍在不斷提高。

這里要注意的另一重要事情是，由于我們在項目的早期應(yīng)用了數(shù)據(jù)增強方法，因此該模型不會遭受過擬合的困擾。我們在這里可以看到，在最終迭代中，訓(xùn)練和測試數(shù)據(jù)的準(zhǔn)確性分別為79%和80%。

有趣的事實：在實施數(shù)據(jù)增強方法之前，我在訓(xùn)練數(shù)據(jù)上獲得了100%的準(zhǔn)確性，在測試數(shù)據(jù)上獲得了64%的準(zhǔn)確性，這顯然是過擬合了。因此，我們可以在此處清楚地看到，增加訓(xùn)練數(shù)據(jù)對于提高測試準(zhǔn)確性得分非常有效，同時也可以減少過擬合。

模型評估

現(xiàn)在，讓我們深入了解使用混淆矩陣得出的測試數(shù)據(jù)的準(zhǔn)確性。首先，我們需要預(yù)測所有X_test并將結(jié)果從獨熱格式轉(zhuǎn)換回其實際的分類標(biāo)簽。

predictions = model.predict(X_test) 
predictions = one_hot_encoder.inverse_transform(predictions)

接下來，我們可以像這樣使用confusion_matrix()函數(shù)：

cm = confusion_matrix(y_test, predictions)

重要的是要注意函數(shù)中使用的參數(shù)是(實際值，預(yù)測值)。該混淆矩陣函數(shù)的返回值是一個二維數(shù)組，用于存儲預(yù)測分布。為了使矩陣更易于解釋，我們可以使用Seaborn模塊中的heatmap()函數(shù)進行顯示。順便說一句，這里的類名列表的值是根據(jù)one_hotencoder.categories返回的順序獲取的。

classnames = ['bacteria', 'normal', 'virus']plt.figure(figsize=(8,8)) 
plt.title('Confusion matrix') 
sns.heatmap(cm, cbar=False, xticklabels=classnames, yticklabels=classnames, fmt='d', annot=True, cmap=plt.cm.Blues) 
plt.xlabel('Predicted') 
plt.ylabel('Actual') 
plt.show()

使用CNN和Python實施的肺炎檢測

根據(jù)上面的混淆矩陣，我們可以看到45張病毒X射線圖像被預(yù)測為細(xì)菌。這可能是因為很難區(qū)分這兩種肺炎。但是，至少因為我們對242個樣本中的232個進行了正確分類，所以我們的模型至少能夠很好地預(yù)測由細(xì)菌引起的肺炎。

這就是整個項目!謝謝閱讀!下面是運行整個項目所需的所有代碼。

import os 
import cv2import pickle    # Used to save variablesimport numpy as npimport matplotlib.pyplot as pltimport seaborn as snsfrom tqdm import tqdm    # Used to display progress bar 
from sklearn.preprocessing import OneHotEncoder 
from sklearn.metrics import confusion_matrix 
from keras.models import Model, load_model 
from keras.layers import Dense, Input, Conv2D, MaxPool2D, Flatten 
from keras.preprocessing.image import ImageDataGenerator    # Used to generate images 
np.random.seed(22) 
# Do not forget to include the last slashdef load_normal(norm_path):    norm_files = np.array(os.listdir(norm_path))    norm_labels = np.array(['normal']*len(norm_files)) 
    norm_images = []    for image in tqdm(norm_files): 
        # Read image        image = cv2.imread(norm_path + image)        # Resize image to 200x200 px 
        image = cv2.resize(image, dsize=(200,200)) 
        # Convert to grayscale        image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)        norm_images.append(image) 
    norm_images = np.array(norm_images)    return norm_images, norm_labels 
def load_pneumonia(pneu_path):    pneu_files = np.array(os.listdir(pneu_path))    pneu_labels = np.array([pneu_file.split('_')[1] for pneu_file in pneu_files]) 
    pneu_images = []    for image in tqdm(pneu_files): 
        # Read image        image = cv2.imread(pneu_path + image)        # Resize image to 200x200 px 
        image = cv2.resize(image, dsize=(200,200)) 
        # Convert to grayscale        image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)        pneu_images.append(image) 
    pneu_images = np.array(pneu_images)    return pneu_images, pneu_labels 
print('Loading images') 
# All images are stored in _images, all labels are in _labelsnorm_images, norm_labels = load_normal('/kaggle/input/chest-xray-pneumonia/chest_xray/train/NORMAL/') 
pneu_images, pneu_labels = load_pneumonia('/kaggle/input/chest-xray-pneumonia/chest_xray/train/PNEUMONIA/') 
# Put all train images to X_train X_train = np.append(norm_images, pneu_images, axis=0) 
# Put all train labels to y_trainy_train = np.append(norm_labels, pneu_labels) 
print(X_train.shape) 
print(y_train.shape) 
# Finding out the number of samples of each classprint(np.unique(y_train, return_counts=True))print('Display several images') 
fig, axes = plt.subplots(ncols=7, nrows=2, figsize=(16, 4)) 
indices = np.random.choice(len(X_train), 14) 
counter = 0 
for i in range(2): 
    for j in range(7): 
        axes[i,j].set_title(y_train[indices[counter]])        axes[i,j].imshow(X_train[indices[counter]], cmap='gray') 
        axes[i,j].get_xaxis().set_visible(False)        axes[i,j].get_yaxis().set_visible(False)        counter += 1 
plt.show()print('Loading test images') 
# Do the exact same thing as what we have done on train datanorm_images_test, norm_labels_test = load_normal('/kaggle/input/chest-xray-pneumonia/chest_xray/test/NORMAL/') 
pneu_images_test, pneu_labels_test = load_pneumonia('/kaggle/input/chest-xray-pneumonia/chest_xray/test/PNEUMONIA/') 
X_test = np.append(norm_images_test, pneu_images_test, axis=0) 
y_test = np.append(norm_labels_test, pneu_labels_test) 
# Save the loaded images to pickle file for future use 
with open('pneumonia_data.pickle', 'wb') as f: 
    pickle.dump((X_train, X_test, y_train, y_test), f)# Here's how to load it 
with open('pneumonia_data.pickle', 'rb') as f: 
    (X_train, X_test, y_train, y_test) = pickle.load(f) 
print('Label preprocessing') 
# Create new axis on all y data 
y_train = y_train[:, np.newaxis] 
y_test = y_test[:, np.newaxis] 
# Initialize OneHotEncoder object 
one_hot_encoder = OneHotEncoder(sparse=False) 
# Convert all labels to one-hot 
y_train_one_hot = one_hot_encoder.fit_transform(y_train) 
y_test_one_hot = one_hot_encoder.transform(y_test) 
print('Reshaping X data') 
# Reshape the data into (no of samples, height, width, 1), where 1 represents a single color channel 
X_train = X_train.reshape(X_train.shape[0], X_train.shape[1], X_train.shape[2], 1) 
X_test = X_test.reshape(X_test.shape[0], X_test.shape[1], X_test.shape[2], 1) 
print('Data augmentation') 
# Generate new images with some randomness 
datagen = ImageDataGenerator( 
        rotation_range = 10,   
        zoom_range = 0.1,  
        width_shift_range = 0.1,  
        height_shift_range = 0.1) 
datagen.fit(X_train) 
train_gen = datagen.flow(X_train, y_train_one_hot, batch_size = 32) 
print('CNN') 
# Define the input shape of the neural network 
input_shape = (X_train.shape[1], X_train.shape[2], 1) 
print(input_shape) 
input1 = Input(shape=input_shape) 
cnn = Conv2D(16, (3, 3), activation='relu', strides=(1, 1),  
    padding='same')(input1) 
cnn = Conv2D(32, (3, 3), activation='relu', strides=(1, 1),  
    padding='same')(cnn) 
cnn = MaxPool2D((2, 2))(cnn) 
cnn = Conv2D(16, (2, 2), activation='relu', strides=(1, 1),  
    padding='same')(cnn) 
cnn = Conv2D(32, (2, 2), activation='relu', strides=(1, 1),  
    padding='same')(cnn) 
cnn = MaxPool2D((2, 2))(cnn) 
cnn = Flatten()(cnn) 
cnn = Dense(100, activation='relu')(cnn) 
cnn = Dense(50, activation='relu')(cnn) 
output1 = Dense(3, activation='softmax')(cnn) 
model = Model(inputs=input1, outputs=output1) 
model.compile(loss='categorical_crossentropy',  
              optimizer='adam', metrics=['acc']) 
# Using fit_generator() instead of fit() because we are going to use data 
# taken from the generator. Note that the randomness is changing 
# on each epoch 
history = model.fit_generator(train_gen, epochs=30,  
          validation_data=(X_test, y_test_one_hot)) 
# Saving model 
model.save('pneumonia_cnn.h5') 
print('Displaying accuracy') 
plt.figure(figsize=(8,6)) 
plt.title('Accuracy scores') 
plt.plot(history.history['acc']) 
plt.plot(history.history['val_acc']) 
plt.legend(['acc', 'val_acc']) 
plt.show() 
print('Displaying loss') 
plt.figure(figsize=(8,6)) 
plt.title('Loss value') 
plt.plot(history.history['loss']) 
plt.plot(history.history['val_loss']) 
plt.legend(['loss', 'val_loss']) 
plt.show() 
# Predicting test data 
predictions = model.predict(X_test) 
print(predictions) 
predictions = one_hot_encoder.inverse_transform(predictions) 
print('Model evaluation') 
print(one_hot_encoder.categories_) 
classnames = ['bacteria', 'normal', 'virus'] 
# Display confusion matrix 
cm = confusion_matrix(y_test, predictions) 
plt.figure(figsize=(8,8)) 
plt.title('Confusion matrix') 
sns.heatmap(cm, cbar=False, xticklabels=classnames, yticklabels=classnames, fmt='d', annot=True, cmap=plt.cm.Blues) 
plt.xlabel('Predicted') 
plt.ylabel('Actual') 
plt.show()

責(zé)任編輯：未麗燕來源：今日頭條

Python CNN 檢測

51CTO技術(shù)棧公眾號

業(yè)務(wù)
速覽

媒體

51CTO CIOAge HC3i

社區(qū)

51CTO博客鴻蒙開發(fā)者社區(qū) AI.x社區(qū)

教育

51CTO學(xué)堂精培企業(yè)培訓(xùn) CTO訓(xùn)練營