Building a Binary Image Classification Model to Distinguish Between Images of Birds and Drones Using VGG16 | by Keven Ariel Amaya Muñoz | Apr, 2024

For my data storytelling capstone endeavor, I explored Harsh Walia’s Chook vs. Drone dataset on Kaggle. The dataset contains an entire lot of pictures initially meant to assist Indian Intelligence in detecting birds or drones present throughout the sky. It has a whole of 828 pictures divided into 3 folders meant for teaching, testing, and validation. The dataset is especially designed for binary image classification and holds quite a bit price in functions like security and surveillance.

I decided to utilize convolutional neural networks as my machine learning algorithm of different on account of it excels in its capability to acknowledge patterns in pictures, making it good for image recognition. I’ll use VGG16, a pre-trained model, to classify the photographs throughout the dataset as each drones or birds. I’ll modify the model to work increased with my dataset and use strategies like image resizing to wash and put collectively the data for modeling. Lastly, I’ll contemplate my model with teaching and validation accuracy and loss metrics.

Sooner than starting, be sure you import all compulsory options and libraries.

!pip arrange --quiet gdown==4.5.4 --no-cache-dir
import os
import random
from PIL import Image as PILImage
from IPython.present import Imageimport numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from mpl_toolkits.axes_grid1 import ImageGrid
from math import flooring
import tensorflow as tf
from tensorflow import keras
from keras import layers
from sklearn.model_selection import train_test_split
from keras.preprocessing.image import ImageDataGenerator
from keras.fashions import Sequential
from keras.layers import Enter, Dense, Flatten
from keras.layers import Conv2D, MaxPooling2D, Lambda
from keras import backend as Okay
from keras.optimizers import *
from keras.functions.vgg16 import VGG16
from keras.metrics import *
from keras.functions.vgg16 import preprocess_input
from tensorflow.keras.preprocessing import image_dataset_from_directory
from tensorflow.keras.layers.experimental.preprocessing import RandomFlip, RandomRotation

Step #1: Import and break up the data proper right into a observe/validation set

!gdown 1S8IezIq2waOeSjSjYoOvSk16XzOH3oJB && unzip -qq drones_or_birds.zipBATCH_SIZE = 64
# Resize the photographs to standardize dimensions.
IMG_SIZE = (160, 160)
itemizing = 'drones_or_birds/'
# Use keras's image_dataset_from_directory carry out
# to load the photographs from the itemizing.
train_dataset = image_dataset_from_directory(itemizing,
shuffle=True,
batch_size=BATCH_SIZE,
image_size=IMG_SIZE,
validation_split=0.2,
subset='teaching',
seed=42)
validation_dataset = image_dataset_from_directory(itemizing,
shuffle=True,
batch_size=BATCH_SIZE,
image_size=IMG_SIZE,
validation_split=0.2,
subset='validation',
seed=42)
# Get the names of each class from the dataset.
# On this case, the teachings are each "drone" or "fowl"
class_names = train_dataset.class_names
# Plot 9 pictures from the teaching dataset.
plt.decide(figsize=(10, 10))
for pictures, labels in train_dataset.take(1):
for i in range(9):
ax = plt.subplot(3, 3, i + 1)
plt.imshow(pictures[i].numpy().astype("uint8"))
plt.title(class_names[labels[i]])
plt.axis("off")

Step #2: Define the enter type of the data

input_shape = (160, 160, 3)

Steps #3–6: Assemble a CNN using VGG16. VGG16 is a robust model pre-trained on the ImageNet database, making it good for this binary image classification exercise. I moreover froze the parameters of some layers, as was achieved in a earlier lab, on account of I solely wished to educate the final word layers.

# Define the enter kind for the model
input_shape = (160, 160, 3)# Initialize the model
new_model = Sequential()
# Add enter layer
new_model.add(Enter(kind=input_shape))
# Add data augmentation layers
new_model.add(RandomFlip('horizontal'))
new_model.add(RandomRotation(0.2))
# You do not wish to fret about this step. Nonetheless in case you are curious:
# The "Lambda" layer is one different explicit layer that allow's you apply a custom-made
# carry out to the data. On account of everyone knows we'll use VGG16, we'll use
# the an identical pre-processing carry out that was used when it was first expert.
new_model.add(
Lambda(
preprocess_input,
determine='preprocessing',
input_shape=input_shape
)
)
# Load a pre-trained model
VGG = VGG16(
# we have to use our private final layers custom-made for our exercise
include_top=False,
input_shape=input_shape,
weights='imagenet'
)
# Freeze parameters each layer on account of we solely have to observe the final word layers
for layer in VGG.layers:
layer.trainable = False
# Add the pre-trained model to our model
new_model.add(VGG)
# Add a flattening layer
new_model.add(Flatten())
# Add a completely linked layer
new_model.add(Dense(256, activation='relu'))
# Add the output layer
new_model.add(Dense(1, activation='sigmoid'))
new_model.summary()

new_model.compile(
loss='binary_crossentropy',
optimizer='adam',
metrics=['accuracy'])historic previous = new_model.match(
train_dataset,
validation_data=validation_dataset,
epochs=10)

Think about the model’s effectivity by monitoring accuracy and loss.

acc = [0.] + historic previous.historic previous['accuracy']
val_acc = [0.] + historic previous.historic previous['val_accuracy']loss = historic previous.historic previous['loss']
val_loss = historic previous.historic previous['val_loss']
plt.decide(figsize=(8, 8))
plt.subplot(2, 1, 1)
plt.plot(acc, label='Teaching Accuracy')
plt.plot(val_acc, label='Validation Accuracy')
plt.legend(loc='lower correct')
plt.ylabel('Accuracy')
plt.ylim([min(plt.ylim()),1])
plt.title('Teaching and Validation Accuracy')
plt.subplot(2, 1, 2)
plt.plot(loss, label='Teaching Loss')
plt.plot(val_loss, label='Validation Loss')
plt.legend(loc='increased correct')
plt.ylabel('Cross Entropy')
plt.ylim([0,1.0])
plt.title('Teaching and Validation Loss')
plt.xlabel('epoch')
plt.current()

Plot pictures and assess the model’s effectivity.

# We'll take 9 pictures from the validation dataset to plot
plt.decide(figsize=(15, 15))
for x, y in validation_dataset.take(1):
for i in range(9):
prediction = new_model.predict(x)[i]
ax = plt.subplot(3, 3, i + 1)
plt.imshow(x[i].numpy().astype("uint8"))
tmp2 = "predicted {:.0f}% fowl, {:.0f}% drone, actually ".format(100 * (1 - prediction[0]), 100 * prediction[0])
plt.title(tmp2 + class_names[y[i].numpy()])
plt.axis("off")

Source link

Building a Binary Image Classification Model to Distinguish Between Images of Birds and Drones Using VGG16 | by Keven Ariel Amaya Muñoz | Apr, 2024

Working with Input-Convex Neural Networks part3(Machine Learning 2024) | by Monodeep Mukherjee | Jul, 2024

Embracing the Future: The Rise of AI-Driven Development in Software Engineering The software… | by DevBlogs | Jul, 2024

Research on Metaheuristic methods part4(Machine Learning 2024) | by Monodeep Mukherjee | Jul, 2024

SambaNova Reports Fastest DeepSeek-R1 671B with High Efficiency

Data Center Cooling: Carrier Invests in Direct-to-Chip Liquid Provider ZutaCore

Sama Launches Agentic Capture for Multi-Modal Agentic AI

AI and Crypto Security: Protecting Digital Assets with Advanced Technology

How to Balance Real-Time Data Processing with Batch Processing for Scalability

Our Picks

Human Activity Recognition Using Deep Learning: A Comprehensive Guide | by Waleed Ahmed | Jul, 2024

What questions are asked in the field of cybersecurity, especially questions asked in the field of… | by Achraf Zaryouh | Jun, 2024

A Guide to Career in Cybersecurity: Various Roles and Career Outlook

Most Popular

Revolutionizing the Way We Find Love

Will GenAI Replace Data Engineers? No – And Here’s Why.

Assortment Optimization Machine Learning | by Danishaliarshar | Mar, 2024

Building a Binary Image Classification Model to Distinguish Between Images of Birds and Drones Using VGG16 | by Keven Ariel Amaya Muñoz | Apr, 2024

Related Posts