Enhancing Ship Classification with CNNs and Transfer Learning

Introduction

Welcome to an in-depth exploration of ship classification utilizing Convolutional Neural Networks (CNNs) with the Analytics Vidhya hackathon dataset. CNNs are a cornerstone of image-related duties, identified for his or her means to study hierarchical representations of photographs. On this undertaking, we dive into understanding the ability of CNNs to categorise ships based mostly on their visible options.

This undertaking goals to exhibit deep studying software in picture categorization and examine CNNs constructed from scratch and people enhanced by means of switch studying. It explores ship classification, from preprocessing to analysis and comparability.

Studying Aims

Apply Convolutional Neural Networks (CNNs) for ship classification.
Preprocess picture knowledge utilizing OpenCV and NumPy.
Implement CNN fashions each with and with out switch studying.
Consider mannequin efficiency utilizing metrics like accuracy and F1-score.
Evaluate the outcomes of CNN fashions with and with out transfer learning.

Drawback Assertion

On this context, the Governmental Maritime and Coastguard Company seeks to deploy an automatic ship detection system leveraging laptop imaginative and prescient expertise. The target is to determine and classify ships from photographs captured by survey boats precisely. With a various vary of ship sorts, together with cargo ships, tankers, navy vessels, carriers, and cruise ships, the problem lies in growing a strong mannequin able to distinguishing between these lessons successfully.

Dataset Description

The 5 ship sorts—Cargo, Tanker, Navy, Service, and Cruise—are represented within the dataset by a gaggle of photographs taken by survey boats. The dataset presents all kinds of visible knowledge for mannequin development, with 6252 photographs for coaching and 2680 photographs for testing.

Clarification of OpenCV and CNN

Allow us to discover about OpenCV and CNN intimately:

OpenCV

With its many options for picture processing duties together with object detection, function extraction, and movie enhancing, OpenCV is a potent instrument. We are able to enhance high quality of enter photographs, determine pertinent options, and preprocess the uncooked picture knowledge by utilizing OpenCV.

Convolutional Neural Networks

CNNs are particularly designed for image-related duties. CNNs are significantly good at robotically extracting options at varied levels of abstraction from hierarchical representations of photographs. We are able to create a mannequin that may acknowledge distinctive patterns and traits linked to every kind of ship by coaching a CNN with the labeled ship photographs.

Layers in CNNs

CNNs encompass a number of layers, every serving a particular goal in processing and extracting options from enter photographs. Let’s break down the elements of a CNN:

Convolutional Layers

CNNs are largely composed of convolutional layers. These layers are made up of learnable filters, generally known as kernels, which conduct convolution operations on the enter picture by sliding over it. The filters use element-wise multiplication and summing operations to extract completely different options from the enter picture, together with edges, textures, and patterns. Usually, every convolutional layer makes use of a variety of filters to gather varied options.

Activation Operate

So as to add non-linearity to the community, an activation perform is utilized element-by-element to the output function maps following the convolution operation. Tanh, sigmoid, and ReLU (Rectified Linear Unit) are examples of widespread activation features. ReLU’s ease of use and effectivity in fixing the vanishing gradient problem make it essentially the most extensively utilized activation perform in CNNs.

Pooling Layers

Pooling layers downsample the function maps produced by the convolutional layers to protect essential data whereas lowering their spatial dimensions. The most well-liked pooling process, max pooling, successfully highlights salient options by retaining the utmost worth inside every pooling area. Pooling the enter reduces the community’s computational complexity and enhances its means to study strong traits, making it extra resilient to slight spatial fluctuations.

Totally Linked Layers

Totally linked layers usually carry out classification or regression duties based mostly on the realized options after the convolutional and pooling layers. These layers set up connections between every neuron in a layer and each different layer’s neuron, enabling the community to grasp the relationships between options extracted from the enter photographs. Within the closing phases of community development, totally linked layers are sometimes used to generate the specified output, equivalent to class chances in picture classification duties.

Softmax Layer

Usually, a softmax layer is inserted on the finish of the community to rework the category chances from the uncooked output scores in classification duties. To make sure that the output scores add as much as one and could be understood as chances, the softmax perform normalizes the values for every class. In consequence, the community can select the category with the best chance to make predictions.

CNNs leverage convolutional layers with learnable filters to extract hierarchical options from enter photographs, adopted by activation features to introduce non-linearity, pooling layers to downsample function maps, totally linked layers for high-level function illustration, and a softmax layer for classification duties. This structure permits CNNs to carry out good in varied image-related duties, together with picture classification, object detection, and segmentation.

Allow us to now apply the ideas to the dataset from the Analytics Vidhya hackathon.

Implementation of CNN

We’ll execute CNN implementation each with and with out switch studying. To start, let’s first deal with the implementation with out switch studying.

Right here’s the step-by-step implementation:

Step1: Importing Libraries and Dependencies

As we all know , the very first step is to put in all needed libraries and dependencies:

import pandas as pd
import numpy as np
import cv2
import seaborn as sns
import tensorflow as tf
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from tensorflow.keras.fashions import Sequential
from tensorflow.math import confusion_matrix
from tensorflow.keras.callbacks import ModelCheckpoint
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, InputLayer
from glob import glob
from skimage.rework import resize
from keras.utils import to_categorical
from keras.fashions import Sequential
import keras
from keras.layers import Dense, Conv2D, MaxPool2D , Flatten
from tensorflow.keras import fashions, layers

Step2: Load the Dataset

knowledge = pd.read_csv('/kaggle/enter/shipdataset/prepare.csv')

Step3: Information Evaluation

Now, let’s conduct some knowledge evaluation:

knowledge.groupby('class').rely()

It will present insights into the distribution of classes throughout the dataset.

Step4: Visualization

Let’s now visualize this:

ship_categories = {1: 'Cargo', 2: 'Navy', 3: 'Service', 4: 'Cruise', 5: 'Tanker'}
knowledge['category_mapped'] = knowledge['category'].map(ship_categories)
sns.countplot(x='category_mapped', knowledge=knowledge)

The countplot reveals that the dataset contains 2120 photographs categorized as Cargo, 1167 as Navy, 916 as Service, 832 as Cruise, and 1217 as Tanker.

Step5: Preprocessing the information

Now let’s preprocess the information with the assistance of code beneath:

X=[]
import cv2
for img_name in knowledge.picture:
    img=cv2.imread('/kaggle/enter/shipdataset/photographs/'+img_name)
    img_resized = cv2.resize(img, (224, 224))
    X.append(img_resized)

X=np.array(X)

This code hundreds photographs from a listing, resizes them to 224×224 pixels utilizing OpenCV, and shops the resized photographs in a NumPy array.

Step6: Plotting

Now let’s plot them after resizing.

nrow = 5
ncol = 4
fig1 = plt.determine(figsize=(15, 15))
fig1.suptitle('After Resizing', measurement=32)
for i in vary(20):
    plt.subplot(nrow, ncol, i + 1)
    plt.imshow(X[i])
    plt.title('class = {x}, Ship = {y}'.format(x=knowledge["category"][i],
     y=ship_categories[data["category"][i]]))
    plt.axis('Off')
    plt.grid(False)
plt.present()

y=knowledge.class.values
y=y-1

This step subtracts 1 from every worth within the knowledge.class array, storing the consequence within the variable y.

The aim of this operation may very well be to regulate the class labels. It’s widespread in machine studying duties to start out indexing from 0 as a substitute of 1, particularly when coping with classification duties. To align the labels with zero-based indexing, subtracting 1 from the class labels is commonly accomplished, as required by machine studying algorithms or libraries.

X = X.astype('float32') / 255
y = to_categorical(y)

This code converts pixel values in X to floats between 0 and 1 and one-hot encodes categorical labels in y.

Step7: Information Splitting into Prepare/Check Dataset

Break up the dataset into coaching and testing units utilizing the train_test_split perform.

X_train, X_test, y_train, y_test = train_test_split(X, y,
 test_size=0.2, random_state=42)

Defining CNN Mannequin: Outline a CNN mannequin utilizing TensorFlow’s Sequential API, specifying the convolutional and pooling layers.

CNN_model = fashions.Sequential([
    layers.Conv2D(64, (3, 3), activation='relu', padding='same', 
    input_shape=(224, 224, 3)),
    layers.Conv2D(64, (3, 3), padding='same', activation='relu'),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(128, (3, 3), padding='same', activation='relu'),
    layers.Conv2D(128, (3, 3), padding='same', activation='relu'), 
    layers.MaxPooling2D((2, 2)),
    layers.GlobalAveragePooling2D(),  
    layers.Dense(128, activation='relu'),
    layers.Dense(128, activation='relu'),
    layers.Dense(128, activation='relu'),
    layers.Dense(5, activation='softmax')  
])
CNN_model.abstract()

Step8: Mannequin Coaching

rain the CNN mannequin on the coaching knowledge, establishing early stopping and mannequin checkpoint to forestall overfitting and save the most effective mannequin.

Compile the mannequin with adam optimizer and loss as categorical cross entropy because it’s multiclass classification

from tensorflow.keras.optimizers import Adam
mannequin.compile(optimizer="adam",
                  loss="categorical_crossentropy",
                  metrics=['accuracy',tf.keras.metrics.F1Score()])

Saving the most effective mannequin on validation loss

from tensorflow.keras.callbacks import EarlyStopping, ModelCheckpoint
early_stop = EarlyStopping(monitor="val_loss", 
persistence=3, restore_best_weights=True)
checkpoint = ModelCheckpoint('best_model.keras', 
monitor="val_loss", save_best_only=True, mode="min")

Step9: Becoming the Mannequin

historical past = mannequin.match(X_train, y_train, 
                        epochs=20, 
                        batch_size=32, 
                        validation_data=(X_test, y_test), 
                        callbacks=[early_stop, checkpoint])

Step10: Mannequin Analysis

Now let’s do mannequin analysis utilizing educated mannequin.

from sklearn.metrics import f1_score
y_pred = mannequin.predict(X_test)

Changing predictions from one-hot encoded format to class labels.

y_pred_labels = np.argmax(y_pred, axis=1)
y_true_labels = np.argmax(y_test, axis=1)

from sklearn.metrics import classification_report
report = classification_report(y_true_labels, y_pred_labels)
print("Classification Report:")
print(report)

Plotting coaching & validation loss values

plt.determine(figsize=(10, 6))
plt.plot(historical past.historical past['loss'], label="Coaching Loss")
plt.plot(historical past.historical past['val_loss'], label="Validation Loss")
plt.title('Mannequin Loss')
plt.xlabel('Epoch')
plt.ylabel('Loss')
plt.legend()
plt.grid(True)
plt.present()

import matplotlib.pyplot as plt
plt.plot(historical past.historical past['accuracy'], label="Coaching accuracy")
plt.plot(historical past.historical past['val_accuracy'], label="Validation accuracy")
plt.title('Coaching and Validation accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.legend()
plt.grid(True)
plt.present()

Testing of Information

Getting ready and preprocessing the take a look at knowledge equally to the coaching knowledge, make predictions utilizing the educated mannequin, and visualize some pattern predictions together with their predicted lessons.

take a look at=pd.read_csv('/kaggle/enter/test-data/test_ApKoW4T.csv')

X_test=[]
import cv2
for img_name in take a look at.picture:
    img=cv2.imread('/kaggle/enter/shipdataset/photographs/'+img_name)
    img_resized = cv2.resize(img, (224, 224))
    X_test.append(img_resized)
X_test=np.array(X_test)
X_test = X_test.astype('float32') / 255

Making Prediction

predictions=mannequin.predict(X_test)
predicted_class= np.argmax(predictions,axis=1)
predicted_class=predicted_class+1
csv_=take a look at.copy()
csv_
csv_['category']=predicted_class
csv_.head()

csv_['category'].value_counts()

Save Predictions in CSV

csv_.to_csv('prediction1.csv',index=False)

Plotting the Predicted Check Information

plt.determine(figsize=(8, 8))
for i in vary(20):
    plt.subplot(4, 5, i + 1)  
    plt.imshow(X_test[i])  
    plt.title(f'Predicted Class: {ship_categories[predicted_class[i]]}', fontsize=8)  
plt.tight_layout() 
plt.savefig('prediction_plot1.png')
plt.present()

Now let’s use switch studying to unravel this drawback for this we will likely be utilizing resnet.

Understanding Mobilenet

Mobilenet is a kind of convolutional neural community (CNN) designed particularly for cell and embedded units . It’s identified for being environment friendly and light-weight, making it splendid for conditions the place processing energy and battery life are restricted.

Right here’s a breakdown of Mobilenet’s key options:

Effectivity: Mobilenet employs depthwise separable convolutions, which divide knowledge processing into two steps: depthwise convolution utilizing a single filter for every enter channel and pointwise convolution utilizing 1×1 filters.
Light-weight: Mobilenet lowers the quantity of parameters wanted by the mannequin by minimizing computations. Because of this the mannequin will likely be smaller, which is vital for cell units with constrained storage.
Functions: It’s helpful for varied duties on cell units, together with picture classification, object detection, and facial recognition.

You possibly can learn extra about MobileNet by clicking right here.

We are going to make the most of MobileNet for this activity. All the things stays constant from importing libraries to knowledge splitting(identical step as with out switch studying) . Moreover, we have to import the MobileNet library.

from keras.functions import MobileNet
from keras.fashions import Mannequin
from keras.layers import Dense, GlobalAveragePooling2D
from keras.layers import Dropout, BatchNormalization

Loading Pre-trained Mannequin

Now load the Pre-trained Mannequin as base mannequin

base_model = MobileNet(weights="imagenet", include_top=False)

Freeze all layers within the base mannequin:

for layer in base_model.layers:
    layer.trainable = False

Construct Mannequin Utilizing Purposeful Operate

x = base_model.output
x = GlobalAveragePooling2D()(x)
x = Dense(1024, activation='relu')(x)
x = BatchNormalization()(x)
x = Dropout(0.5)(x)  # Add dropout with a price of 0.5
predictions = Dense(5, activation='softmax')(x)
 
#Creating the mannequin
mannequin = Mannequin(inputs=base_model.enter, outputs=predictions)

Compiling the Mannequin

Compile the mannequin with adam optimizer and loss as categorical cross entropy because it’s multiclass classification

from tensorflow.keras.optimizers import Adam
mannequin.compile(optimizer="adam",
                  loss="categorical_crossentropy",
                  metrics=['accuracy',tf.keras.metrics.F1Score()])

Saving the Greatest Mannequin on Validation Loss

from tensorflow.keras.callbacks import EarlyStopping, ModelCheckpoint
early_stop = EarlyStopping(monitor="val_loss", persistence=2, restore_best_weights=True)
checkpoint = ModelCheckpoint('best_model.keras', monitor="val_loss", 
  save_best_only=True, mode="min")

Becoming the Mannequin

historical past = mannequin.match(X_train, y_train, 
                    epochs=20, 
                    batch_size=32, 
                    validation_data=(X_test, y_test),
                    callbacks=[early_stop,checkpoint])

Mannequin Analysis

Now let’s do mannequin analysis.

from sklearn.metrics import f1_score

#Making predictions utilizing the educated mannequin
y_pred = mannequin.predict(X_test)

#Changing predictions from one-hot encoded format to class labels
y_pred_labels = np.argmax(y_pred, axis=1)
y_true_labels = np.argmax(y_test, axis=1)

from sklearn.metrics import classification_report
report = classification_report(y_true_labels, y_pred_labels)
print("Classification Report:")
print(report)

Plotting Coaching and Validation Loss Values

plt.determine(figsize=(10, 6))
plt.plot(historical past.historical past['loss'], label="Coaching Loss")
plt.plot(historical past.historical past['val_loss'], label="Validation Loss")
plt.title('Mannequin Loss')
plt.xlabel('Epoch')
plt.ylabel('Loss')
plt.legend()
plt.grid(True)
plt.present()

Plotting Accuracy Curve

import matplotlib.pyplot as plt
plt.plot(historical past.historical past['accuracy'], label="Coaching accuracy")
plt.plot(historical past.historical past['val_accuracy'], label="Validation accuracy")
plt.title('Coaching and Validation accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.legend()
plt.grid(True)
plt.present()

Now do the prediction on take a look at knowledge identical as accomplished in with out switch studying

Conclusion

This examine explores two approaches to ship classification utilizing Convolutional Neural Networks (CNNs). The primary entails constructing a CNN from scratch with out switch studying methods, whereas the second makes use of switch studying utilizing MobileNet structure. Each strategies present potential options for ship classification, with switch studying providing higher efficiency with much less coaching knowledge. The selection will depend on computational sources, dataset measurement, and desired efficiency metrics.

Incessantly Requested Questions

Q1. What’s OpenCV?

A. OpenCV is a strong instrument for picture processing that gives a variety of features for duties equivalent to picture manipulation, function extraction, and object detection. It presents varied functionalities to preprocess uncooked picture knowledge, extract related options, and improve picture high quality.

Q2. What are Convolutional Neural Networks (CNNs)?

A. CNNs excel at studying hierarchical representations of photographs, robotically extracting options at completely different ranges of abstraction. They encompass a number of layers, together with convolutional layers, activation features, pooling layers, totally linked layers, and softmax layers, which work collectively to course of and extract options from enter photographs.

Q3. What’s switch studying?

A. In switch studying, a mannequin educated on one activity serves as the start line for a mannequin on a second activity. Within the context of neural networks, switch studying entails taking a pre-trained mannequin (normally educated on a big dataset) and fine-tuning it for a particular activity or dataset. This strategy may also help enhance mannequin efficiency, particularly when the brand new dataset is small or just like the unique dataset.

Source link

Enhancing Ship Classification with CNNs and Transfer Learning

Working with Input-Convex Neural Networks part3(Machine Learning 2024) | by Monodeep Mukherjee | Jul, 2024

Embracing the Future: The Rise of AI-Driven Development in Software Engineering The software… | by DevBlogs | Jul, 2024

Research on Metaheuristic methods part4(Machine Learning 2024) | by Monodeep Mukherjee | Jul, 2024

How Real-Time Data Analytics and AI Are Transforming Heavy Equipment Operations

NVIDIA Accelerates Google Quantum AI Processor Design With Simulation of Quantum Device Physics

Game Development and Cloud Computing: Benefits of Cloud-Native Game Servers

Teradata AI Unlimited in Microsoft Fabric is Now Available for Public Preview through Microsoft Fabric Workload Hub

Cognigy Unveils Agentic AI: Transforming the Future of Enterprise Contact Centers

Our Picks

Automated contract data extraction: A complete guide

Artificial organic neurons facilitate the connection between biology and electronics.

Who Created Artificial Intelligence (AI)?: Meet the Founding Fathers | by Ifekorede Ayoola | Jul, 2024

Most Popular

Revolutionizing the Way We Find Love

Will GenAI Replace Data Engineers? No – And Here’s Why.

Assortment Optimization Machine Learning | by Danishaliarshar | Mar, 2024

Enhancing Ship Classification with CNNs and Transfer Learning

Introduction

Studying Aims

Drawback Assertion

Dataset Description

Clarification of OpenCV and CNN

OpenCV

Convolutional Neural Networks

Layers in CNNs

Convolutional Layers

Activation Operate

Pooling Layers

Totally Linked Layers

Softmax Layer

Implementation of CNN

Step1: Importing Libraries and Dependencies

Step2: Load the Dataset

Step3: Information Evaluation

Step4: Visualization

Step5: Preprocessing the information

Step6: Plotting

Step7: Information Splitting into Prepare/Check Dataset

Step8: Mannequin Coaching

Step9: Becoming the Mannequin

Step10: Mannequin Analysis

Testing of Information

Making Prediction

Save Predictions in CSV

Plotting the Predicted Check Information

Understanding Mobilenet

Loading Pre-trained Mannequin

Construct Mannequin Utilizing Purposeful Operate

Compiling the Mannequin

Saving the Greatest Mannequin on Validation Loss

Becoming the Mannequin

Mannequin Analysis

Plotting Coaching and Validation Loss Values

Plotting Accuracy Curve

Conclusion

Incessantly Requested Questions

Related Posts