•

陳博琳

Published October 12, 2024

Forest Fire Prevention

Implementing a lightweight neural network on the KR260 edge device for precise wildfire detection.

Things used in this project

Hardware components

AMD Kria™ KR260 Robotics Starter Kit

Software apps and online services

Snappy Ubuntu Core

Story

Forest Fire Prevention Using CNN Model with Two Frameworks

1Author 1 (Po-Lin, Chen), 1Author 2 (Sih-Min, Liu) , and 2Author 3 (Pei-Jen, Wang)

1 Department of Power Mechanical Engineering,

National Tsing Hua University, Hsinchu, Taiwan,

E-mail:s112030504@m112.nthu.edu.tw

E-mail:s112033639@m112.nthu.edu.tw

2 Department of Power Mechanical Engineering,

National Tsing Hua University, Hsinchu, Taiwan,

E-mail: pjwang@pme.nthu.edu.tw

Abstract

Wildfire has always been a troublesome issue for the residents of North America. Although the monitoring system is widespread nowadays, it is hard to apply such system in natural environments. Therefore, we aim to implement a lightweight neural network on the edge device, which is KR260 in order to distinguish fire or non-fire precisely. At first, we simply use color detection by our clustering method based on K-means. But the fact is that the proposed color detection method does not predict fire or not well. Considering the performance of KR260, which is the board we apply on, we choose to train a simple CNN model on our PC, and test some samples on the board. By the proposed model using two frameworks, we obtain good results evaluated by loss and accuracy. Finally, we run forest fire prevention system on embedded system so as to prove our method that can predict wildfire precisely, thereby utilized in natural environments.

Keywords: Wildfire, Natural Environments, KR260, CNN Model.

1. Introduction

The northern American wildfire season typically runs from spring to fall; however, as the effects of climate change increase, disasters continue to shirk the expected seasonal rules, occurring with ever-increasing frequency and intensity. Wildfire usually appears in wilderness, and when disasters strike, addressing immediate needs is paramount. However, the outdoor device needs to utilize power effectively to extend the device's operating time. This Kria KR260 Robotics Starter Kit embedded system is suitable for the requirement. We implemented fire detection to report outdoor conditions. If there is any fire coming out, the rescue team can extinguish the fire before it grows larger.

2. related work

The bill of materials(BOM) we used are listed below:

• Kria KR260 Robotics Starter Kit

• 64GB SD card

• PC (usage for machine training)

Wildfire detection has been widely studied with various techniques developed over the years. Traditional approaches often rely on humidity and temperature sensors or even thermal cameras. Though these methods are much effective, they come with limitations such as high expenses, limited coverage areas, and delayed response times. Recently, image-based techniques, particularly convolutional neural networks (CNNs), have emerged for wildfire detection. These models can more accurately identify fire in images by extracting visual features. However, many of the existing CNN models are computationally intensive and unsuitable for real-time detection on edge devices. Our research addresses these limitations by developing a lightweight neural network optimized for deployment on the KR260 edge device. Unlike traditional sensor-based methods, our approach exclusively utilizes image processing, enabling real-time wildfire detection in natural environments without additional sensors. By concentrating on efficient model architectures and leveraging the capabilities of KR260 board, we strive to achieve precise and prompt wildfire detection.

3. methodology

3.1. Activate the Board

The initial step involves using the Win32 Disk Imager tool to download Ubuntu and write it onto the microSD card. Next, connect everything on the board, which contains microSD card, USB keyboard, USB mouse, monitor display, Ethernet cable, and Power Supply.

Fig. 1. The connection of KR260.

3.2. Set Up the Environment

After the Power Supply is connected, it requires time to turn on the device. Firstly, the standard system requires us to change the password when the first time logging in. Then, enter the following commands on the terminal.(See Fig.2.)

Fig. 2. The Commands Entered in Terminal.

Following that, a virtual environment is set up and is activated afterwards. The platform we use is Jupyter Notebook, which is a common tool in Anaconda.

3.3. Color Detection

At first, we want to apply color detection on our study. From the graph below, we can see after implementing our code, we cannot clearly distinguish whether it is on fire or not. The difficulty is that if a man wearing red clothes appears in the sight of the camera, the system may mistake the man as fire. As a result, we take on another method. Another reason why we cannot use color detection is that it is hard to find the algorithm to catch out all the color exchange when the forest is on fire. For instance, thick smoke will affect the results, leading to judge the fire’s red color more difficultly.

Fig. 3.a. The Scenario That Is About to Be on Fire.

Fig. 3.b. Testing Our Color Detection Algorithm by Applying a Non-Fire Image Above.

3.4. CNN Model

We use a CNN model with PyTorch and Tensorflow framework respectively. Below shows the framework of our model. To begin with, we train the model on our PC, and the code generates a file containing model parameters, such as weight and bias. Subsequently, we use this file to implement our system on KR260 with its accurate prediction.

3.4.1. Tensorflow Framework

Based on the TensorFlow framework, we execute the following process: First of all, we load some necessary libraries and initialize data, including class names (fire and non-fire) and image input size. We then randomize and standardize the training and testing images, using 90 percent of the data as the training set and 10 percent as the test set. Moreover, we builda convolutional neural network using Adam as the optimizer, train it with a batch size of 128 for 50 epochs, and save the trained model as "CNN_model.keras". On the KR260, we implement the trained model, following steps to convert color space from BGR to RGB, resize the image, normalize it, and convert the data structure to predict fire presence in images. The following figure demonstrates the proposed model architecture using Tensorflow framework.

Fig. 4. Model Architecture by Tensorflow Framework

3.4.2. PyTorch Framework

Another widely-used framework is PyTorch, which is also a common tool. The overall process is quite similar to that of Tensorflow with several differences listed below:

• Set up the device to use either a CUDA GPU or a CPU.

• Set random seeds so that reproducibility of the results can be ensured.

• The batch size in PyTorch is 256, which is twice that of TensorFlow.

• Input size is initialized to 32*32*3, where the input size in Tensorflow is initialized to 64*64*3.

• Set a learning rate scheduler so that it can adjust the learning rate during training, helping the model converge more efficiently and avoid local minima.

• Do data preprocess, such as color jittering, random erasing and so on.

Fig. 5. Model Architecture Listed by PyTorch Framework

4. experiment results

In this section, we discuss the differences between two selected frameworks. We use cross entropy as our loss function, with accuracy mainly being the benchmark to evaluate our models. Moreover, Adam is selected as the optimizer of the models, and each model of different frameworks runs 50 epochs.

4.1. Dataset Overview

A fire dataset provided on Kaggle, which is a well-known website with various kinds of datasets. The dataset contains 755 outdoor-fire images and 244 nature images such as river, foggy forest, and so on. Seeing that the ratio between fire and non-fire images is not quite balanced, we add some nature images so that the number of images with fire and without fire are approximately equal. The proportion of the training set and test set is chosen randomly. On the PyTorch framework, we allocate 10% of the data to the test set, 10% to the validation set, and the remaining 80% to the training set. On the TensorFlow framework, we allocate 10% of the data to the test set and the remaining 90% to the training set.

4.2. Different Framework Analysis

4.2.1. Tensorflow Framework

In this study, we use confusion matrix, loss, and accuracy to evaluate and analyze the performance of the model. Fig. 6 shows the confusion matrix of the model on the test set. Though the model performs well in classifying fire and non-fire, it sometimes confuses the two. This may be due to the model misclassifying early-stage fire images as the non-fire one. We can mitigate this issue by adding more early-stage fire images in the dataset.

Fig. 6. Confusion Matrix Using Tensorflow Framework

In Fig. 7, it demonstrates the change of the loss during the training process. From the figure, we can see that the loss decreases rapidly in the early stages but begins to plateau after reaching a certain point. However, the curve shows some fluctuations though dropout layer and early stopping techniques have been applied. In Section 6, future work can explore more refined hyperparameter adjustments to further improve model performance.

Fig. 7. Loss Function by 50 Epochs

The accuracy of the model on the training set and the test set show in Fig. 8. The model achieves an accuracy of around 98% on the test set, indicating good generalization ability.

Fig. 8. Final Accuracy Result

4.2.2. PyTorchFramework

In this study, we also use confusion matrix, cross entropy loss, and accuracy to conduct analyses about the performance of the model. As shown below, the confusion matrix of the model based on the test set is depicted in Fig. 9, where 0 represents non-fire and 1 represents fire.

Fig. 9. Confusion Matrix Using PyTorch Framework

We can observe from Fig. 10 that the loss function in PyTorch framework exhibits larger fluctuations, especially for validation loss. Consequently, we may infer that overfitting exists since our model performs better during training process compared to test process. What’s more, due to the smaller size of test dataset, validation data in each batch encompasses higher variances.

Fig. 10. Loss Function by 50 Epochs

Fig. 11. Final Accuracy Result

4.3. Validation of Prediction

Apart from the accuracy provided by the code, we actually test some images to validate whether the model is well performed or not. The listed tables show the true ratio with 30 fire images and 30 non-fire images by two frameworks respectively by random selection.

4.3.1. Tensorflow Framework

Below figures demonstrate how we predict the forest fire in reality by testing fire and non-fire images.

Fig. 12. Test Fire Image by Tensorflow Framework

Fig. 13. Test Non-Fire Image by Tensorflow Framework

Compared to the confusion matrix provided in the last section, Table 1 shows that the non-fire class achieves higher accuracy in real situations. The reason may be that there are some numbers of images about to be on fire in the fire dataset. In addition, randomly selecting sample images results in huge uncertainty so that the model mistake the situation as the other one.

Table 1. Ground Truth Table by Tensorflow Framework.(Unit: number of images.)

4.3.2. PyTorch Framework

Fig. 14. Test Fire Image by PyTorch Framework

In contrast to the confusion matrix presented earlier in Section 4.2.2, Table 2 illustrates that among the 30 samples in each class, 3 samples are misclassified in both fire and non-fire categories. This could stem from the challenges of the model in discerning between fire and non-fire images that share similar visual attributes in the initial stages of the model training process; moreover, the random selection of sample images introduces variability that occasionally results in misclassification.

Table 2. Ground Truth Table by PyTorch Framework.(Unit: number of images.)

5. Conclusion

In this study, our ultimate objective is to implement a lightweight neural network on edge devices, which is KR260, to address the challenge of wildfire detection in natural environments. Initially, we experiment with a color detection method based on K-means clustering; consequently, the accuracy of predicting fire does not meet our needs. Considering the performance limitations of the KR260, we opt to train a simple convolutional neural network (CNN) model on a PC and test its performance on the KR260 board.

Our proposed CNN model performs well in evaluations of loss and accuracy across both frameworks, indicating its effectiveness in distinguishing between fire and non-fire scenarios. Finally, integrating the model into an embedded system for forest fire prevention demonstrates the ability of our method to accurately predict wildfires in natural environments.

6. future work

For future work, we aim to enhance our wildfire detection system by exploring more advanced neural network architectures and optimization techniques to further increase the performance of the model. Expanding the dataset with diverse and challenging fire scenarios, such as some animals in the same color as fire appearing in the forest, will also be a priority to ensure the reliability of the system in different natural environments. Our goal is to achieve real-time wildfire monitoring solely through image processing without temperature and humidity sensors. In the end, real-world deployment and long-term monitoring will be conducted to validate the effectiveness of the system in actual wildfire prevention efforts.

acknowledgement

We would like to express our gratitude to whom has great contribution to the successful completion of this research. Firstly, we would like to thank our advisor, Pei-Jeng Wang, for his unwavering support, invaluable guidance, and encouragement. His insights and suggestions have significantly enhanced the quality of this work. We also extend our appreciation to our and lab mates for their assistance. Their feedback and advice have been instrumental in refining my ideas and approaches. Moreover, we are quite grateful that AMD provides the edge device KR260, which was sponsored by the company. Their support, including the necessary funding and resources, as well as offering the facilities and equipment required for the experiments, has been invaluable. Finally, we are profoundly thankful to our family and friends for their steadfast support and understanding; their patience and encouragement have really been a source of strength for us.

References

[1] "Getting Started with Kria KR260 Robotics Starter Kit, " AMD, Available: https://www.amd.com/en/products/system-on-modules/kria/k26/kr260-robotics-starter-kit/getting-started/connecting-everything.html. [Accessed: June, 2024].

[2] Kaggle, Fire Dataset, https://www.kaggle.com/datasets/phylake1337/fire-dataset, accessed on: June 28, 2024.

[3] Emanuel Sousa Tomé, Rita P. Ribeiro, Inês Dutra, and Arlete Rodrigues, "An Online Anomaly Detection Approach for Fault Detection on Fire Alarm Systems, " Sensors, vol. 23, no. 10, pp. 4902, May 2023. Available online: https://doi.org/10.3390/s23104902.

[4] Hao Wu, Deyang Wu, Jinsong Zhao, "An intelligent fire detection approach through cameras based on computer vision methods, " Process Safety and Environmental Protection, vol. 127, pp. 245-256, July 2019.

[5] Turgay Celik, Kai-Kuang Ma, "Computer vision based fire detection in color images, " in 2008 IEEE Conference on Soft Computing in Industrial Applications.

[6] Xuanxuan Hong, Wei Wang, Quanli Liu, "Design and Realization of Fire Detection Using Computer Vision Technology, " in 2019 Chinese Control And Decision Conference (CCDC).

[7] Viktor Tuba, Romana Capor-Hrosik, Eva Tuba. (2017) Forest Fires Detection in Digital Images Based on Color Features. International Journal of Environmental Science, 2, 66-70

[8] Zhao, R., Niu, X., Wu, Y., Luk, W., Liu, Q. (2017). Optimizing CNN-Based Object Detection Algorithms on Embedded FPGA Platforms. In: Wong, S., Beck, A., Bertels, K., Carro, L. (eds) Applied Reconfigurable Computing. ARC 2017. Lecture Notes in Computer Science(), vol 10216. Springer, Cham. https://doi.org/10.1007/978-3-319-56258-2_22

[9] K. Muhammad, J. Ahmad, Z. Lv, P. Bellavista, P. Yang and S. W. Baik, "Efficient Deep CNN-Based Fire Detection and Localization in Video Surveillance Applications, " in IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 49, no. 7, pp. 1419-1434, July 2019, doi: 10.1109/TSMC.2018.2830099.

# -*- coding: utf-8 -*-
"""
Created on Mon May 13 15:47:43 2024

@author: a3311
"""

import numpy as np
import os
from sklearn.metrics import confusion_matrix
import seaborn as sn; sn.set(font_scale=1.4)
from sklearn.utils import shuffle           
import matplotlib.pyplot as plt             
import cv2                                 
import tensorflow as tf                
from tqdm import tqdm
from keras.models import Sequential
from keras.layers import Dense, Activation, Dropout, Flatten
from keras.layers import Conv2D
from keras.layers import MaxPooling2D
from keras.optimizers import SGD, Adam


class_names = ['nofire','fire']
class_names_label = {class_name:i for i, class_name in enumerate(class_names)}
# 'nofire': 0 / 'fire': 1

nb_classes = len(class_names)

IMAGE_SIZE = (64, 64)

def load_data():
    datasets = ['seg_train', 'seg_test']
    output = []
    
    # Iterate through training and test sets
    for dataset in datasets:
        
        images = []
        labels = []
        
        print("Loading {}".format(dataset))
        
        # Iterate through each folder corresponding to a category
        for folder in os.listdir(dataset):
            label = class_names_label[folder]
            
            # Iterate through each image in our folder
            for file in tqdm(os.listdir(os.path.join(dataset, folder))):
                
                # Get the path name of the image
                img_path = os.path.join(os.path.join(dataset, folder), file)
                
            
                # Open and resize the img
                image = cv2.imread(img_path)
                image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
                #Read photos with OpenCV, colors are considered as BGR, need to convert to RGB, errors indicate grayscale or already converted.
                image = cv2.resize(image, IMAGE_SIZE) 
                
                # Append the image and its corresponding label to the output
                images.append(image)
                labels.append(label)
                
        
                
        images = np.array(images, dtype = 'float32')
        labels = np.array(labels, dtype = 'int32')   
        
        output.append((images, labels))

    return output

(train_images, train_labels), (test_images, test_labels) = load_data()

#%%
'random'
train_images, train_labels = shuffle(train_images, train_labels, random_state=25)
'standardization'
train_images = train_images / 255.0 
test_images = test_images / 255.0
#%%
'Modeling'
input_shape = (64, 64, 3)

model = Sequential([
    Conv2D(64, (3, 3), input_shape=input_shape, padding='same',
           activation='relu', strides=2),
    MaxPooling2D(pool_size=(2, 2), strides=2),
    Dropout(0.2),
    Conv2D(128, (3, 3), input_shape=input_shape, padding='same',
           activation='relu', strides=2),
    MaxPooling2D(pool_size=(2, 2), strides=2),
    Dropout(0.2),
    Flatten(),
    Dropout(0.5),
    Dense(2, activation='softmax') #output layeractivaiton with softmax
])

model.compile(optimizer = 'adam', #SGD(lr=0.1)
              loss = 'sparse_categorical_crossentropy',
              metrics=['accuracy'])
#%%
history = model.fit(train_images, train_labels, 
                    #validation_data=(test_images, test_labels),
                    #verbose=2,callbacks=[earlyStop],
                    batch_size=128, epochs=50)
#%%
'model report'
plt.title('train_loss')
plt.ylabel('loss')
plt.xlabel('Epoch')
plt.plot(history.history["loss"])
#%%
'prediction'
predictions = model.predict(test_images)     # Vector of probabilities
pred_labels = np.argmax(predictions, axis = 1) # We take the highest probability

#%%
'confusion matrix'
CM = confusion_matrix(test_labels, pred_labels)
def accuracy(confusion_matrix):
    diagonal_sum = confusion_matrix.trace()
    sum_of_all_elements = confusion_matrix.sum()
    return diagonal_sum / sum_of_all_elements 
print(accuracy(CM))

#use function
from sklearn import metrics
#Print the prediction accuracy
print ("Accuracy = ", metrics.accuracy_score(test_labels, pred_labels))
#%%
'visiable confusion matrix'
ax = plt.axes()
sn.heatmap(CM, annot=True, 
           annot_kws={"size": 10}, 
           xticklabels=class_names, 
           yticklabels=class_names, ax = ax)
ax.set_title('Confusion matrix')
plt.show()
#%%
'save model'
from keras.models import load_model
model.save("CNN_model.keras")
model = load_model('CNN_model.keras')

import glob
import os
import random

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

import torch
import torchvision
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim

from PIL import Image
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay
from torch.utils.data import DataLoader, Dataset, RandomSampler, Subset, random_split
from torchvision import transforms, models, datasets
from tqdm import tqdm

# device
# os.environ["CUDA_VISIBLE_DEVICES"] = "0"
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
print(f'Using {device} device')

# set the seed
def set_all_seed(seed):
    np.random.seed(seed)
    random.seed(seed)
    torch.manual_seed(seed)
set_all_seed(123)

# batch size
batch_size = 256

# resize the data image
resize_transform = transforms.Resize((32, 32))
# resize_transform = transforms.Resize((460, 860))

# data preprocessing
train_transform = transforms.Compose([
    transforms.Pad(4, padding_mode='reflect'),
    transforms.RandomHorizontalFlip(),
    transforms.RandomRotation(10),
    resize_transform,
    transforms.RandomCrop(32),
    transforms.ColorJitter(brightness=0.2, contrast=0.2, saturation=0.2, hue=0.2),
    transforms.RandomErasing(),
    transforms.ToTensor(),
])
test_transform = transforms.Compose([
    resize_transform,
    transforms.ToTensor(),
])

# import dataset
#dataset_path = r'C:\Users\112033639\Desktop\\AMD_contest\Dataset\Fire'
#train_dataset = datasets.ImageFolder(root=dataset_path, transform=train_transform)
# = datasets.ImageFolder(root=dataset_path, transform=test_transform)

#train_dataloader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True, pin_memory=True)
#test_dataloader = DataLoader(test_dataset, batch_size=batch_size, shuffle=False, pin_memory=True)

# import dataset
dataset_path = r'C:\Users\112033639\Desktop\\AMD_contest\Dataset\Fire\fire_dataset'
dataset = datasets.ImageFolder(root=dataset_path)

# Split the dataset into training and validation sets (80% training, 20% testing)
train_size = int(0.8 * len(dataset))
test_size = len(dataset) - train_size
train_dataset, test_dataset = random_split(dataset, [train_size, test_size])

# Apply transforms
train_dataset.dataset.transform = train_transform
test_dataset.dataset.transform = test_transform

# Create data loaders
train_dataloader = DataLoader(train_dataset, batch_size=batch_size, shuffle=True, pin_memory=True)
test_dataloader = DataLoader(test_dataset, batch_size=batch_size, shuffle=False, pin_memory=True)

# bulid the network
class NeuralNetwork(nn.Module):
    def __init__(self,input_size, output_size):
        super().__init__()
        self.flatten = nn.Flatten()
        self.linear_relu_stack = nn.Sequential(
            nn.Linear(input_size, 2048),  
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(2048, 2048),
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(2048, 1024),
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(1024, 512),
            nn.ReLU(),
            nn.Dropout(0.5),
            nn.Linear(512, output_size), 
        )

    def forward(self, x):
        x = self.flatten(x)
        logits = self.linear_relu_stack(x)
        # logits = self.hidden_layers(x)
        return logits

# class NeuralNetwork(nn.Module):
    # def __init__(self, input_channels, output_size):
        # super(NeuralNetwork, self).__init__()
        # self.conv1 = nn.Conv2d(input_channels, 64, kernel_size=3, stride=2, padding=1)
        # self.pool1 = nn.MaxPool2d(kernel_size=2, stride=2)
        # self.dropout1 = nn.Dropout(0.5)
        # self.conv2 = nn.Conv2d(64, 128, kernel_size=3, stride=2, padding=1)
        # self.pool2 = nn.MaxPool2d(kernel_size=2, stride=2)
        # self.dropout2 = nn.Dropout(0.5)
        # self.flatten = nn.Flatten()
        # self.fc1 = nn.Linear(128 * 2 * 2, 512)  
        # self.dropout3 = nn.Dropout(0.5)
        # self.fc2 = nn.Linear(512, output_size)
    
    # def forward(self, x):
        # x = self.conv1(x)
        # x = F.relu(x)
        # x = self.pool1(x)
        # x = self.dropout1(x)
        # x = self.conv2(x)
        # x = F.relu(x)
        # x = self.pool2(x)
        # x = self.dropout2(x)
        # x = self.flatten(x)
        # x = self.fc1(x)
        # x = F.relu(x)
        # x = self.dropout3(x)
        # x = self.fc2(x)
        # return x


input_channels = 3
input_size = 32*32*3
output_size = 2


# load the model
# model = NeuralNetwork(input_channels, output_size)
model = NeuralNetwork(input_size, output_size)
model.eval()
print(model)

# num_fc = model.fc.in_features
# model.fc = nn.Linear(num_fc, 2)

# loss function
loss_fn = nn.CrossEntropyLoss()

# optimizer (L2 normalization)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-3)

# learning rate scheduler
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.1)


# train
def train(dataloader, model, loss_fn, optimizer):
    num_batches = len(dataloader)
    size = len(dataloader.dataset)
    epoch_loss = 0
    correct = 0
    
    all_preds = []
    all_labels = []

    model.train()

    for X, y in tqdm(dataloader):
        X, y = X.to(device), y.to(device)

        # Compute prediction error
        pred = model(X)
        loss = loss_fn(pred, y)

        # Backpropagation
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        epoch_loss += loss.item()
        pred = pred.argmax(dim=1, keepdim=True)
        correct += pred.eq(y.view_as(pred)).sum().item()
        
        # Store predictions and labels
        all_preds.extend(pred.view(-1).cpu().numpy())
        all_labels.extend(y.cpu().numpy())

    avg_epoch_loss = epoch_loss / num_batches
    avg_acc = correct / size
    
    # Calculate confusion matrix
    cm = confusion_matrix(all_labels, all_preds)
    disp = ConfusionMatrixDisplay(confusion_matrix=cm)
    disp.plot()
    plt.show()

    return avg_epoch_loss, avg_acc

# test
def test(dataloader, model, loss_fn):
    num_batches = len(dataloader)
    size = len(dataloader.dataset)
    epoch_loss = 0
    correct = 0
    
    all_preds = []
    all_labels = []

    model.eval()

    with torch.no_grad():
        for X, y in tqdm(dataloader):

            X, y = X.to(device), y.to(device)

            pred = model(X)

            epoch_loss += loss_fn(pred, y).item()
            pred = pred.argmax(dim=1, keepdim=True)
            correct += pred.eq(y.view_as(pred)).sum().item()
            
            # Store predictions and labels
            all_preds.extend(pred.view(-1).cpu().numpy())
            all_labels.extend(y.cpu().numpy())

    avg_epoch_loss = epoch_loss / num_batches
    avg_acc = correct / size
    
    # Calculate confusion matrix
    cm = confusion_matrix(all_labels, all_preds)
    disp = ConfusionMatrixDisplay(confusion_matrix=cm)
    disp.plot()
    plt.show()

    return avg_epoch_loss, avg_acc

# epoch
epochs = 50

# Initialize lists to store training and testing metrics
train_losses = []
train_accuracies = []
test_losses = []
test_accuracies = []

# train accuracy initialization
train_acc1 = 0

# test accuracy initialization
test_acc1 = 0

# start_train
for epoch in range(epochs):

    # train loss/accuracy
    train_loss, train_acc = train(train_dataloader, model, loss_fn, optimizer)
    train_losses.append(train_loss)
    train_accuracies.append(train_acc)
    print(f"Epoch {epoch + 1:2d}: Loss = {train_loss:.4f} Acc = {train_acc:.2f}")
    train_acc1+=train_acc

    # test loss/accuracy
    test_loss, test_acc = test(test_dataloader, model, loss_fn)
    test_losses.append(test_loss)
    test_accuracies.append(test_acc)
    print(f"Epoch {epoch + 1:2d}: Loss = {train_loss:.4f} Acc = {train_acc:.2f} Test_Loss = {test_loss:.4f} Test_Acc = {test_acc:.2f}")
    test_acc1 += test_acc
    
    # save the model
    torch.save(model.state_dict(), f'model_epoch_{epoch}.pth')

# Plot accuracy and loss curves
epochs_range = range(1, epochs + 1)

plt.figure(figsize=(12, 5))

# Plot training & validation accuracy
plt.subplot(1, 2, 1)
plt.plot(epochs_range, train_accuracies, label='Training Accuracy')
plt.plot(epochs_range, test_accuracies, label='Validation Accuracy')
plt.xlabel('Epochs')
plt.ylabel('Accuracy')
plt.title('Training and Validation Accuracy')
plt.legend()

# Plot training & validation loss
plt.subplot(1, 2, 2)
plt.plot(epochs_range, train_losses, label='Training Loss')
plt.plot(epochs_range, test_losses, label='Validation Loss')
plt.xlabel('Epochs')
plt.ylabel('Loss')
plt.title('Training and Validation Loss')
plt.legend()

plt.show()

import cv2
import numpy as np
from sklearn.cluster import AgglomerativeClustering
from collections import Counter
import pandas as pd
import numpy as np 
import matplotlib.pyplot as plt 
import sklearn
from sklearn.cluster import KMeans
from collections import Counter
from skimage.color import rgb2lab, deltaE_cie76
import random

def get_image(image_path, downsample_factor=0.2):
    image = cv2.imread(image_path)  
    image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)  
    if downsample_factor < 1:  # 
        image = cv2.resize(image, (0, 0), fx=downsample_factor, fy=downsample_factor)
    return image

def RGB2HEX(color):
    return "#{:02x}{:02x}{:02x}".format(int(color[0]), int(color[1]), int(color[2]))  

def top_down_hierarchical_clustering(image, num_colors, sample_size=10000, random_seed=50):
    pixels = image.reshape((-1, 3))
    np.random.seed(random_seed)  # 
    sample_size = min(sample_size, len(pixels))
    random_indices = np.random.choice(len(pixels), sample_size, replace=False)  
    sample_pixels = pixels[random_indices]
    clustering = AgglomerativeClustering(n_clusters=num_colors)
    labels = clustering.fit_predict(sample_pixels)

    cluster_centers = []
    for label in range(num_colors):
        cluster_center = sample_pixels[labels == label].mean(axis=0)
        cluster_centers.append(cluster_center)

    return np.array(cluster_centers), labels

image = get_image('fire.41.png', downsample_factor=0.1)

number_of_colors = 10

cluster_centers, labels = top_down_hierarchical_clustering(image, number_of_colors)
modified_image = image.reshape(image.shape[0]*image.shape[1], 3)  #  RGB 
clf = KMeans(n_clusters=number_of_colors, init=cluster_centers, n_init=1)
labels = clf.fit_predict(modified_image)  # 
counts = Counter(labels)  # 

center_colors = clf.cluster_centers_  # 
# 
ordered_colors = [center_colors[i] for i in counts.keys()]  # 
hex_colors = [RGB2HEX(ordered_colors[i]) for i in counts.keys()]  # 
rgb_colors = [ordered_colors[i] for i in counts.keys()]  #  RGB 

plt.title('Colors Detection ($n=10$)', fontsize=20)  # 
plt.pie(counts.values(), labels = hex_colors, colors = hex_colors)  #

# -*- coding: utf-8 -*-
"""
Created on Mon May 13 16:54:57 2024

@author: a3311
"""

# necessary libraries
import numpy as np
import cv2
from keras.models import load_model

# load model
model = load_model("CNN_model.keras")

# read one image and preprocess
def preprocess_image(image_path):
    # read image
    image = cv2.imread(image_path)
    # change color space
    image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)
    # adjust size
    image = cv2.resize(image, (64, 64))
    # normalization
    image = image / 255.0
    # change image into numpy array and add one dimensionality
    # so that it meets the needs of model input
    image = np.expand_dims(image, axis=0)
    return image

# define image path
image_path = "fire.688.png"  

# preprocess image and predict
image = preprocess_image(image_path)
prediction = model.predict(image)

# the probability of nofire(left) / fire(right)
print(prediction)

# print the results of prediction
if (prediction[0][0] > prediction[0][1]):
    print("non-fire")
else:
    print("fire")

import numpy as np
import cv2
import torch
import torch.nn as nn
import torchvision.transforms as transforms
from PIL import Image

# model structure
class NeuralNetwork(nn.Module):
    def __init__(self,input_size, output_size):
        super().__init__()
        self.flatten = nn.Flatten()
        self.linear_relu_stack = nn.Sequential(
            nn.Linear(input_size, 1024),  
            nn.ReLU(),
            nn.Linear(1024, 1024),
            nn.ReLU(),
            nn.Linear(1024, 512),  
            nn.ReLU(),
            nn.Linear(512, output_size),  
        )

    def forward(self, x):
        x = self.flatten(x)
        logits = self.linear_relu_stack(x)
        # logits = self.hidden_layers(x)
        return logits

# initialize
input_size = 32 * 32 * 3  
output_size = 2
model = NeuralNetwork(input_size, output_size)

# load state dictionary
# enter in the last epoch (.pth)
model.load_state_dict(torch.load("model_epoch_4.pth"))
model.eval()

# data preprocess
def preprocess_image(image_path):
    # read image
    image = Image.open(image_path).convert("RGB")
    # image preprocess
    transform = transforms.Compose([
        transforms.Resize((32, 32)),  
        transforms.ToTensor(),
    ])
    image = transform(image)
    image = image.unsqueeze(0)  
    return image

# image path
image_path = "fire.41.png"  

# predict
image = preprocess_image(image_path)
image = image.to(torch.device('cpu'))  

with torch.no_grad():
    output = model(image)
    prediction = torch.argmax(output, dim=1)
    print(f'Predicted class: {prediction.item()}')

# print
if prediction.item() == 0:
    print("non-fire")
else:
    print("fire")

Credits

SIH-MIN, LIU

1 project • 0 followers

陳博琳

1 project • 0 followers

我一直沒辦法被找到，我想要參加比賽，哭阿

Forest Fire Prevention

Things used in this project

Hardware components

Software apps and online services

Story

E-mail:s112030504@m112.nthu.edu.tw

E-mail:s112033639@m112.nthu.edu.tw

E-mail: pjwang@pme.nthu.edu.tw

Abstract

Code

cnn.py

cnn_test.py

hier_kmeans.py

judgement.py

kr260_test.py

Credits

SIH-MIN, LIU

陳博琳

Comments

Embed the widget on your own site

Forest Fire Prevention

Forest Fire Prevention

Things used in this project

Hardware components

Software apps and online services

Story

E-mail:s112030504@m112.nthu.edu.tw

E-mail:s112033639@m112.nthu.edu.tw

E-mail: pjwang@pme.nthu.edu.tw

Abstract

Code

cnn.py

cnn_test.py

hier_kmeans.py

judgement.py

kr260_test.py

Credits

SIH-MIN, LIU

陳博琳

Comments

Related channels and tags