Week 6

Week 6

March 13, 2022

This week I finished programming the basic CNN model using transfer learning. I decided to train it for 20 epochs to make sure there weren’t any runtime errors in my code. As I don’t own an Nvidia GPU (I have an AMD GPU), I couldn’t make use of the pytorch version that utilised CUDA to speed up processing. There is a RocM version of pytorch for AMD GPUs1 but RocM isn’t as mature as CUDA and only officially supports a small subset of Linux distributions. Therefore, for this model I trained on the COU for only 20 epochs just to make sure it ran successfully before trying to find some Nvidia compute.

Transfer learning model architecture #

Full #

Transfer learning architecture

Simplified #

Simplified Transfer learning architecture

Validation and train loss over 20 epochs #

plot of training and validation loss over 20 epochs


  1. https://pytorch.org/blog/pytorch-for-amd-rocm-platform-now-available-as-python-package/ ↩︎