CS 5043: HW3: Convolutional Neural Networks

Objectives

Assignment Notes

Data Set

The CORe50 data set is a large collection of videos of objects being moved and rotated under a variety of lighting and background conditions. Our general task is to classify the object shown in a single frame from one of these videos.

Data Organization


Provided Code

We are providing the following code, posted on the main course web page:


Prediction Problem

We will focus on distinguishing four object classes: mugs, cans, balls, and cups. For HW 3, all object instances are included in each data fold. However, different folds are composed of different background conditions. Hence, we are building a model that can distinguish these objects from one another against arbitrary backgrounds (i.e., we are building models that can distinguish all mugs from all cans, balls, and cups).

The provided code (starting at load_data_set()) will set up the object classes and TF Datasets for you.


Architectures

You will create two convolutional neural networks to distinguish these four classes: one will be a shallow network and the other will be a deep network. Each will nominally have the following structure:

Since the data set is relatively small (in terms of the number of distinct objects), it is important to take steps to address the over-fitting problem. Here are the key tools that you have:

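Because over-fitting risk grows with model capacity, it can help to estimate how many trainable parameters a candidate network has before training it. The following is a minimal sketch in plain Python, not part of the provided code. It assumes square kernels, 'valid' (no padding) convolutions with stride 1, and non-overlapping pooling. The 128x128 RGB input, the layer sizes, and all function names are hypothetical illustrations, not the structure required by the assignment.

```python
def conv_output_size(size, kernel, stride=1):
    """Spatial size after a 'valid' (unpadded) convolution or pooling step."""
    return (size - kernel) // stride + 1


def conv_params(in_channels, out_channels, kernel):
    """Trainable parameters of a 2D convolution with a square kernel and bias."""
    return (kernel * kernel * in_channels + 1) * out_channels


def dense_params(n_in, n_out):
    """Trainable parameters of a fully connected layer with bias."""
    return (n_in + 1) * n_out


def count_parameters(image_size, channels, conv_layers, dense_units, n_classes):
    """Count trainable parameters of a simple conv -> pool -> dense stack.

    conv_layers: list of (filters, kernel, pool) tuples; pool=1 means no pooling.
    dense_units: list of hidden dense layer sizes placed after flattening.
    """
    total = 0
    size = image_size
    for filters, kernel, pool in conv_layers:
        total += conv_params(channels, filters, kernel)
        size = conv_output_size(size, kernel)
        if pool > 1:
            # Non-overlapping pooling: stride equals the pool size
            size = conv_output_size(size, pool, stride=pool)
        channels = filters

    # Flatten the final feature maps, then add the dense layers
    n_in = size * size * channels
    for units in dense_units:
        total += dense_params(n_in, units)
        n_in = units

    # Output layer: one unit per object class
    total += dense_params(n_in, n_classes)
    return total


# Hypothetical shallow configuration: two conv layers, one hidden dense layer
shallow_layers = [(10, 3, 2), (20, 3, 2)]
shallow_total = count_parameters(128, 3, shallow_layers, [100], 4)
print("Shallow network parameters:", shallow_total)
```

In this hypothetical configuration, almost all of the parameters sit in the first dense layer after flattening, so more aggressive pooling or striding before that layer is one way to shrink the model.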

Experiments


Hints / Notes


What to Hand In

A single zip file that contains:

Grading


andrewhfagg -- gmail.com

Last modified: Mon Mar 6 23:42:33 2023