CA3 and CA4: Deep Learning (CEG5304)
Deadline: April 14, 2023 at 5pm
Read carefully:
This is an individual assignment.
Academic integrity must be strictly
followed. Copying from other's code/text or from anyone or any
source is not allowed (unless stated clearly in the
instructions).
Exchanging codes/text is not allowed. Software
will be used to detect any form of source code plagiarism. Your
submitted code must be grouped/separated into the same parts in
the instructions. In your submission, you must provide us with
all necessary libraries, so that we can compile your code.
You are not allowed to use any
toolboxes unless stated in the instructions explicitly (if you
are not sure, you should ask). If you use
separate files in your submission, you must zip them to one
file. The submission must not be in separate files in different
times (it must be submitted together once). The deadline is strict,
so please prepare and plan early and carefully. Late
submission will cause score deduction (assuming 100 is the full
score, there will be deduction of 10 points in the first 24
hours, 30 points in the next 24 hours, 60 points in the next 24 hours,
and 100 points beyond this).
CA3: Style Transfer using Image-to-Image Translation
Figure 1: Style transfer using Image-to-Image Translation
In this assignment, your tasks is to translate from one
image to another image, as examples shown in Fig. 1.
CA3-Part 1: PyTorch Code
- Download the data from:
Kaggle: I’m Something of a
Painter Myself.
- Use the data to implement an appropriate algorithm to transfer an image into
Monet’s styles. You are allowed to use publicly available
code. However, it must be based on PyTorch. Moreover, you
must cite the source properly in your code and your report.
CA3-Part 2: Report
Write a report in PDF based on the below instructions. You
must write down the instruction numbers and also the
instructions themselves on your report along with your answers
(so that we can identify your answer rather easily).
- Write your name and student ID.
- Draw the architecture/pipeline. If you use the publicly
available code, you must state it clearly and provide the
link of the code.
- Explain how the method works by discussing the
pipeline.
- Test your model using the 3 provided images (in folder
CA3 Part 2-4 of the provided zip file), and show the results
in your report. Here is the provided zip
file: download.
- Test your model using the 3 provided images (in folder
CA3 Part 2-5 of the provided zip file), and show the results
in your report.
- Compare your results with the real monet paintings
(available in the dataset), and state any drawbacks of your
results.
- Provide any possible solutions for the stated drawbacks
or any possible improvements for your method. Provide
justifications for your solutions.
CA3-Part 3: Video
- Create a video presenting your report with duration of
maximum 10 minutes.
- In the video, you must use powerpoint to show your
slides.
- In the video, you must show your face next to your
powerpoint slides while you're presenting.
- The video must be in the AVI or MP4 format.
- The presentation must follow the contents of your report.
* Any recording tool is fine, but you might want to use Zoom to record your presentation.
CA4: Semantic Segmentation
Figure 2: Semantic Segmentation
In this assignment, your task is to segment an input image
semantically based on the classes, as examples shown
in Fig. 2.
CA4-Part 1: PyTorch Code
- Find from the internet or create your best
model/network for semantic segmentation. You are allowed to
use any publicly available code. However, it must be based
on PyTorch. Moreover, you must cite the source properly in
your code and your report.
- Train your network using the The Oxford-IIIT Pet Dataset
dataset. If you are using your own GPU, you need to download
the dataset
from: The Oxford-IIIT Pet Dataset website.
CA4-Part 2: Report
Write a report in PDF based on the below instructions. You
must write down the instruction numbers and also the
instructions themselves on your report along with your answers
(so that we can identify your
answer rather easily).
- Write your name and student ID.
- Draw the architecture/pipeline. If you use the publicly
available code, you must state it clearly and provide the
link of the code.
- Explain how the method works by discussing the pipeline.
- Compute the mIoU of your segmentation results with
respect to the corresponding ground-truths.
- Test your model using the provided 3 pet images (in
folder CA4 Part 2-5 of the provided zip file) in different styles, and show the results
in your report. Here is the provided zip
file: download.
- For the previous instruction, if there are any wrong
segmentations, discuss why they are wrong.
- Test your model using the provided 3 pet images (in
folder CA4 Part 2-7 of the provided zip file)
that are taken in nighttime, and show the results
in your report.
- For the previous instruction, if there are any wrong
segmentations, discuss why they are wrong.
- Provide possible solutions for the nighttime
segmentation problem. Provide justifications for your
solutions. The more details your solution is, the
more points you will get.
CA4-Part 3: Video
- Create a video presenting your report with duration of
maximum 10 minutes.
- In the video, you must use powerpoint to show your
slides.
- In the video, you must show your face next to your
powerpoint slides while you're presenting.
- The video must be in the AVI or MP4 format.
- The presentation must follow the contents of your
report.
* Any recording tool is fine, but you might want to use Zoom to record your presentation.
Tips for GPU Resources
Running on GPU is more efficient for training your models. If
you do not have GPU resources, you may
consider
Google Colab.
Submission
Submit your code, report and video via Canvas by
the deadline.