Advanced Computer Vision Cap6412 (3 Credit Hours)

Instructor: Dr. Mubarak Shah


Office: HEC 245

Phone: 4078235077

Time: Tuesdays and Thursdays 3:00 to 4:15PM

Location: HEC 117

Schedule: Schedule

Office Hours: Tuesday 4:15 to 5:00PM; Thursdays 2:00 to 3:00PM and by appointment

Extra Discussion Session: Wednesdays 4:30 to 5:30, HEC 356

Course webpage:

Course Contents

This is an Advanced Computer Vision which will expose graduate students to the cutting-edge research. In each class we will discuss one recent research paper related to active areas of current research, in particular employing Deep Learning. Computer vision has been very active area of research for many decades and researchers have been working on solving important challenging problems. During the last few years, Deep Learning involving Artificial Neural Networks has been disruptive force in computer vision. Employing deep learning, tremendous progress has been made in a very short time in solving difficult problems and very impressive results have obtained in image and video classification, localization, semantic segmentation, etc. New techniques, datasets, hardware and software libraries are emerging almost every day. Deep Computer vision is impacting research in Robotics, Natural Language understanding, Computer Graphics, multi-modal analysis etc.


There is no text book for this class. We will discuss recent research papers.

Recommended supplemental textbook:

        Ian Goodfellow, Yoshua Bengio, Aaron Courville. Deep Learning.

Meeting Format

The class will meet twice a week for a 75 min lecture, taught by the instructor

Recommended online courses and useful links CS231n: Convolutional Neural Networks for Visual Recognition CS224n: Natural Language Processing with Deep Learning CS 294: Deep Reinforcement Learning Very nice explanations of some DL concepts



Grading Policy

        Reports 20%

        Presentation 10%

        Attendance and Discussion 20%

        Projects/Programs 50%

Late Policy


        0 for late reports 


        20% off per day

        up to 4 days




        GAN: Generative Adversarial Networks

        Deep Reinforcement Learning

        Semi and Unsupervised Learning

        Human Action and Activity Recognition

        Vision and Language


Student Learning Outcomes

After the completion of the course, the students should be able to:

        Read and understand a research paper.

        Write a comprehensive review of the paper.

        To identify strong and weak points of the papers.

        To come up with own ideas to solve the same problem, which may lead to their first research paper.

        To implement known method or work and successfully complete individual project.


Important Dates:



Statement on Academic Integrity:

The UCF Golden Rule ( ) will be observed in the class. Plagiarism and

Cheating of any kind on an examination, quiz, or assignment will result at least in an "F" for that assignment (and may, depending on the severity of the case, lead to an "F" for the entire course) and may be subject to appropriate referral to the Office of Student Conduct for further action. I will assume for this course that you will adhere to the academic creed of this University and will maintain the highest standards of academic integrity. In other words, don't cheat by giving answers to others or taking them from anyone else. I will also adhere to the highest standards of academic integrity, so please do not ask me to change (or expect me to change) your grade illegitimately or to bend or break rules for one person that will not apply to everyone.