ROBA-THESIS-2020.pdf (5.49 MB)

Leap: a Model-Based Reinforcement Learning Framework for Fast Object Detection

Download (5.49 MB)
thesis
posted on 01.08.2020, 00:00 by Edoardo Roba
Create a new algorithm for Object DetecKon through Python. The starKng point of the project was to consider the model-free algorithm described in Urbana-Champaign’s paper where they showed an algorithm for Object DetecKon based on Reinforcement Learning. The idea was to create an agent that could start from a state (a bounding box in our case) and reach the final target. The goal of our project was to make this project get closer to real-Kme simulaKons. Seen the projects for Fast RCNN and Faster RCNN, the goal of our project was to create an Object DetecKon algorithm based on Reinforcement Learning that could bypass the CNN somehow. We created a model-based algorithm, so during the Q-training, we recorded the TransiKons (state-acKon-next_state). These data are then used for the training process of a further network. The purpose of this new network was to predict the future TransiKons, in order to bypass the CNN every acKon the Agent was taking. Ader the correct design, the correct E-greedy policy assumpKon and the correct pre-processing, we end up with a faster algorithm with a low loss in accuracy (the more predicKons you do, the worse it gets, but the faster it goes).

History

Advisor

Trivedi, Amit

Chair

Trivedi, Amit

Department

Electrical and Computer Engineering

Degree Grantor

University of Illinois at Chicago

Degree Level

Masters

Degree name

MS, Master of Science

Committee Member

Bottino, Andrea Devroye, Natasha

Submitted date

August 2020

Thesis type

application/pdf

Language

en

Exports

Categories

Exports