Mostra i principali dati dell'item

dc.contributor.authorCiaparrone, Gioele
dc.date.accessioned2023-02-24T12:35:20Z
dc.date.available2023-02-24T12:35:20Z
dc.date.issued2021-10-11
dc.identifier.urihttp://elea.unisa.it:8080/xmlui/handle/10556/6450
dc.identifier.urihttp://dx.doi.org/10.14273/unisa-4522
dc.description2019 - 2020it_IT
dc.description.abstractIn recent years, deep learning (DL) has obtained numerous successes in analyzing complex data, such as images or audio. A particularly recent area of application is the analysis of videos. This thesis focuses on the application of deep learning algorithm to two video analysis tasks: Multiple Object Tracking (MOT) and Face-based Video Retrieval (FBVR). The first main part of the thesis presents an in-depth survey of the state of the art of DL-based MOT algorithms. This is the first comprehensive survey specifically on the use of DL for MOT, focusing on 2D frames extracted from single-camera videos. I identify the four main steps of a MOT algorithm and describe the various DL techniques used in the literature in each of those four steps. I also collect and compare results obtained by existing algorithms on the most common MOT datasets and I analyze the most successful techniques employed. Finally, I present a discussion about the open issues of current MOT algorithms and the possible solutions and future directions of research. The second part of the thesis focuses instead on the task of FBVR. I present a novel pipeline for the retrieval of unconstrained multi-shot videos using faces, specifically in the context of television-like videos. Since no existing dataset in the literature is appropriate for an end-to-end evaluation of the proposed pipeline, I build a large-scale video dataset by adapting the VoxCeleb2 dataset to the task of FBVR. I compare and evaluate numerous DL-based approaches for the various steps in pipeline, such as shot detection, face detection and face recognition, and I describe the advantages and disadvantages of each employed technique. ... [edited by Author]it_IT
dc.language.isoenit_IT
dc.publisherUniversita degli studi di Salernoit_IT
dc.subjectDeep learningit_IT
dc.subjectMultiple object trackingit_IT
dc.subjectVideo retrievalit_IT
dc.titleMultiple object tracking and face-based video retrieval: applications of deep learning to video analysisit_IT
dc.typeDoctoral Thesisit_IT
dc.subject.miurINF/01 INFORMATICAit_IT
dc.contributor.coordinatoreAntonelli, Valerioit_IT
dc.description.cicloXXXIII cicloit_IT
dc.contributor.tutorTagliaferri, Robertoit_IT
dc.identifier.DipartimentoScienze aziendali – Management & innovation systemsit_IT
 Find Full text

Files in questo item

Thumbnail
Thumbnail
Thumbnail

Questo item appare nelle seguenti collezioni

Mostra i principali dati dell'item