Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors (PDF)

Name: Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors
Brand: GRIN Verlag
SKU: 96585814
Price: 15.99 EUR
Availability: OnlineOnly

(Sprache: Englisch)

Autor: Mike Nkongolo

Schreiben Sie einen Kommentar zu "Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors".

Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors, Mike Nkongolo

Merken

Mehr zum Inhalt Autorenporträt Video

sofort als Download lieferbar

Bestellnummer: 96585814

eBook (pdf) 15.99 €

Download bestellen

Verschenken

Lastschrift, Kreditkarte, Paypal, Rechnung
Kostenloser tolino webreader

Produktdetails

Produktbeschreibung
Autorenporträt
eBook Hilfe
Biblio. Angaben / Produktdetails

Produktinformationen zu „Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors (PDF)“

Research Paper (postgraduate) from the year 2018 in the subject Computer Science - Internet, New Technologies, , course: Machine Learning, language: English, abstract: Human Action Recognition is the task of recognizing a set of actions being performed in a video sequence. Reliably and efficiently detecting and identifying actions in video could have vast impacts in the surveillance, security, healthcare and entertainment spaces.

The problem addressed in this paper is to explore different engineered spatial and temporal image and video features (and combinations thereof) for the purposes of Human Action Recognition, as well as explore different Deep Learning architectures for non-engineered features (and classification) that may be used in tandem with the handcrafted features. Further, comparisons between the different combinations of features will be made and the best, most discriminative feature set will be identified.

In the paper, the development and implementation of a robust framework for Human Action Recognition was proposed. The motivation behind the proposed research is, firstly, the high effectiveness of gradient-based features as descriptors - such as HOG, HOF, and N-Jets - for video-based human action recognition. They are capable of capturing both the salient spatial
and temporal information in the video sequences, while removing much of the redundant information that is not pertinent to the action. Combining these features in a hierarchical fashion further increases performance.

Autoren-Porträt von Mike Nkongolo

Mike Nkongolo received the BSc (Hons) degree in computer science from the University of the Witwatersrand, Johannesburg, South Africa, in 2016. He is currently working toward the Masters degree in the School of Computer Science and Applied Mathematics, University of the Witwatersrand. His research interests include the theory and applications of Intelligent Systems, Web-based platforms and Machine Learning, Sentiment detection in Web Mining, and Artificial Intelligence-Natural Languages Processing.