Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors (PDF)
(Sprache: Englisch)
Research Paper (postgraduate) from the year 2018 in the subject Computer Science - Internet, New Technologies, , course: Machine Learning, language: English, abstract: Human Action Recognition is the task of recognizing a set of actions being performed in a...
sofort als Download lieferbar
eBook (pdf)
15.99 €
- Lastschrift, Kreditkarte, Paypal, Rechnung
- Kostenloser tolino webreader
Produktdetails
Produktinformationen zu „Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors (PDF)“
Research Paper (postgraduate) from the year 2018 in the subject Computer Science - Internet, New Technologies, , course: Machine Learning, language: English, abstract: Human Action Recognition is the task of recognizing a set of actions being performed in a video sequence. Reliably and efficiently detecting and identifying actions in video could have vast impacts in the surveillance, security, healthcare and entertainment spaces.
The problem addressed in this paper is to explore different engineered spatial and temporal image and video features (and combinations thereof) for the purposes of Human Action Recognition, as well as explore different Deep Learning architectures for non-engineered features (and classification) that may be used in tandem with the handcrafted features. Further, comparisons between the different combinations of features will be made and the best, most discriminative feature set will be identified.
In the paper, the development and implementation of a robust framework for Human Action Recognition was proposed. The motivation behind the proposed research is, firstly, the high effectiveness of gradient-based features as descriptors - such as HOG, HOF, and N-Jets - for video-based human action recognition. They are capable of capturing both the salient spatial
and temporal information in the video sequences, while removing much of the redundant information that is not pertinent to the action. Combining these features in a hierarchical fashion further increases performance.
The problem addressed in this paper is to explore different engineered spatial and temporal image and video features (and combinations thereof) for the purposes of Human Action Recognition, as well as explore different Deep Learning architectures for non-engineered features (and classification) that may be used in tandem with the handcrafted features. Further, comparisons between the different combinations of features will be made and the best, most discriminative feature set will be identified.
In the paper, the development and implementation of a robust framework for Human Action Recognition was proposed. The motivation behind the proposed research is, firstly, the high effectiveness of gradient-based features as descriptors - such as HOG, HOF, and N-Jets - for video-based human action recognition. They are capable of capturing both the salient spatial
and temporal information in the video sequences, while removing much of the redundant information that is not pertinent to the action. Combining these features in a hierarchical fashion further increases performance.
Autoren-Porträt von Mike Nkongolo
Mike Nkongolo received the BSc (Hons) degree in computer science from the University of the Witwatersrand, Johannesburg, South Africa, in 2016. He is currently working toward the Masters degree in the School of Computer Science and Applied Mathematics, University of the Witwatersrand. His research interests include the theory and applications of Intelligent Systems, Web-based platforms and Machine Learning, Sentiment detection in Web Mining, and Artificial Intelligence-Natural Languages Processing.
Bibliographische Angaben
- Autor: Mike Nkongolo
- 2018, 33 Seiten, Englisch
- Verlag: GRIN Verlag
- ISBN-10: 3668642591
- ISBN-13: 9783668642591
- Erscheinungsdatum: 21.02.2018
Abhängig von Bildschirmgröße und eingestellter Schriftgröße kann die Seitenzahl auf Ihrem Lesegerät variieren.
eBook Informationen
- Dateiformat: PDF
- Größe: 1.35 MB
- Ohne Kopierschutz
- Vorlesefunktion
Sprache:
Englisch
Kommentar zu "Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors"
Schreiben Sie einen Kommentar zu "Demystifying Human Action Recognition in Deep Learning with Space-Time Feature Descriptors".
Kommentar verfassen