A Comprehensive Video Dataset for Multi-Modal Recognition Systems

Anand Handa; Rashi Agarwal; Narendra Kohli

doi:10.5334/dsj-2019-055

Authors

Anand Handa Dr. APJ Abdul Kalam Technical University, Kanpur https://orcid.org/0000-0003-0075-1165
Rashi Agarwal Department of IT, University Institute of Engineering and Technology, Kanpur
Narendra Kohli Department of CSE, Harcourt Butler Technical University, Kanpur

DOI:

https://doi.org/10.5334/dsj-2019-055

Keywords:

Machine learning, Deep learning, video datasets, Convolutional Neural Network

Abstract

This paper presents a comprehensive, highly defined and fully labelled video dataset. The dataset consists of videos of 67 different subjects. In each video, the subject recites the same text, the numbers from 1 to 20, and all videos are recorded using the same experimental setup. The dataset can serve as a resource for researchers and analysts training deep neural networks to build efficient and accurate recognition models in various domains of computer vision, such as face recognition, expression recognition, speech recognition, and text recognition. In this paper, we also train face recognition and speech recognition models on our dataset and compare the results with those obtained on publicly available datasets to show the effectiveness of our dataset. The experimental results show that the models achieve higher accuracy on our dataset than on the other datasets on which they are tested.

Author Biographies

Anand Handa, Dr. APJ Abdul Kalam Technical University, Kanpur

Anand Handa is a research scholar in the computer science and engineering department at Dr. APJ Abdul Kalam Technical University, Lucknow, UP, India. His areas of interest include image processing, computer vision, and machine learning.

Rashi Agarwal, Department of IT, University Institute of Engineering and Technology, Kanpur

Rashi Agarwal is an assistant professor and head of the Department of Information Technology at the University Institute of Engineering and Technology, CSJM University, Kanpur, UP, India. She received her Ph.D. in image processing from Dr. APJ Abdul Kalam Technical University, Lucknow. Her areas of interest include image processing and machine learning.

Narendra Kohli, Department of CSE, Harcourt Butler Technical University, Kanpur

Narendra Kohli is a professor in the Department of Computer Science and Engineering at Harcourt Butler Technical University, Kanpur, UP, India. He received his Ph.D. in image processing from the Indian Institute of Technology, Kanpur. His areas of interest include medical imaging, computer vision, and image processing.

Published

2019-11-08

Issue

Vol. 18 (2019)

Section

Data Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms. If a submission is rejected or withdrawn prior to publication, all rights return to the author(s):

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.

Submitting to the journal implicitly confirms that all named authors and rights holders have agreed to the above terms of publication. It is the submitting author's responsibility to ensure all authors and relevant institutional bodies have given their agreement at the point of submission.

Note: some institutions require authors to seek written approval in relation to the terms of publication. Should this be required, authors can request a separate licence agreement document from the editorial team (e.g. authors who are Crown employees).