A Comprehensive Video Dataset for Multi-Modal Recognition Systems

Authors

  • Anand Handa, Dr. APJ Abdul Kalam Technical University, Kanpur, https://orcid.org/0000-0003-0075-1165
  • Rashi Agarwal, Department of IT, University Institute of Engineering and Technology, Kanpur
  • Narendra Kohli, Department of CSE, Harcourt Butler Technical University, Kanpur

DOI:

https://doi.org/10.5334/dsj-2019-055

Keywords:

Machine learning, Deep learning, video datasets, Convolutional Neural Network

Abstract

This paper presents a comprehensive, highly defined, and fully labelled video dataset. The dataset consists of videos of 67 different subjects. In every video the subject recites the same text, the digits from 1 to 20, and all videos are recorded using the same experimental setup. The dataset can serve as a resource for researchers and analysts training deep neural networks to build efficient and accurate recognition models in various domains of computer vision, such as face recognition, expression recognition, speech recognition, and text recognition. In this paper, we also train face recognition and speech recognition models on our dataset and compare the results with those obtained on publicly available datasets to show the effectiveness of our dataset. The experimental results show that the models achieve higher accuracy on our dataset than on the other datasets on which they are tested.

Author Biographies

Anand Handa, Dr. APJ Abdul Kalam Technical University, Kanpur

Anand Handa is a research scholar in the Computer Science and Engineering department at Dr. APJ Abdul Kalam Technical University, Lucknow, UP, India. His areas of interest include image processing, computer vision, and machine learning.

Rashi Agarwal, Department of IT, University Institute of Engineering and Technology, Kanpur

Rashi Agarwal is an assistant professor and head of the Department of Information Technology at the University Institute of Engineering and Technology, CSJM University, Kanpur, UP, India. She received her Ph.D. in image processing from Dr. APJ Abdul Kalam Technical University, Lucknow. Her areas of interest include image processing and machine learning.

Narendra Kohli, Department of CSE, Harcourt Butler Technical University, Kanpur

Narendra Kohli is a professor in the Department of Computer Science and Engineering at Harcourt Butler Technical University, Kanpur, UP, India. He received his Ph.D. in image processing from the Indian Institute of Technology, Kanpur. His areas of interest include medical imaging, computer vision, and image processing.

Published

2019-11-08

Section

Data Articles