Harnessing Convolutional Neural Networks and Transfer Learning to Perform Vision-Oriented Activity Recognition of Humans

Dr. Sunil Bhutada; B. Yeshwanth Raj

doi:10.32628/CSEIT2390111

Authors

Dr. Sunil Bhutada Sreenidhi Institute of Science & Technology, Yamnampet, Ghatkesar, Hyderabad, India
B. Yeshwanth Raj MTech Student, Department of IT, Sreenidhi Institute of Science & Technology, Yamnampet, Ghatkesar, Hyderabad, India

Keywords:

Human Activity Recognition, Criminological Investigations, Convolutional Neural Networks, Transfer Learning.

Abstract

Since the debut of the Internet of Things, there has always been a lot of noteworthy evolution in the aspect of "Human Activity recognition". Recognition of activities of humans possesses its own connotation and purpose and can be employed in diverse range of disciplines featuring medical assistance, nefarious activities, and espionage. It's possible that it could be critically pertinent in order to undergo ample amount of criminological investigations. To anticipate various human behaviors, a myriad of machine learning techniques are used. However, deep learning models have trounced standard machine learning strategies. Convolutional Neural Networks (CNN), a type of deep learning model, could very well heuristically extract the features and drastically cut overall processing expenditure. The action recognition kinetics dataset can be used to predict human activities using the CNN model. Here, we use transfer learning specifically for visual categorization problems.

References

B. Bhandari, J. Lu, X. Zheng, S. Rajasegarar, and C. Karmakar, “Noninvasive sensor based automated smoking activity detection,” in Pro-ceedings of Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE, 2017, pp. 845–848.
L. Yao, Q. Z. Sheng, X. Li, T. Gu, M. Tan, X. Wang, S. Wang, and W. Ruan, “Compressive representation for device-free activity recognition with passive rfid signal strength,” IEEE Transactions on Mobile Computing, vol. 17, no. 2, pp. 293–306, 2018.
I. Lillo, J. C. Niebles, and A. Soto, “Sparse composition of body poses and atomic actions for human activity recognition in rgb-d videos,” Image and Vision Computing, vol. 59, pp. 63–75, 2017.
W. Zhu, C. Lan, J. Xing, W. Zeng, Y. Li, L. Shen, and X. Xie, “Co-occurrence feature learning for skeleton based action recognition using regularized deep lstm networks,” in Thirtieth AAAI Conference on Artificial Intelligence, 2016.
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 1–9.
J. Deng, W. Dong, R. Socher, L. Li, Kai Li, and Li Fei-Fei, “ImageNet: A large-scale hierarchical image database,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, June 2009, pp. 248–255.
A. Jalal, N. Sarif, J. T. Kim, and T.-S. Kim, “Human activity recognition via recognized body parts of human depth silhouettes for residents monitoring services at smart home,” Indoor and built environment, vol. 22, no. 1, pp. 271–279, 2013.
K. Simonyan and A. Zisserman, “Two-stream convolutional networks for action recognition in videos,” in Advances in neural information processing systems, 2014, pp. 568–576.
G. Gkioxari, R. Girshick, and J. Malik, “Contextual action recognition with r* cnn,” in Proceedings of the IEEE international conference on computer vision, 2015, pp. 1080–1088.
L. Wang, Y. Xiong, Z. Wang, and Y. Qiao, “Towards good practices for very deep two-stream convnets,” arXiv preprint arXiv:1507.02159, 2015.
D. Tran, L. Bourdev, R. Fergus, L. Torresani, and M. Paluri, “Deep end2end voxel2voxel prediction,” in Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 2016, pp. 17–24.
P. Wang, W. Li, J. Wan, P. Ogunbona, and X. Liu, “Cooperative trainingof deep aggregation networks for rgb-d action recognition,” in Thirty-Second AAAI Conference on Artificial Intelligence, 2018.
S. Ji, W. Xu, M. Yang, and K. Yu, “3d convolutional neural networks for human action recognition,” IEEE transactions on pattern analysis and machine intelligence, vol. 35, no. 1, pp. 221–231, 2013.
P. Khaire, P. Kumar, and J. Imran, “Combining cnn streams of rgb-d and skeletal data for human activity recognition,” Pattern Recognition Letters, vol. 115, pp. 107–116, 2018.

Harnessing Convolutional Neural Networks and Transfer Learning to Perform Vision-Oriented Activity Recognition of Humans

Authors

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite