Human Action Recognition (HAR) and Speech Recognition (SR) using Data Science

Dr. Sumithra Devi K A; Swet raj Shrivastava; Pranav Ranjan; Romit Dev

DOI: https://www.doi.org/10.59256/indjcst.20250402026


Original Article


Dr. Sumithra Devi K A¹, Swet raj Shrivastava², Pranav Ranjan³, Romit Dev⁴

¹Dean Academics and Head, Computer Engineering & Engineering in Data Science, Dayananda Sagar Academy of Technology and Management, Bengaluru, Karnataka, India.
²,³,⁴Students, Computer Engineering & Engineering in Data Science, Dayananda Sagar Academy of Technology and Management, Bengaluru, Karnataka, India.

Published Online: May-August 2025

Pages: 206-209


