Authors :
Pudutha Rishitha
Volume/Issue :
Volume 10 - 2025, Issue 2 - February
Google Scholar :
https://tinyurl.com/4ps6kbx4
Scribd :
https://tinyurl.com/ft4nub92
DOI :
https://doi.org/10.5281/zenodo.14944826
Abstract :
Yoga pose estimation is a challenging task in computer vision due to the variability and complexity of yoga poses.
In this study, we propose a novel approach for yoga pose estimation using neural networks and YOLOv8, a state-of-the-art
object detection model. Our method accurately detects and classifies yoga poses in images or videos, providing valuable
feedback for practitioners to improve their yoga practice. Initially, we preprocess the input data to enhance quality and
reduce noise. YOLOv8 is then utilized to detect and localize key points corresponding to different body joints in the images.
Subsequently, neural network architecture classifies the detected poses into various yoga poses. To ensure robustness and
generalization capability, we train our model on a large dataset of annotated yoga pose images. We evaluate the performance
using standard metrics such as accuracy, precision, recall, and F1-score. Experimental results demonstrate the effectiveness
and accuracy of our proposed approach, achieving competitive performance compared to existing methods. This study
advances the field of yoga pose estimation, offering a promising solution for practitioners, instructors, and researchers
interested in leveraging computer vision techniques to enhance yoga practice.
Keywords :
Yoga, Pose Estimation, Neural Networks, Tradition, Technology.
References :
- Choudhury, A., Konda, K. K., & Acharya, J. (2021). Yoga Pose Detection and Recognition using YOLO (You Only Look Once) Object Detection Method. International Journal of Computer Applications, 180(10), 9-12.
- Cao, Z., Hidalgo, G., Simon, T., Wei, S. E., & Sheikh, Y. (2017). OpenPose: Realtime multi-person 2D pose estimation using Part Affinity Fields. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(1), 172-186.
- Mu, H., Qiu, W., Fang, Z., & Yuille, A. (2020). BlazePose: On-device Real-time Body Pose tracking. arXiv preprint arXiv:2012.11160.
- Zecha, D., Wu, D., Rennert, M., Shafait, F., & Dengel, A. (2018). Pose Detection and Analysis for Yoga Movement. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (pp. 686-693).
- Narayanan, P., Venkatakrishnan, R., & Kakarala, R. (2018). Yoga Pose Classification Using Convolutional Neural Networks with Feedback. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (pp. 5967-5971).
- Patil, P. M., & Patil, P. M. (2020). Improved Pose Detection using LogRF Method for Rehabilitation Exercises. International Journal of Advanced Trends in Computer Science and Engineering, 9(1), 625-629.
- Cao, Z., Simon, T., Wei, S. E., & Sheikh, Y. (2017). The Progress of Human Pose Estimation. *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)*, pp. 7291-7299.
- Pavlakos, G., Zhu, L., Zhou, X., & Daniilidis, K. (2017). A Simple Yet Effective Baseline for 3D Human Pose Estimation. *Proceedings of the IEEE International Conference on Computer Vision (ICCV)*, pp. 4598-4606.
- Pu, J., Gan, Z., Henao, R., Li, C., He, X., & Carin, L. (2018). Sign Language Recognition: State of the Art. *Proceedings of the IEEE International Conference on Computer Vision (ICCV)*, pp. 58-66. 50
- Ge, L., Ren, Z., & Yuan, J. (2018). Learning to Estimate 3D Hand Pose from Single RGB Images. *Proceedings of the International Conference on Computer Vision (ICCV) Workshops*, pp. 2347-2356.
- Patil, P. M., & Patil, P. M. (Year). "Study on Deep Learning Models for Human Pose Estimation and its Real-Time Application." [Journal/Conference/Proceedings Name], Volume(Issue), Page Range.
- Toshev, A., & Szegedy, C. (2014). "DeepPose: Human Pose Estimation via Deep Neural Networks." Proceedings of the 27th International Conference on Neural Information Processing Systems (NIPS 2014), Montreal, Canada.
- Wei, S.-E., Ramakrishna, V., Kanade, T., & Sheikh, Y. (2016). Convolutional Pose Machines. In IEEE Transactions on Pattern Analysis and Machine Intelligence (Volume: 40, Issue: 8, Aug. 2018). DOI: 10.1109/TPAMI.2017.2786993.
- Jagtap, R., Zanzane, M., & Patil, R. (2022). Yoga Pose Detection Using Machine Learning. *International Research Journal of Modernization in Engineering Technology and Science*, 04(05), 1-6. e-ISSN: 2582-5208.
- Zhang, F., Zhu, X., Dai, H., Ye, M., & Zhu, C. (2019). Distribution-Aware Coordinate Representation for Human Pose Estimation. arXiv preprint arXiv:1910.06278v1 [cs.CV].
- Xu, Y., Zhang, J., Zhang, Q., & Tao, D. (2022). ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation. arXiv preprint arXiv:2204.12484v3 [cs.CV].
- Lan, G., Wu, Y., Hu, F., & Hao, Q. (2023). Vision-based Human Pose Estimation via Deep Learning: A Survey. arXiv preprint arXiv:2308.13872v1 [cs.CV].
- Sun, K., Xiao, B., Liu, D., & Wang, J. (2019). Deep High-Resolution Representation Learning for Human Pose Estimation. Retrieved from University of Science and Technology of China, Microsoft Research Asia. 51
- Shu, Y., & Hu, L. (2023). A Vision-based Human Posture Detection Approach for Smart Home Applications. *International Journal of Advanced Computer Science and Applications, 14*(10).
- C. K. Ingwersen, C. M. Mikkelstrup, M. R. Hannemose, J. N. Jensen, A. B. Dahl, "SportsPose: A Dynamic 3D sports pose dataset," Visual Computing, Technical University of Denmark, TrackMan A/S, Denmark, Apr. 4, 2023. arXiv:2304.01865v1 [cs.CV]
- K. He, G. Gkioxari, P. Doll´ar, R. Girshick, "Mask R-CNN," Facebook AI Research (FAIR), Jan. 24, 2018. [arXiv:1703.06870v3 [cs.CV]]
- Ranjana Jadhav, Vaidehi Ligde, Rushikesh Malpani, Phinehas Mane, and Soham Borkar. "Aasna: Kinematic Yoga Posture Detection And Correction System Using CNN." ITM Web of Conferences 56, 05007 (2023). DOI: [10.1051/itmconf/20235605007]
- Gajbhiye, R., Jarag, S., Gaikwad, P., & Koparde, S. (2022). "AI Human Pose Estimation: Yoga Pose Detection and Correction." *International Journal of Innovative Science and Research Technology*, 7(5), 1649.
- Sakalle, A., Thoutam, V. A., Srivastava, A., Badal, T., Mishra, V. K., Sinha, G. R., Bhardwaj, H., & Raj, M. (2022). Yoga Pose Estimation and Feedback Generation Using Deep Learning. *Computational Intelligence and Neuroscience, 2022*, Article ID 4311350
- Raza, A., Qadri, A. M., Akhtar, I., Samee, N. A., & Abdulhafith, M. A. (2023). LogRF: An Approach to Human Pose Estimation Using Skeleton Landmarks for Physiotherapy Fitness Exercise Correction. *IEEE Access, 10*, 3320144.
- Li, J., Wang, C., Zhu, H., Mao, Y., Fang, H.-S., & Lu, C. (2019). CrowdPose: Efficient Crowded Scenes Pose Estimation and A New Benchmark. *arXiv preprint arXiv:1812.00324v2* [cs.CV]
Yoga pose estimation is a challenging task in computer vision due to the variability and complexity of yoga poses.
In this study, we propose a novel approach for yoga pose estimation using neural networks and YOLOv8, a state-of-the-art
object detection model. Our method accurately detects and classifies yoga poses in images or videos, providing valuable
feedback for practitioners to improve their yoga practice. Initially, we preprocess the input data to enhance quality and
reduce noise. YOLOv8 is then utilized to detect and localize key points corresponding to different body joints in the images.
Subsequently, neural network architecture classifies the detected poses into various yoga poses. To ensure robustness and
generalization capability, we train our model on a large dataset of annotated yoga pose images. We evaluate the performance
using standard metrics such as accuracy, precision, recall, and F1-score. Experimental results demonstrate the effectiveness
and accuracy of our proposed approach, achieving competitive performance compared to existing methods. This study
advances the field of yoga pose estimation, offering a promising solution for practitioners, instructors, and researchers
interested in leveraging computer vision techniques to enhance yoga practice.
Keywords :
Yoga, Pose Estimation, Neural Networks, Tradition, Technology.