Authors :
Gadi Haritha Rani; Mandapalli Rafath Kumar; Balam Mounica
Volume/Issue :
Volume 10 - 2025, Issue 2 - February
Google Scholar :
https://tinyurl.com/yck4euax
Scribd :
https://tinyurl.com/y4h898hd
DOI :
https://doi.org/10.5281/zenodo.14964324
Abstract :
Autonomous systems, including self-driving vehicles and robotic navigation, rely heavily on accurate 3D object
detection for safe and efficient operation. Traditional vision-based approaches often struggle in low-light or adverse weather
conditions, necessitating the integration of LiDAR and depth sensing technologies. This paper explores the latest
advancements in AI-driven 3D object detection, leveraging deep learning models such as PointNet, VoxelNet, and
Transformer-based architectures. We discuss the role of sensor fusion techniques, where LiDAR and depth cameras
complement RGB data for enhanced perception. Additionally, we analyze challenges in real-time processing, occlusion
handling, and domain adaptation, while highlighting recent breakthroughs in self-supervised learning and few-shot learning
for 3D detection. Experimental results demonstrate the effectiveness of AI-powered models in improving detection accuracy,
robustness, and computational efficiency. This study provides a comprehensive overview of AI's role in enhancing
perception and decision-making for next-generation autonomous systems.
Keywords :
3D Object Detection, LiDAR(Light Detection and Ranging), Depth Sensing, PointNet, VoxelNet, and Transformer- Based Architectures.
References :
- Qi, C. R., Su, H., Mo, K., & Guibas, L. J. (2017). PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).DOI: 10.1109/CVPR.2017.16
- Qi, C. R., Yi, L., Su, H., & Guibas, L. J. (2017). PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. Advances in Neural Information Processing Systems (NeurIPS).DOI: 10.48550/arXiv.1706.02413
- Zhou, Y., &Tuzel, O. (2018). VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).DOI: 10.1109/CVPR.2018.00474
- Shi, S., Wang, X., & Li, H. (2019). PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).DOI: 10.1109/CVPR.2019.01140
- Qi, C. R., Liu, W., Wu, C., Su, H., & Guibas, L. J. (2018). Frustum PointNets for 3D Object Detection from RGB-D Data. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).DOI: 10.1109/CVPR.2018.00273
- Chen, X., Ma, H., Wan, J., Li, B., & Xia, T. (2017). Multi-View 3D Object Detection Network for Autonomous Driving. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).DOI: 10.1109/CVPR.2017.208
- Ku, J., Mozifian, M., Lee, J., Harakeh, A., &Waslander, S. L. (2018). Joint 3D Proposal Generation and Object Detection from View Aggregation. International Conference on Intelligent Robots and Systems (IROS).DOI: 10.1109/IROS.2018.8593945
- Misra, I., Liao, Y., Sokolic, J., &Girshick, R. (2021).An End-to-End Transformer Model for 3D Object Detection. International Conference on Computer Vision (ICCV).DOI: 10.48550/arXiv.2109.08141
Autonomous systems, including self-driving vehicles and robotic navigation, rely heavily on accurate 3D object
detection for safe and efficient operation. Traditional vision-based approaches often struggle in low-light or adverse weather
conditions, necessitating the integration of LiDAR and depth sensing technologies. This paper explores the latest
advancements in AI-driven 3D object detection, leveraging deep learning models such as PointNet, VoxelNet, and
Transformer-based architectures. We discuss the role of sensor fusion techniques, where LiDAR and depth cameras
complement RGB data for enhanced perception. Additionally, we analyze challenges in real-time processing, occlusion
handling, and domain adaptation, while highlighting recent breakthroughs in self-supervised learning and few-shot learning
for 3D detection. Experimental results demonstrate the effectiveness of AI-powered models in improving detection accuracy,
robustness, and computational efficiency. This study provides a comprehensive overview of AI's role in enhancing
perception and decision-making for next-generation autonomous systems.
Keywords :
3D Object Detection, LiDAR(Light Detection and Ranging), Depth Sensing, PointNet, VoxelNet, and Transformer- Based Architectures.