Authors :
Fatemeh Kouhestani; Milad Hadizadeh Masali
Volume/Issue :
Volume 11 - 2026, Issue 3 - March
Google Scholar :
https://tinyurl.com/ms4rnvac
Scribd :
https://tinyurl.com/2ndp7vj9
DOI :
https://doi.org/10.38124/ijisrt/26mar613
Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.
Abstract :
Artificial intelligence has increasingly been integrated into medical diagnosis, particularly in clinical settings
characterized by high patient volume, limited access to specialists, and time-sensitive decision-making. From an engineering
and system-oriented perspective, medical diagnosis can be conceptualized as a complex monitoring and fault-detection
problem in which heterogeneous clinical signals are processed through sensor-like inference mechanisms to identify
abnormal system states. Recent advances in deep learning architecture have enabled artificial intelligence systems to extract
hierarchical features from high-dimensional clinical data and perform data-driven diagnostic inference. This study
examines the role of artificial intelligence in modern medical diagnosis by comparing deep learning–based machine
reasoning with human clinical judgment, emphasizing system-level performance rather than isolated predictive accuracy.
The analysis is grounded in empirical evidence from recent comparative studies conducted in real-world clinical
environments, including primary care, emergency medicine, and telemedicine, primarily within healthcare systems of
developed countries.
The results demonstrate that deep learning–driven diagnostic systems perform effectively in structured diagnostic
scenarios involving standardized inputs and repetitive pattern recognition, achieving performance comparable to that of
non-expert clinicians. These strengths reflect characteristics commonly associated with automated monitoring, sensor-level
inference, and anomaly detection systems, including rapid signal processing, consistent decision outputs, and reduced
diagnostic variability. However, deep learning–based models exhibit clear limitations in complex clinical conditions that
require contextual interpretation, integration of incomplete or uncertain data, and adaptive reasoning under rare or
unforeseen scenarios. In such cases, expert clinicians continue to demonstrate superior performance. Additional challenges
include bias in training data, limited generalizability across diverse populations, and system-level concerns related to safety,
accountability, interpretability, and operational trust. Collectively, these findings indicate that artificial intelligence is most
effective when deployed as a supervised deep learning–based diagnostic subsystem within a broader human-centered
decision framework, where human oversight ensures robustness, resilience, and safe operation in real-world clinical
practice.
Keywords :
Artificial intelligence, medical diagnosis, deep learning, sensor-based diagnostics, anomaly detection, human–AI collaboration, system-level analysis.
References :
- Takita, H., Yamashita, R., Okuno, T., Matsuo, K., & Kido, S. (2025). A systematic review and meta-analysis of diagnostic performance of generative artificial intelligence models in medicine. NPJ Digital Medicine, 8, 43. https://doi.org/10.1038/s41746-025-01543-z
- Hayat, H., Bhatti, U. A., Khan, M. A., & Nam, Y. (2025). Toward the autonomous AI doctor: Quantitative benchmarking of diagnostic performance against clinicians. arXiv preprint. https://arxiv.org/abs/2507.22902
- Shan, G., Zhang, Y., Li, Q., & Wang, J. (2025). Comparing diagnostic accuracy of clinical professionals and large language models: A systematic review. JMIR Medical Informatics, 13, e64963. https://doi.org/10.2196/64963
- Al-Obeidat, F., Al-Shorman, A., & Al-Zoubi, M. (2024). Diagnostic performance of artificial intelligence-based models versus clinicians in hepatocellular carcinoma detection. BMC Medical Imaging, 24, 198. https://doi.org/10.1186/s12880-024-01298-3
- Topol, E. J. (2019). High-performance medicine: The convergence of human and artificial intelligence. Nature Medicine, 25(1), 44–56. https://doi.org/10.1038/s41591-018-0300-7
- Rajpurkar, P., Chen, E., Banerjee, O., & Topol, E. J. (2022). AI in health and medicine. Nature Medicine, 28(1), 31–38. https://doi.org/10.1038/s41591-021-01614-0
- Kouhestani, F. (2025). Advancing skin cancer diagnostics with human-AI synergy: A review. International Journal of Advanced Research in Science Communication and Technology, 5(1), 566–575. https://doi.org/10.48175/IJARSCT-28467
- Kouhestani, F. (2025). The impact of climate change on biological systems and biodiversity. International Journal of Science and Research Archive, 14(1), 1885–1900. https://doi.org/10.30574/ijsra.2025.14.1.0320
- Liu, X., Faes, L., Kale, A. U., et al. (2019). A comparison of deep learning performance against health-care professionals in medical imaging. The Lancet Digital Health, 1(6), e271–e297. https://doi.org/10.1016/S2589-7500(19)30123-2
- McKinney, S. M., Sieniek, M., Godbole, V., et al. (2020). International evaluation of an AI system for breast cancer screening. Nature, 577(7788), 89–94. https://doi.org/10.1038/s41586-019-1799-6
- Kelly, C. J., Karthikesalingam, A., Suleyman, M., Corrado, G., & King, D. (2019). Key challenges for delivering clinical impact with artificial intelligence. BMC Medicine, 17, 195. https://doi.org/10.1186/s12916-019-1426-2
- Sendak, M. P., D’Arcy, J., Kashyap, S., et al. (2020). A path for translation of machine learning products into healthcare delivery. NPJ Digital Medicine, 3, 19. https://doi.org/10.1038/s41746-020-0226-9
- Nagendran, M., Chen, Y., Lovejoy, C. A., et al. (2020). Artificial intelligence versus clinicians: Systematic review of diagnostic accuracy. BMJ, 368, m689. https://doi.org/10.1136/bmj.m689
- Shortliffe, E. H., & Sepúlveda, M. J. (2018). Clinical decision support in the era of artificial intelligence. JAMA, 320(21), 2199–2200. https://doi.org/10.1001/jama.2018.17163
- Tschandl, P., Rinner, C., Apalla, Z., et al. (2020). Human–computer collaboration for skin cancer recognition. Nature Medicine, 26(8), 1229–1234. https://doi.org/10.1038/s41591-020-0942-0
- London, A. J. (2019). Artificial intelligence and black-box medical decisions. Hastings Center Report, 49(1), 15–21. https://doi.org/10.1002/hast.973
- Ghassemi, M., Oakden-Rayner, L., & Beam, A. L. (2021). The false hope of current approaches to explainable artificial intelligence in health care. The Lancet Digital Health, 3(11), e745–e750. https://doi.org/10.1016/S2589-7500(21)00208-9
- Yu, K. H., Beam, A. L., & Kohane, I. S. (2018). Artificial intelligence in healthcare. Nature Biomedical Engineering, 2(10), 719–731. https://doi.org/10.1038/s41551-018-0305-z
- Masali, M. H., & Zargarzadeh, H. (2024). Sensor fusion for incipient hydrogen detection using deep learning. International Journal of Research, 5(9).
- Masali, M. H., Mobtahej, M., & Zargarzadeh, H. (2024). Experimental gas leakage detection using integrated sensor arrays using deep learning. International Journal of Engineering Research and Applications, 14(06), 93–108. https://doi.org/10.9790/9622-140693108
Artificial intelligence has increasingly been integrated into medical diagnosis, particularly in clinical settings
characterized by high patient volume, limited access to specialists, and time-sensitive decision-making. From an engineering
and system-oriented perspective, medical diagnosis can be conceptualized as a complex monitoring and fault-detection
problem in which heterogeneous clinical signals are processed through sensor-like inference mechanisms to identify
abnormal system states. Recent advances in deep learning architecture have enabled artificial intelligence systems to extract
hierarchical features from high-dimensional clinical data and perform data-driven diagnostic inference. This study
examines the role of artificial intelligence in modern medical diagnosis by comparing deep learning–based machine
reasoning with human clinical judgment, emphasizing system-level performance rather than isolated predictive accuracy.
The analysis is grounded in empirical evidence from recent comparative studies conducted in real-world clinical
environments, including primary care, emergency medicine, and telemedicine, primarily within healthcare systems of
developed countries.
The results demonstrate that deep learning–driven diagnostic systems perform effectively in structured diagnostic
scenarios involving standardized inputs and repetitive pattern recognition, achieving performance comparable to that of
non-expert clinicians. These strengths reflect characteristics commonly associated with automated monitoring, sensor-level
inference, and anomaly detection systems, including rapid signal processing, consistent decision outputs, and reduced
diagnostic variability. However, deep learning–based models exhibit clear limitations in complex clinical conditions that
require contextual interpretation, integration of incomplete or uncertain data, and adaptive reasoning under rare or
unforeseen scenarios. In such cases, expert clinicians continue to demonstrate superior performance. Additional challenges
include bias in training data, limited generalizability across diverse populations, and system-level concerns related to safety,
accountability, interpretability, and operational trust. Collectively, these findings indicate that artificial intelligence is most
effective when deployed as a supervised deep learning–based diagnostic subsystem within a broader human-centered
decision framework, where human oversight ensures robustness, resilience, and safe operation in real-world clinical
practice.
Keywords :
Artificial intelligence, medical diagnosis, deep learning, sensor-based diagnostics, anomaly detection, human–AI collaboration, system-level analysis.