A systematic review of large language models for automation in civil engineering applications challenges and future directions| International Journal of Innovative Science and Research Technology

A Systematic Review of Large Language Models for Automation in Civil Engineering: Applications, Challenges, and Future Directions

Authors : Abhay Kumar; Pawan Kumar; Abhishek Kumar Jha; Akansha Jaiswal; Mohd Zia Hussain; Faiz Akram

Volume/Issue : Volume 11 - 2026, Issue 4 - April

Google Scholar : https://tinyurl.com/wtu68z25

Scribd : https://tinyurl.com/4k37c267

DOI : https://doi.org/10.38124/ijisrt/26apr455

PlumX Metrics

Semantic Scholar

ResearchGate

Note : A published paper may take 4-5 working days from the publication date to appear in PlumX Metrics, Semantic Scholar, and ResearchGate.

Abstract : The swift progress of large language models (LLMs) has generated substantial interest in their capacity to revolutionize automation in diverse fields, such as civil engineering. Although large language models have shown impressive abilities in processing natural language and automating tasks, their potential in civil engineering has not been thoroughly investigated, with research remaining scattered and lacking comprehensive consolidation. This paper presents a thorough systematic review to chart the existing scope of LLM-based automation in civil engineering, with the aim of uncovering primary applications, obstacles, and prospective research avenues. We analyze existing studies across multiple dimensions, such as civil and structural engineering, industrial automation, traffic management, education, scientific research, and software development, then critically evaluate the methodological approaches and practical implementations reported in the literature. The review indicates LLMs hold potential for automating design optimization, construction planning, and decision-making processes, but struggle with issues such as gaps in domain-specific knowledge, poor data quality, and safety risks. Moreover, we pinpoint developing tendencies, such as the merging of LLMs with digital twins and building information modeling (BIM), which may transform automation in the domain. The findings highlight the need for robust evaluation frameworks and interdisciplinary collaboration to address technical and ethical barriers. This review consolidates these insights, establishing a basis for subsequent investigations and the actual implementation of LLMs in civil engineering automation.

Keywords : Large Language Models, Industrial Automation, Design Optimization, Decision Making, Civil Engineering.

References :

H. Khairulzaman and F. Usman, “Automation in civil engineering design in assessing building energy efficiency,” Unable to determine the complete publication venue, 2018.
M. Hadi, R. Qureshi, A. Shah, M. Irfan, A. Zafar, et al., “A survey on large language models: Applications, challenges, limitations, and practical usage,” Authorea, 2023.
K. Du et al., “LLM-MANUF: An integrated framework of fine-tuning large language models for intelligent decision-making in manufacturing,” Advanced Engineering Informatics, 2025.
S. Qin et al., “Intelligent design and optimization system for shear wall structures based on large language models and generative artificial intelligence,” Journal of Building Engineering, 2024.
J. Saad-Falcon, O. Khattab, K. Santhanam, et al., “UDAPDR: Unsupervised domain adaptation via LLM prompting and distillation of rerankers,” in Proceedings of the 2023 conference on empirical methods in natural language processing, 2023.
G. Lee, S. Jang, and S. Hyun, “A generalized LLM-augmented BIM framework: Application to a speech-to-BIM system,” arXiv preprint arXiv:2409.18345, 2024.
Y. Hu, Y. Goktas, D. Yellamati, et al., “The use and misuse of pre-trained generative large language models in reliability engineering,” in 2024 annual reliability and maintainability symposium, 2024.
Y. Xiong, J. Wang, B. Li, Y. Zhu, and Y. Zhao, “Self-organizing agent network for llm-based workflow automation,” arXiv preprint arXiv:2508.13732, 2025.
R. Agbareia, M. Omar, O. Zloto, N. Chandala, T. Tai, et al., “The role of prompt engineering for multimodal LLM glaucoma diagnosis,” medRxiv, 2024.
M. Page, J. McKenzie, P. Bossuyt, et al., “The PRISMA 2020 statement: An updated guideline for reporting systematic reviews,” BMJ, vol. 372, p. n71, 2021.
C. Du, S. Esser, S. Nousias, et al., “Text2BIM: Generating building models using a large language model-based multiagent framework,” Journal of Computing in Civil Engineering, 2026.
J. Chen and Y. Bao, “A multi-agent large language model (llm) framework for code-complying design automation of concrete structures,” Available at SSRN 5193679, 2025.
S. Jang and G. Lee, “Interactive design by integrating a large pre-trained language model and building information modeling,” Computing in civil engineering 2023, 2023.
J. Chen and Y. Bao, “Artificial intelligence copilot for automated design of buildings using knowledge graphs and numerical models,” Available at SSRN 6330100, 6330.
S. Youwai, D. Phim, V. Murcia, and R. Onas, “Large language model-based multi-agent systems for automated foundation design: Router-driven task classification and expert selection framework,” AI in Civil Engineering, 2026.
S. Youwai, D. Phim, V. Murcia, and R. Onas, “Investigating the potential of large language model-based router multi-agent architectures for foundation design automation: A task classification and expert …,” arXiv preprint arXiv:2506.13811, 2025.
H. Liang, M. Kalaleh, and Q. Mei, “Integrating large language models for automated structural analysis,” arXiv preprint arXiv:2504.09754, 2025.
J. Liu, Z. Geng, R. Cao, L. Cheng, P. Bocchini, et al., “A large language model-empowered agent for reliable and robust structural analysis,” Unable to determine the complete publication venue, 2026.
P. Parsafard, O. Elezaj, D. Ekundayo, et al., “Automation in construction cost budgeting using generative artificial intelligence,” in Proceedings of the 2024 dubai conference, 2024.
A. Singh, A. Pal, and S. Hsieh, “A two-phase AI-driven approach to automated construction planning using small language models for activity sequencing and missing task prediction,” Unable to determine the complete publication venue, 2025.
D. Durmus, S. Isaac, A. Carbonari, et al., “Knowledge-based systems in the era of large language models: A case study in fire safety management,” in International symposium on automation and robotics in construction (isarc), 2025.
D. Durmus, A. Giretti, O. Ashkenazi, et al., “The role of large language models for decision support in fire safety planning,” Unable to determine the complete publication venue, 2024.
S. Fuchs, M. Witbrock, J. Dimyadi, and R. Amor, “Using large language models for the interpretation of building regulations,” arXiv preprint arXiv:2407.21060, 2024.
J. Zhou and Z. Ma, “Named entity recognition for construction documents based on fine-tuning of large language models with low-quality datasets,” Automation in Construction, 2025.
D. Liu, X. Zhou, and Y. Li, “Enhancing natural language retrieval of BIM data through integration of large language models with multi-agent systems,” in Proceedings of CAADRIA, 2025.
Q. Chen and X. Yin, “Tailored vision-language framework for automated hazard identification and report generation in construction sites,” Advanced Engineering Informatics, 2025.
X. Wang, Q. Yue, and X. Liu, “Crack image classification and information extraction in steel bridges using multimodal large language models,” Automation in Construction, 2025.
S. Park, C. Menassa, and V. Kamat, “Integrating large language models with multimodal virtual reality interfaces to support collaborative human–robot construction work,” Journal of Computing in Civil Engineering, 2025.
H. Liang, Y. Zhou, M. Kalaleh, and Q. Mei, “Automating structural engineering workflows with large language model agents,” arXiv preprint arXiv:2510.11004, 2025.
S. Jin, D. Kim, J. Lee, and D. Lee, “Language models as BIM interpreters: Unlocking IFC data for automation in construction informatics,” Available at SSRN 5563440, 5563.
P. Bazrafshan, K. Melag, and A. Ebrahimkhanlou, “Semantic and lexical analysis of pre-trained vision language artificial intelligence models for automated image descriptions in civil engineering,” AI in civil engineering, 2025.
A. Doris, D. Grandi, R. Tomich, et al., “Designqa: A multimodal benchmark for evaluating large language models’ understanding of engineering documentation,” Journal of Computing and Information Science in Engineering, 2025.
Y. Xia, N. Jazdi, J. Zhang, C. Shah, et al., “Control industrial automation system with large language model agents,” Unable to determine the complete publication venue, 2025.
J. Oyekan, C. Turner, M. Bax, and E. Graf, “Applying ontologies and knowledge augmented large language models to industrial automation: A decision-making guidance for achieving human-robot …,” arXiv preprint arXiv:2505.18553, 2025.
L. da Silva, A. Kocher, F. Gehlhoff, et al., “On the use of large language models to generate capability ontologies,” in Ieee international conference on emerging technologies and factory automation, 2024.
S. Katragadda, “Utilizing LLM models for advanced automation, manufacturing operations,” Journal of Mechanical, Civil and Industrial Engineering, 2026.
Y. Jadhav and A. B. Farimani, “Large language model agent as a mechanical designer,” Journal of Engineering Design, 2026.
W. Chen, C. Liu, W. Huang, J. Lyu, M. Yang, et al., “Analogtester: A large language model-based framework for automatic testbench generation in analog circuit design,” in IEEE/ACM international conference on computer aided design, 2025.
Y. Li et al., “Large language models for manufacturing,” arXiv preprint arXiv:2410.21418, 2024.
O. Zheng, M. Abdel-Aty, D. Wang, Z. Wang, et al., “ChatGPT is on the horizon: Could a large language model be suitable for intelligent traffic safety research and applications?” arXiv preprint arXiv:2303.05382, 2023.
S. Li, T. Azfar, and R. Ke, “Chatsumo: Large language model for automating traffic scenario generation in simulation of urban mobility,” IEEE Transactions on Intelligent Vehicles, 2024.
S. Masri, H. Ashqar, and M. Elhenawy, “Leveraging large language models (LLMs) for traffic management at urban intersections: The case of mixed traffic scenarios,” arXiv preprint arXiv:2408.00948, 2024.
H. Xu et al., “Genai-powered multi-agent paradigm for smart urban mobility: Opportunities and challenges for integrating large language models (llms) and retrieval-augmented …,” arXiv preprint arXiv:2409.00494, 2024.
Z. Zhou et al., “Human-centric reward optimization for reinforcement learning-based automated driving using large language models,” arXiv preprint arXiv:2405.04135, 2024.
Z. Peng, Y. Wang, X. Han, L. Zheng, and J. Ma, “Learningflow: Automated policy learning workflow for urban driving with large language models,” arXiv preprint arXiv:2501.05057, 2025.
X. Tian et al., “Drivevlm: The convergence of autonomous driving and large vision-language models,” arXiv preprint arXiv:2402.12289, 2024.
Y. Zhang and Y. Nie, “Interndrive: A multimodal large language model for autonomous driving scenario understanding,” in Proceedings of, 2024.
K. Jiang et al., “Koma: Knowledge-driven multi-agent framework for autonomous driving with large language models,” IEEE Transactions on Intelligent Transportation Systems, 2024.
P. Larrondo, J. Ortiz, and B. Frank, “Work-in-progress: Fine-tuning large language models for automated feedback in complex engineering problem-solving,” in Asee annual conference, 2024.
R. Gao, X. Guo, X. Li, A. Narayanan, N. Thomas, et al., “Towards scalable automated grading: Leveraging large language models for conceptual question evaluation in engineering,” arXiv preprint arXiv:2411.03659, 2024.
W. Morris, L. Holmes, J. Choi, and S. Crossley, “Automated scoring of constructed response items in math assessment using large language models,” International Journal of Artificial Intelligence in Education, 2025.
S. Rony, T. Fei, and S. Arsovski, “Educational justice. Reliability and consistency of large language models for automated essay scoring and its implications,” Unable to determine the complete publication venue, 2025.
S. Yeung, “A comparative study of rule-based, machine learning and large language model approaches in automated writing evaluation (AWE),” in Proceedings of the 15th international learning analytics and knowledge conference, 2025.
A. Mizumoto and M. Eguchi, “Exploring the potential of using an AI language model for automated essay scoring,” Research Methods in Applied Linguistics, 2023.
X. Niu and J. Zhang, “Enhancing automated text coding in online learning research: A systematic calibration framework for large language models,” IEEE Transactions on Learning Technologies, 2026.
M. Perkins, “Academic integrity considerations of AI large language models in the post-pandemic era: ChatGPT and beyond,” Journal of University Teaching and Learning Practice, 2023.
R. Hu, Y. Wu, T. Su, Y. Wang, S. Hu, and J. Huang, “Automated extraction of mechanical constitutive models from scientific literature using large language models: Applications in cultural heritage conservation,” arXiv preprint arXiv:2602.16551, 2026.
S. Bae, M. Jeon, and H. Moon, “Text mining in MOF research: From manual curation to large language model-based automation,” Chemical Communications, 2025.
Y. Zhang, S. Itani, K. Khanal, E. Okyere, G. Smith, et al., “Gptarticleextractor: An automated workflow for magnetic material database construction,” Journal of Magnetism and Magnetic Materials, 2024.
S. Jiang, D. Evans-Yamamoto, D. Bersenev, et al., “ProtoCode: Leveraging large language models (LLMs) for automated generation of machine-readable PCR protocols from scientific publications,” SLAS technology, 2024.
S. Leong, S. Pablo-García, Z. Zhang, et al., “Automated electrosynthesis reaction mining with multimodal large language models (MLLMs),” Chemical Science, 2024.
Y. Wu and F. Tang, “scExtract: Leveraging large language models for fully automated single-cell RNA-seq data annotation and prior-informed multi-dataset integration,” Genome Biology, 2025.
J. Pijanowski, Y. Mezgueldi, A. Lee, D. Moghanaki, et al., “Automated extraction of unstructured post-SBRT toxicity data from radiology reports using large language models,” arXiv preprint arXiv:2602.23492, 2026.
B. Romera-Paredes, M. Barekatain, A. Novikov, M. Balog, et al., “Mathematical discoveries from program search with large language models,” Nature, 2024.
D. Boiko, R. MacKnight, B. Kline, and G. Gomes, “Autonomous chemical research with large language models,” Nature, 2023.
G. Li, C. Zhi, J. Chen, J. Han, and S. Deng, “Exploring parameter-efficient fine-tuning of large language model on automated program repair,” in Proceedings of the 39th ieee/acm international conference on automated software engineering, 2024.
Y. Charalambous, E. Manino, and L. Cordeiro, “Automated repair of AI code with large language models and formal verification,” arXiv preprint arXiv:2405.08848, 2024.
H. Pearce, B. Tan, B. Ahmad, R. Karri, et al., “Examining zero-shot vulnerability repair with large language models,” in IEEE symposium on security and privacy, 2023.
Z. Gou, Z. Shao, Y. Gong, Y. Shen, Y. Yang, et al., “Critic: Large language models can self-correct with tool-interactive critiquing,” arXiv preprint arXiv:2305.11738, 2023.
E. Nijkamp, B. Pang, H. Hayashi, L. Tu, H. Wang, et al., “Codegen: An open large language model for code with multi-turn program synthesis,” arXiv preprint arXiv:2203.13474, 2022.
J. Yao, Z. Zhou, W. Chen, and W. Cui, “Leveraging large language models for automated proof synthesis in rust,” arXiv preprint arXiv:2311.03739, 2023.
R. Liu, M. Chen, L. Wu, J. Ke, and G. Li, “Enhancing automated loop invariant generation for complex programs with large language models,” Science of Computer Programming, 2025.
Y. Zhang, Z. Liu, Y. Feng, and B. Xu, “Leveraging large language model to assist detecting rust code comment inconsistency,” in Proceedings of the 39th ieee/acm international conference on automated software engineering, 2024.
M. Fazelnia, M. Mirakhorli, and H. Bagheri, “Translation titans, reasoning challenges: Satisfiability-aided language models for detecting conflicting requirements,” in Proceedings of the 39th ieee/acm international conference on automated software engineering, 2024.
M. Hasan, J. Li, I. Ahmed, and H. Bagheri, “Automated repair of declarative software specifications in the era of large language models,” arXiv preprint arXiv:2310.12425, 2023.
W. Sisomboon, J. Kaewyotha, and W. Songpan, “Automated software test case generation using directional partially weighted ensemble large language models with retrieval-augmented generation (RAG),” IEEE Access, 2026.
N. Jain et al., “Livecodebench: Holistic and contamination free evaluation of large language models for code,” arXiv preprint arXiv:2403.07974, 2024.
M. Du, L. Tuan, B. Ji, Q. Liu, et al., “Mercury: A code efficiency benchmark for code large language models,” in Advances in neural information processing systems, 2024.
S. Kawano, H. Nonaka, and K. Yoshino, “Claimbrush: A novel framework for automated patent claim refinement based on large language models,” in 2024 IEEE international conference on artificial intelligence and knowledge engineering, 2024.
X. Yuan, J. Li, D. Wang, Y. Chen, X. Mao, et al., “S-eval: Towards automated and comprehensive safety evaluation for large language models,” Proceedings of the ACM SIGSAC Conference on Computer and Communications Security, 2025.
I. Vykopal, M. Pikuliak, I. Srba, R. Moro, et al., “Disinformation capabilities of large language models,” in Proceedings of the 62nd annual meeting of the association for computational linguistics, 2024.
Y. Huang et al., “Trustllm: Trustworthiness in large language models,” arXiv preprint arXiv:2401.05561, 2024.
A. Karbasi, O. Montasser, J. Sous, and G. Velegkas, “(Im) possibility of automated hallucination detection in large language models,” arXiv preprint arXiv:2504.17004, 2025.
E. Mugnier, E. Gonzalez, N. Polikarpova, et al., “Laurel: Unblocking automated verification with large language models,” in Proceedings of the 44th ACM SIGPLAN international conference on programming language design and implementation, 2025.
A. Zou, Z. Wang, N. Carlini, M. Nasr, J. Kolter, et al., “Universal and transferable adversarial attacks on aligned language models,” arXiv preprint arXiv:2307.15043, 2023.
Y. Liu, Y. Luo, X. Li, X. Dong, B. Gu, et al., “Evaluating large language models for time series anomaly detection in aerospace software,” Unable to determine the complete publication venue with the given information, 2025.
T. Zhang, F. Ladhak, E. Durmus, P. Liang, et al., “Benchmarking large language models for news summarization,” Transactions of the Association for Computational Linguistics, 2024.
J. Zhou, T. Lu, S. Mishra, S. Brahma, S. Basu, et al., “Instruction-following evaluation for large language models,” arXiv preprint arXiv:2311.07911, 2023.
T. Cui et al., “Risk taxonomy, mitigation, and assessment benchmarks of large language model systems,” arXiv preprint arXiv:2401.05778, 2024.
Q. Yu et al., “Xfinder: Large language models as automated evaluators for reliable evaluation,” arXiv preprint arXiv:2405.11874, 2024.

The swift progress of large language models (LLMs) has generated substantial interest in their capacity to revolutionize automation in diverse fields, such as civil engineering. Although large language models have shown impressive abilities in processing natural language and automating tasks, their potential in civil engineering has not been thoroughly investigated, with research remaining scattered and lacking comprehensive consolidation. This paper presents a thorough systematic review to chart the existing scope of LLM-based automation in civil engineering, with the aim of uncovering primary applications, obstacles, and prospective research avenues. We analyze existing studies across multiple dimensions, such as civil and structural engineering, industrial automation, traffic management, education, scientific research, and software development, then critically evaluate the methodological approaches and practical implementations reported in the literature. The review indicates LLMs hold potential for automating design optimization, construction planning, and decision-making processes, but struggle with issues such as gaps in domain-specific knowledge, poor data quality, and safety risks. Moreover, we pinpoint developing tendencies, such as the merging of LLMs with digital twins and building information modeling (BIM), which may transform automation in the domain. The findings highlight the need for robust evaluation frameworks and interdisciplinary collaboration to address technical and ethical barriers. This review consolidates these insights, establishing a basis for subsequent investigations and the actual implementation of LLMs in civil engineering automation.

Keywords : Large Language Models, Industrial Automation, Design Optimization, Decision Making, Civil Engineering.

Paper Submission Last Date
31 - July - 2026

SUBMIT YOUR PAPER CALL FOR PAPERS

Video Explanation for Published paper

Never miss an update from Papermashup

Get notified about the latest tutorials and downloads.

Subscribe by Email

Get alerts directly into your inbox after each post and stay updated.

Subscribe by RSS

Add our RSS to your feedreader to get regular updates from us.