Multilingual Natural Language Prompts and Code Generation: A Study on Large Language Model Cross-Linguistic Performance

Madhuri Nakkella; B. Manikyala Rao; Neti Praveen; Ranjith Kumar Chinnam; Dhanshree Mukund Pande; Prabhakararao Kolli; Kumar Devapogu; M. V. Rajesh

doi:10.70917/ijcisim-2026-2677

Authors

Madhuri Nakkella Department of Information Technology, Shri Vishnu Engineering College for Women, Bhimavaram, Andhra Pradesh, India.
B. Manikyala Rao Department of Computer Science and Engineering, Aditya University, Surampalem, Andhra Pradesh, India.
Neti Praveen Department of Computer Science and Information Technology, S.R.K.R. Engineering College, Bhimavaram, Andhra Pradesh, India.
Ranjith Kumar Chinnam Department of Artificial Intelligence and Machine Learning (AI & ML), Aditya University, Surampalem, Andhra Pradesh, India.
Dhanshree Mukund Pande Department of Computer Science and Engineering (Cyber Security & Data Science) and Artificial Intelligence & Data Science, VNR Vignana Jyothi Institute of Engineering and Technology (VNR VJIET), Hyderabad, Telangana, India.
Prabhakararao Kolli Department of Artificial Intelligence and Machine Learning (AI & ML), Aditya University, Surampalem, Andhra Pradesh, India.
Kumar Devapogu Vignan's Foundation for Science, Technology and Research, Guntur, Andhra Pradesh, India.
M. V. Rajesh Department of Information Technology, Aditya University, Surampalem, Andhra Pradesh, India.

DOI:

https://doi.org/10.70917/ijcisim-2026-2677

Keywords:

Large Language Models, Multilingual Code Generation, Cross-Linguistic Performance, Language Efficiency Score (LES), Code Efficiency Index (CEI)

Abstract

Large Language Models (LLMs) have become the most powerful technique for code generation and have profound impact on contemporary software development process. Moreover, while existing GPT models perform well in translating natural language prompts into executable code, their behavior when prompted with input in multiple lan- guages has not been sufficiently studied till now, even though developers around the world speak and write in many different tongues. It is crucial to understand such a behavior in order to achieve fair AI-aided programming. Most prior work centers on English or limited bilingual studies, leaving uncertainty about how language influences code quality and efficiency.

In this paper, we examine GPT-4.5’s cross-lingual perfor- mance in Python code generation. Our multilingual bench- mark includes 30 algorithmic tasks across six computer science domains, tested in 30 languages. We measure execution time, efficiency and accuracy using Language Efficiency Score (LES) and Code Efficiency Index (CEI) and supported by clustering and correlation analysis .We have considered 900 samples, efficiency and execution stability showes a strong correlation (r > 0.92), while prompt length has little impact.The Natural Languages Tamil, Ukrainian, and Japanese yield the most efficient code, whereas English, Persian, and Mandarin produce d longer, slower scripts.
Our results proved that prompt language(Natural Language) matters in LLM code generation, emphasizing the importance of multilingual-aware prompt engineering for efficiency and robustness in real-world software development process.

Downloads

Download data is not yet available.

Multilingual Natural Language Prompts and Code Generation: A Study on Large Language Model Cross-Linguistic Performance

Authors

DOI:

Keywords:

Abstract

Downloads

Downloads

Published

How to Cite

Issue

Section

Information