Optimizing Artificial Intelligence Chatbots: A Study on the Overfitting Pitfalls in Fine-Tuning Large Language Models for Specialized Tasks

Mohd Firdauz Norhadi; Mazlina Abdul Majid; Syaifulradzman Shaifuddin

Optimizing Artificial Intelligence Chatbots: A Study on the Overfitting Pitfalls in Fine-Tuning Large Language Models for Specialized Tasks

Authors

Mohd Firdauz Norhadi Politeknik Muadzam Shah Author
Mazlina Abdul Majid Politeknik Muadzam Shah Author
Syaifulradzman Shaifuddin Politeknik Muadzam Shah Author

Keywords:

Large Language Models, Artificial Intelligence, Chatbots, Fine-Tuning, Overfitting

Abstract

This paper presented an empirical investigation into the hyperparameter optimization of the Meta LLaMA 3.2 3B Instruct model, conducted during the POLYCC LLM League 2025 competition. The study utilized a parameter-efficient fine-tuning approach via AWS SageMaker Jumpstart to adapt the artificial intelligence for specialized conversational tasks. The research demonstrated a critical disconnect between traditional computer science metrics specifically training and evaluation loss and the actual conversational success rate, measured as Win Rate (WR). Experimental data revealed that minimizing training loss to near-zero on small datasets (under 500 examples) induced catastrophic overfitting, severely degrading the chatbot's real-world performance to as low as 10%. The optimal configuration was identified at a moderate dataset scale of 1000 examples trained for 20 epochs, achieving a peak Win Rate of 36%.

Downloads

Download data is not yet available.

Downloads

Published

04.06.2026

Issue

Vol. 6 No. 1 (2026): Special Issue on Large Language Model (LLM) POLYCC League

Section

Articles

How to Cite

Optimizing Artificial Intelligence Chatbots: A Study on the Overfitting Pitfalls in Fine-Tuning Large Language Models for Specialized Tasks. (2026). Journal of STEM and Education, 6(1), 57-61. https://journalstem.net/ojs/index.php/pkb/article/view/142

Download Citation

Optimizing Artificial Intelligence Chatbots: A Study on the Overfitting Pitfalls in Fine-Tuning Large Language Models for Specialized Tasks

Authors

Keywords:

Abstract

Downloads

Downloads

Published

Issue

Section

Categories

How to Cite

ISSN

Indexing

Visitor