Optimizing a Contextual Chatbot for the POLYCC: A Competition-Based LLM Fine-Tuning Experience

Authors

  • Siti Zaleha Ibrahim Politeknik Kota Bharu Author
  • Norsuzila Shafie Politeknik Kota Bharu Author
  • Wan Siti Rodziah Mohd Nasir Politeknik Kota Bharu Author

Keywords:

Large Language Model, Fine-Tuning, LoRA, POLYCC, Competition-Based Evaluation

Abstract

AbstractThis paper reports a competition-based experience in fine-tuning a large language model (LLM) for the Malaysian Polytechnic and Community College (POLYCC) domain using the SynLoRA-SGS methodology. Participating in the student category of the POLYCC LLM League 2025, our team fine-tuned a Meta-Llama-3-8B-Instruct model through bilingual synthetic data generation and Low-Rank Adaptation (LoRA) hyperparameter optimization on Amazon SageMaker. Despite qualifying in last position (6th of 6 finalists) during the automated leaderboard stage, the team achieved second place overall in the final multi-source evaluation with a grand total of 149 points, including the highest spectator score among all finalists. This result demonstrates that focused dataset refinement between competition stages can produce substantial performance gains, and that a systematic fine-tuning approach can overcome initial ranking disadvantages when evaluated through holistic multi-source assessment.

 

Downloads

Download data is not yet available.

Downloads

Published

04.06.2026

How to Cite

Optimizing a Contextual Chatbot for the POLYCC: A Competition-Based LLM Fine-Tuning Experience. (2026). Journal of STEM and Education, 6(1), 49-52. https://journalstem.net/ojs/index.php/pkb/article/view/134