Development of Smart Assessment System For Evaluating Maritime English Competence Using Machine Learning

Aprizawati Aprizawati; Romadhoni Romadhoni; Budhisantoso Budhisantoso

doi:10.35445/alishlah.v18i1.9624

Development of Smart Assessment System For Evaluating Maritime English Competence Using Machine Learning

Aprizawati Aprizawati, Romadhoni Romadhoni, Budhisantoso Budhisantoso

Abstract

Maritime English proficiency assessment is essential for cadets and maritime professionals, yet manual scoring can be time-consuming and prone to inter-rater variability. This study proposes a web-based smart assessment system that integrates machine learning to classify Maritime English proficiency into Beginner/Intermediate/Advanced using four feature scores: listening, reading, writing, and speaking. The dataset used in this work is simulated (1,000 records) for proof-of-concept evaluation because access to large, standardized real examination data was limited and required institutional clearance; simulation enables controlled class balance and repeatable experimentation. Class labels are generated using rubric-based threshold rules, and the labeling scheme is validated by two Maritime English examiners who review the thresholds and independently rate a random subset of 200 samples; agreement is quantified using Cohen’s kappa (κ) to ensure reliability. We adopt an 80/20 hold-out split and apply stratified 5-fold cross-validation on the training set for model selection, using grid-search hyperparameter tuning. We compare Support Vector Machine (SVM) and Random Forest and report accuracy, precision, recall, macro-F1, and brief per-class performance for Beginner/Intermediate/Advanced. SVM achieves 92% accuracy with macro-F1 = 0.905, outperforming Random Forest (89%, macro-F1 = 0.875). Future work will validate the system using real assessment datasets in operational training settings.

Keywords

competency evaluation, intelligent assessment systems, machine learning, maritime education, maritime english

Full Text:

PDF

References

Alaa, M., & Izaz. (2022). Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models. PeerJ Computer Science, 8, e803. https://doi.org/10.7717/peerj-cs.803

Boedojo Wiwoho Soetatmoko Jogo & Rosmayana Rosmayana. (2025). Enhancing maritime vocational education: Integrating sustainability, employability, and career pathways. Jurnal Kajian dan Penelitian Umum, 3(1), 20–39. https://doi.org/10.47861/jkpu-nalanda.v3i1.1495

Bolbot, V., Methlouthi, O., Chaal, M., Valdez Banda, O., BahooToroody, A., Tsetkova, A., Hellström, M., Saarni, J., Virtanen, S., Owen, D., Du, L., & Basnet, S. (2022). Identification and analysis of educational needs for naval architects and marine engineers in relation to the foreseen context of Maritime Autonomous Surface Ships (MASS) (Report). Aalto University School of Engineering. URN:NBN:fi-fe2022081153856.

Chintalapudi, N. (2023). Machine learning algorithms for improving the health care of seafarers through medical text classification and predicting the onsite occurrence of diseases [Doctoral dissertation, Università di Camerino]. Università di Camerino Institutional Repository. https://pubblicazioni.unicam.it/handle/11581/483683

Chukwura, J. C. (2023). A comparative study of several classification metrics and their performances on data. World Journal of Advanced Engineering Technology and Sciences, 8(1), 308–314. https://doi.org/10.30574/wjaets.2023.8.1.0054

Lam, H., Bertini, E., Isenberg, P., Plaisant, C., & Carpendale, S. (2012). Empirical studies in information visualization: Seven scenarios. IEEE Transactions on Visualization and Computer Graphics, 18(9), 1520–1536. https://doi.org/10.1109/TVCG.2011.279

Ervin, Yuk., Huang, Y. F., Ng, J. L., AlDahoul, N., Ahmed, A. N., & Elshafie, A. (2022). An evaluation of various data pre-processing techniques with machine learning models for water level prediction. Natural Hazards, 110, 121–153. https://doi.org/10.1007/s11069-021-04939-8

Fan, X. (2023). Accelerated English teaching methods: The role of digital technology. Journal of Psycholinguistic Research, 52(5), 1545–1558. https://doi.org/10.1007/s10936-023-09961-4

Frolova, O. O. (2020). Integrating standard marine communication phrases into Maritime English course. Pedagogy of the Formation of a Creative Personality in Higher and Secondary Education, 68(2), 212–215. https://doi.org/10.32840/1992-5786.2020.68-2.42

Gyorgy, S. (2024). Overfitting, underfitting and general model overconfidence and under-performance pitfalls and best practices in machine learning and AI. In Artificial Intelligence and Machine Learning in Health Care and Medical Sciences: Best Practices and Pitfalls (pp. 477–524). Springer. https://doi.org/10.1007/978-3-031-39355-6_10

Hadeel, & Mohammed. (2020). (2020). Investigating the effectiveness of flipped learning on enhancing students’ English language skills. English Review: Journal of English Education, 9(1), 193–204. https://doi.org/10.25134/erjee.v9i1.3799

Iie, Samsul, Retno, & Mukhammad. (2025). Management of maritime education in practical learning on training ships: Case study of cadets in navigation practice. Journal of Innovation in Educational and Cultural Research, 6(2), 262–266. https://doi.org/10.46843/jiecr.v6i2.1990

Panda, J. P. (2022). Machine learning for naval architecture, ocean and marine engineering. Journal of Marine Science and Technology, 28(1), 1–26. https://doi.org/10.1007/s00773-022-00914-5

Kulikova, I. (2024). Introduction of international standards for teaching maritime English as a guarantee of safety at sea. Innovate Pedagogy, 67(1), 45–52. https://doi.org/10.32782/2663-6085/2023/67.1.4

Marcin, Bogdan, Edwin, Marlena, Ahmed, Anna, . . . Jordi. (2021). Comparison of support vector machines and random forests for Corine land cover mapping. Remote Sensing, 13(4), 777. https://doi.org/10.3390/rs13040777

Margareta, & Samrat. (2021). Learning and learning-to-learn by doing: An experiential learning approach for integrating human factors into maritime design education. Retrieved from https://so04.tci-thaijo.org/index.php/MTR/article/view/241912

Marudut, Zainal, & Sintowati. (2024. Enhancing global maritime education: A qualitative exploration of post-internship perspectives and preparedness among cadets. Journal of Education and Learning (EduLearn), 18(4), 1134–1146. https://doi.org/10.11591/edulearn.v18i4.2171

Michał. (2025). Can chatgpt replace the teacher in assessment? a review of research on the use of large language models in grading and providing feedback. Retrieved from https://www.preprints.org/manuscript/202509.1233/download/final_file

Michał. (2025). Can ChatGPT replace the teacher in assessment? A review of research on the use of large language models in grading and providing feedback (Preprint). Preprints.org. https://doi.org/10.20944/preprints202509.1233.v1

Muhammed, Nur, & Filiz (2023). Comparison between random forest and support vector machine algorithms for LULC classification. International Journal of Engineering and Geosciences, 8(1), 1–10. https://doi.org/10.26833/ijeg.987605

R., & Pargaulan. (2025). Enhancing Technical Competency in Naval Engine Systems through Industry-Based Learning in Maritime Vocational Education. Retrieved from https://www.journal.yp3a.org/index.php/diajar/article/view/4255

Rung-Ching. (2019). Random forest and support vector machine on features selection for regression analysis. International Journal of Innovative Computing, Information and Control, 15(6), 2027–2037. https://doi.org/10.24507/ijicic.15.06.2027

Salman, A., & Nazir, S. (2021). Assessing the technology self-efficacy of maritime instructors: An explorative study. Education Sciences, 11(7), 342. https://doi.org/10.3390/educsci11070342

Samer, Ma, Norhazwani, random, & neural. (2024). Unveiling the tapestry of machine learning: A comparative analysis of support vector machines, random forests, and neural networks in diverse applications. Tuijin Jishu/Journal of Propulsion Technology, 45(3), Article 7303. https://doi.org/10.52783/tjjpt.v45.i03.7303

Santiago, E., Echeverría, J. C., Hernández, J., & Aguilar, M. (2014). Application of random forests methods to diabetic retinopathy classification analyses. PLOS ONE, 9(5), e98587. https://doi.org/10.1371/journal.pone.0098587

Sharma, A. (2023). Potential of technology supported competence development for Maritime Education and Training (Doctoral dissertation). https://doi.org/10.13140/RG.2.2.14565.17124

Songcan, & Qiang. (2011). Structural regularized support vector machine: A framework for structural large margin classifier. IEEE Transactions on Neural Networks, 22(4), 573–587. https://doi.org/10.1109/TNN.2011.2108315

Tutie. (2023). Analyzing of using educational technology to improve the quality and equity of learning outcomes at Politeknik Maritim Negeri. Jurnal IQRA’: Kajian Ilmu Pendidikan, 8(1), 100–116.

Vaishali & Rupa (2011). Impact of outlier removal and normalization approach in modified k-means clustering algorithm. International Journal of Computer Science Issues (IJCSI), 8(5), 331–336

Yisi. (2025). AI meets maritime training: Precision analytics for enhanced safety and performance (arXiv Preprint). arXiv. https://doi.org/10.48550/arXiv.2507.01274

Yong, & algorithm. (2020). Machine learning for enterprises: Applications, algorithm selection, and challenges. Retrieved from https://www.sciencedirect.com/science/article/pii/S0007681319301521

Yong, & algorithm.. (2020). Machine learning for enterprises: Applications, algorithm selection, and challenges. Business Horizons, 63(2), 157–170. https://doi.org/10.1016/j.bushor.2019.10.005

Zaini, Z. (2024). From Simulators to Screens: A Critical Review of Online Distance Education in Maritime Education and Training. ALAM Journal of Maritime Studies, 5(1), 52–61. Retrieved from https://ajms.alam.edu.my/index.php/ajms/article/view/37

DOI: https://doi.org/10.35445/alishlah.v18i1.9624