Development of an intellectualized program system based on rag-architecture | Scientific journal of the Ternopil National Technical University

PDF

Published: 24-03-2026

DOI: https://doi.org/10.33108/visnyk_tntu2026.01.037

Keywords:

Artificial Intelligence, chatbot, dialogue system, Language Model, Large Language Model, generation, prompt, RAG, fine-tuning, accuracy, BERTScore

Daria Zavtrak

Ternopil Ivan Puluj National Technical University

Valentyn Khychan

Ternopil Ivan Puluj National Technical University

Yeva Dubanych

Ternopil Ivan Puluj National Technical University

Bohdan Dobrotvor

Ternopil Ivan Puluj National Technical University

Yaroslav Herasimchuk

Ternopil Ivan Puluj National Technical University

Borys Bessarab

Ternopil Ivan Puluj National Technical University

Artem Manziy

Ternopil Ivan Puluj National Technical University

Oleh Pastukh

Ternopil Ivan Puluj National Technical University

Abstract

The article presents an approach in the creation of an intellectualized dialogue system for automation of communication with university applicants during admission campaigns. Retrieval-Augmented Generation (RAG) architecture, which combines search of relevant information with generation of a response using a Large Language Model, is the basis of the development. Fine-tuning of the GPT-2 model with regulatory documentation of a higher educational facility was used to provide support for Ukrainian language. A hybrid mechanism of response generation, which combines fragments extracted from a knowledge base with segments generated by the language model, was proposed. An architecture, which ensures the relevance, accuracy and completeness of data, as well as lowering the workload on university employees, was realized. The quality of the system was rated using both objective (response speed, BERTScore, relevance) and subjective metrics (completeness, convenience, search flexibility), which allowed to record the efficiency of the approach. Presented instrument demonstrates the perspective of the usage of RAG approaches in applied tasks of the educational sphere, especially in support systems for admission committees.

Issue

Vol. 121 No. 1 (2026)

Section

Articles

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

References

[1] Three out of four people use AI at work. Microsoft. (2024). https://news.microsoft.com/annual-wti-2024/

[2] J. Swacha, M. Gracel, Retrieval-augmented generation (RAG) chatbots for education: A survey of applications, Applied Sciences. 15 (2025) 4234. https://doi.org/10.3390/app15084234.

[3] T.T. Nguyen, et al., NEU-chatbot: Chatbot for admission of National Economics University, Computers and Education: Artificial Intelligence. 2 (2021) 100036. https://doi.org/10.1016/j.caeai.2021.100036.

[4] N. Chidipothu, et al., Improving large language model (LLM) performance with retrieval-augmented generation (RAG): Development of a transparent generative AI university support system for educational purposes, Journal of Big Data and Artificial Intelligence. 3 (2025) 1. https://doi.org/10.54116/jbdai.v3i1.50.

[5] M.-T. Nguyen, et al., Building a chatbot for supporting the admission of universities, in: 2021 13th International Conference on Knowledge and Systems Engineering (KSE), Bangkok, Thailand, 2021. https://doi.org/10.1109/KSE53942.2021.9648677.

[6] L.S.T. Nguyen, T.T. Quan, URAG: Implementing a unified hybrid RAG for precise answers in university admission chatbots – A case study at HCMUT, Communications in Computer and Information Science. (2025) 82–93. https://doi.org/10.1007/978-981-96-4285-4_7.

[7] malteos, gpt2-uk, Hugging Face. https://huggingface.co/malteos/gpt2-uk.

[8] E. Latif, X. Zhai, Fine-tuning ChatGPT for automatic scoring, Computers and Education: Artificial Intelligence. 6 (2024) 100210. https://doi.org/10.1016/j.caeai.2024.100210.

[9] N. Ding, et al., Enhancing chat language models by scaling high-quality instructional conversations, in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, 2023. https://doi.org/10.18653/v1/2023.emnlp-main.183.

[10] C.W. Schmidt, et al., Tokenization is more than compression, in: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, USA, 2024, pp. 678–702. https://doi.org/10.18653/v1/2024.emnlp-main.40.

[11] D. Gyawali, Comparative analysis of CPU and GPU profiling for deep learning models, arXiv. (2023). https://arxiv.org/abs/2309.02521.

[12] Z. Ul Abideen, Autoregressive models for natural language processing, Medium. https://medium.com/@zaiinn440/autoregressive-models-for-natural-language-processing-b95e5f933e1f.

[13] M. Prytula, O. Sinkevych, I. Olenych, Comparison of zero-shot approach and retrieval-augmented generation for analyzing the tone of comments in the Ukrainian language, Electronics and Information Technologies. 28 (2024). https://doi.org/10.30970/eli.28.1.

[14] T. Zhang, V. Kishore, F. Wu, K.Q. Weinberger, Y. Artzi, BERTScore: Evaluating text generation with BERT, arXiv. (2019). https://arxiv.org/abs/1904.09675.

[15] V. Yatsyshyn, et al., Technology of relational database management systems performance evaluation during computer systems design, Scientific Journal of the Ternopil National Technical University. 109 (2023) 59–??. https://doi.org/10.33108/visnyk_tntu2023.01.054.

Most read articles by the same author(s)

Oleh Pastukh, Vasyl Yatsyshyn, Development of software for neuromarketing based on artificial intelligence and data science using high-performance computing and parallel programming technologies , Scientific journal of the Ternopil National Technical University: Vol. 113 No. 1 (2024)
Viktor Stashkiv, Andrii Khamarchuk, Kyrylo Chornopyskyi, Vladyslav Shumeiko, Maksym Chorniak, Karina Yarosh, Valentyna Tserkovniuk, Oleh Pastukh, The semantic power of text content as a flow of a vector field of embeddings , Scientific journal of the Ternopil National Technical University: Vol. 120 No. 4 (2025)
Vasyl Yatsyshyn, Oleh Pastukh, Andriy Palamar, Ruslan Zharovskyi, Technology of relational database management systems performance evaluation during computer systems design , Scientific journal of the Ternopil National Technical University: Vol. 109 No. 1 (2023)
Oleh Pastukh, Vasyl Yatsyshyn, Brain-computer interaction neurointerface based on artificial intelligence and its parallel programming using high-performance calculation on cluster mobile devices , Scientific journal of the Ternopil National Technical University: Vol. 112 No. 4 (2023)
Yuriy Petrov, Oleh Pastukh, Comparative analysis of MLP and KAN neural network architectures in neurointerface technologies , Scientific journal of the Ternopil National Technical University: Vol. 119 No. 3 (2025)
Oleksandr Bryk, Oleh Pastukh, Extraction of important data for cognitive software systems based on data science , Scientific journal of the Ternopil National Technical University: Vol. 117 No. 1 (2025)
Oleksandr Zadvornyi, Oleh Pastukh, Сomparative analysis of machine learning algorithms for market capitalization time series forecasting , Scientific journal of the Ternopil National Technical University: Vol. 119 No. 3 (2025)
Volodymyr Stefanyshyn, Ivan Stefanyshyn, Oleh Pastukh, Serhii Kulikov, Comparison of the accuracy of machine learning algorithms for brain-computer interaction based on high-performance computing technologies , Scientific journal of the Ternopil National Technical University: Vol. 115 No. 3 (2024)