Безперервні цикли зворотного зв’язку: онлайн-налаштування LLM за допомогою сигналів користувача

Софія Швець

Журнал: Том 30, № 3, 2025

Сторінки: 106 – 120

DOI: https://doi.org/10.62660/bcstu/3.2025.106

1 017 Переглядів

Безперервні цикли зворотного зв’язку: онлайн-налаштування LLM за допомогою сигналів користувача

Софія Швець

Отримано 14.03.2025

Доопрацьовано 01.08.2025

Прийнято 15.09.2025

Анотація

Інтенсивне зростання використання мовних моделей реального часу вимагає механізмів їх динамічної адаптації до змін у запитах, термінології та очікуваннях користувачів. Метою дослідження було вивчення підходів до перенавчання великих мовних моделей на основі безперервного зворотного зв’язку. Для досягнення цієї мети було застосовано теоретичне та структурно-функціональне моделювання архітектури адаптації, експериментальну реалізацію циклу перенавчання мовної моделі з обробкою та класифікацією різних типів зворотного зв’язку, а також кількісну оцінку результатів за допомогою автоматичних та користувацьких метрик. Результати дослідження показали ефективність архітектури безперервного онлайннавчання, яка забезпечує актуальність та стабільність мовної моделі в реальному часі. У дослідженні визначено, що неявний зворотний зв’язок зустрічається в 4-10 разів частіше, ніж явний зворотний зв’язок, але явний зворотний зв’язок дає вищий приріст точності відповідей. Запропонована система успішно інтегрує різні типи користувацьких сигналів, забезпечуючи динамічну генерацію навчальних прикладів та гібридне перенавчання, зберігаючи при цьому якість та узгодженість результатів. Програмний цикл Python для адаптивного перенавчання мовної моделі включав обробку та фільтрацію користувацьких сигналів для формування високоякісного буфера навчальних пар. Після 500 кроків перенавчання на 52 912 парах запит-відповідь спостерігалося значне покращення моделі, що підтверджувалося зменшенням функції втрат з 3,82 до 3,15 та стабільністю процесу точного налаштування без ознак перенавчання. Результати попереднього навчання показали помірне покращення якості відповідей після адаптації: лексична подібність за даними Recall-Oriented Understudy for Gisting Evaluation становила 0,102, точність за даними Bilingual Evaluation Understudy – 0,006, а суб’єктивна задоволеність користувачів зросла до 0,24, зберігаючи при цьому стабільність моделі із середнім значенням косинусної подібності 0,396. Підхід, запропонований у цьому дослідженні, покращує якість та релевантність відповідей мовних моделей у реальному часі, зберігаючи їх стабільність, і може бути використаний у продуктивних системах для покращення користувацького досвіду

Ключові слова

адаптивне перенавчання; генеративні трансформатори; адаптація динамічної моделі; реалізація на Python; гібридне навчання; метрики оцінки якості; стабільність мовної моделі

Використані джерела

Ai, Q., Dou, Z., & Zhang, M. (2024). Improving generative information retrieval systems based on user feedback. In R.W. White & C. Shah (Eds.), Information access in the era of generative AI (pp. 111-133). Cham: Springer. doi: 10.1007/978-3-031-73147-1_5 .
Anisuzzaman, D.M., Malins, J.G., Friedman, P.A., & Attia, Z.I. (2025). Fine-tuning LLMs for specialized use cases. Mayo Clinic Proceedings Digital Health, 3(1), article number 100184. doi: 10.1016/j.mcpdig.2024.11.005 .
Balaskas, G., Papadopoulos, H., Pappa, D., Loisel, Q., & Chastin, S. (2025). A framework for domain-specific dataset creation and adaptation of large language models. Computers, 14(5), article number 172. doi: 10.3390/ computers14050172 .
Bodaghi, A., Fung, B.C.M., & Schmitt, K.A. (2024). AugmenToxic: Leveraging reinforcement learning to optimize LLM instruction fine-tuning for data augmentation to enhance toxicity detection. ACM Transactions on the Web. doi: 10.1145/3700791 .
Cardó, A.V. (2025). Investigating feedback types in reinforcement learning with human feedback and large language models. (Bachelor’s thesis, Aalto University, Espoo, Finland).
Dombi, J., & Jónás, T. (2022). Weighted aggregation systems and an expectation level-based weighting and scoring procedure. European Journal of Operational Research, 299(2), 580-588. doi: 10.62441/nano-ntp.vi.5447.
Gaikwad, S.V., Agarkar, P., Mohapatra, S., & Bagade, S. (2024). Fine-tuning LLM for sentiment analysis. Nanotechnology Perceptions, 20(6), 4946-4959. doi: 10.62441/nano-ntp.vi.5447 .
Haider, Z., Rahman, H., Devabhaktuni, V., Moeykens, S., & Chakraborty, P. (2025). A framework for mitigating malicious RLHF feedback in LLM training using consensus-based reward. Scientific Reports, 15, article number 9177. doi: 10.1038/s41598-025-92889-7.
Hao, S., & Duan, L. (2025). Online learning from strategic human feedback in LLM fine-tuning. In CASSP 2025 – 2025 IEEE international conference on acoustics, speech and signal processing (pp. 1-5). Hyderabad: IEEE. doi: 10.1109/ICASSP49660.2025.10887891 .
Hilel, A., Shenfeld, I., Andreas, J., & Choshen, L. (2025). LLM hypnosis: Exploiting user feedback for unauthorized knowledge injection to all users. ArXiv. doi: 10.48550/arXiv.2507.02850.
Kamath, U., Keenan, K., Somers, G., & Sorenson, S. (2024). Tuning for LLM alignment. In Large language models: A deep dive (pp. 177-218). Cham: Springer. doi: 10.1007/978-3-031-65647-7_5.
Kampelopoulos, D., Tsanousa, A., Vrochidis, S., & Kompatsiaris, I. (2025). A review of LLMs and their applications in the architecture, engineering and construction industry. Artificial Intelligence Review, 58, article number 250. doi: 10.1007/s10462-025-11241-7.
Karunakaran, S., & Jain, A. (2025). Fine-tuning LLMs for personality preservation in AI assistants. International Journal of Research in Modern Engineering & Emerging Technology, 13(4), 172-191. doi: 10.63345/ijrmeet.org. v13.i4.10.
Lin, X., Wang, W., Li, Y., Yang, S., Feng, F., Wei, Y., & Chua, T. (2024). Data-efficient fine-tuning for LLMbased recommendation. In Proceedings of the 45th international ACM SIGIR conference on research and development in information retrieval (pp. 365-374). New York: Association for Computing Machinery. doi: 10.1145/3626772.3657807 .
Martin, D. (2025). Improving CXR report labeling through LLM fine-tuning and human feedback. Preprints. doi: 10.20944/preprints202504.1668.v1 .
Mazzullo, E., & Bulut, O. (2025). Automated feedback generation for open-ended questions: Insights from fine-tuned LLMs . In Proceedings of large foundation models for educational assessment (pp. 103-120). Vancouver: MLResearchPress.
Next Electronics. (n.d.). Elastic weight consolidation (EWC). Retrieved from https://www.next.gr/ai/deep-learning theory/elastic-weight-consolidation-ewc/.
NinjaTech AI. (n.d.). Ninja LLM suite. Retrieved from https://www.ninjatech.ai/product/ninja-llm.
OpenAssistant conversations dataset. (n.d.). Retrieved from https://huggingface.co/datasets/OpenAssistant/ oasst1 .
Pratap, S., Aranha, A.R., Kumar, D., Malhotra, G., Iyer, A.P.N., & Shylaja, S.S. (2025). The fine art of fine-tuning: A structured review of advanced LLM fine-tuning techniques. Natural Language Processing Journal, 11, article number 100144. doi: 10.1016/j.nlp.2025.100144 .
Punnaivanam, M., & Velvizhy, P. (2024). Contextual fine-tuning of language models with classifier-driven content moderation for text generation. Entropy, 26(12), article number 1114. doi: 10.3390/e26121114.
Rawal, N., Tavva, P., & Selvakumar, P. (2024). Enhancing large language model performance with reinforcement learning from human feedback: A comprehensive study on Q&A, summarization, and classification. In 2024 international conference on electrical, computer and energy technologies (pp. 1-6). Sydney: IEEE. doi: 10.1109/ ICECET61485.2024.10698396 .
Rehan, S., Al-Bander, B., & Al-Said Ahmad, A. (2025). Harnessing large language models for automated software testing: A leap towards scalable test case generation. Electronics, 14(7), article number 1463. doi: 10.3390/ electronics14071463 .
Shi, T., et al. (2024). WildFeedback: Aligning LLMs with in-situ user interactions and feedback. ArXiv. doi: 10.48550/arXiv.2408.15549.
Terras, N., Pereira, F., Silva, A.R., Santos, AA., Lopes, A.M., Silva, A.F.D., Cartal, L.A., Apostolescu, T.C., Badea, F., & Machado, J. (2025). Integration of deep learning vision systems in collaborative robotics for real-time applications. Applied Sciences, 15(3), article number 1336. doi: 10.3390/app15031336 .
Wang, J., Lu, H., Liu, Y., Ma, H., Wang, Y., Gu, Y., Zhang, S., Han, N., Bi, S., Baugher, L., Chi, E.H., & Chen, M. (2024). LLMs for user interest exploration in large-scale recommendation systems. In T. di Noia, P. Lops, T. Joachims. K. Verbert, P. Castells, Z. Dong & B. London (Eds.), Proceedings of the 18th ACM conference on recommender systems (pp. 872-877). New York: Assoviation Computing Machinery. doi: 10.1145/3640457.3688161 .
Wang, K., Lu, Y., Santacroce, M., Gong, Y., Zhang, C., & Shen, Y. (2025). Adapting LLM agents with universal communication feedback. In Findings of the association for computational linguistics (pp. 6105-6122). Albuquerque: Association for Computational Linguistics. doi: 10.18653/v1/2025.findings-naacl.339.
Watson, E., Viana, T., Zhang, S., Sturgeon, B., & Petersson, L. (2024). Towards an end-to-end personal fine-tuning framework for AI value alignment. Electronics, 13(20), article number 4044. doi: 10.3390/electronics13204044 .
Wu, X.-K., et al. (2025). LLM fine-tuning: Concepts, opportunities, and challenges. Big Data and Cognitive Computing, 9(4), article number 87. doi: 10.3390/bdcc9040087 .
Yang, B., Tian, H., Ren, J., Zhang, H., Klein, J., Bissyandé, T.F., Le Goues, C., & Jin, S. (2025). MORepair: Teaching LLMs to repair code via multi-objective fine-tuning. ACM Transactions on Software Engineering and Methodology. doi: 10.1145/3735129 .
Ye, C., Xiong, W., Zhang, Y., Dong, H., Jiang, N., & Zhang, T. (2025). Online iterative reinforcement learning from human feedback with general preference model . In Proceedings of the 38th international conference on neural information processing systems (pp. 81773-81807). Red Hook: Curran Associates Inc.
Zhao, Q., Harper, F.M., Adomavicius, G., & Konstan, J.A. (2018). Explicit or implicit feedback? Engagement or satisfaction? A field experiment on machine-learning-based recommender systems. In Proceedings of the 33rd annual ACM symposium on applied computing (pp. 1331-1340). New York: Association for Computing Machinery. doi: 10.1145/3167132.3167275.

ЦИТУВАТИ

Shvets, S. (2025). Continuous feedback loops: Online fine-tuning of LLMs with user signals. Bulletin of Cherkasy State Technological University, 30(3), 106-120. https://doi.org/10.62660/bcstu/3.2025.106

[1] Ai, Q., Dou, Z., & Zhang, M. (2024). Improving generative information retrieval systems based on user feedback. In R.W. White & C. Shah (Eds.), Information access in the era of generative AI (pp. 111-133). Cham: Springer. doi: 10.1007/978-3-031-73147-1_5 .

[2] Anisuzzaman, D.M., Malins, J.G., Friedman, P.A., & Attia, Z.I. (2025). Fine-tuning LLMs for specialized use cases. Mayo Clinic Proceedings Digital Health, 3(1), article number 100184. doi: 10.1016/j.mcpdig.2024.11.005 .

[3] Balaskas, G., Papadopoulos, H., Pappa, D., Loisel, Q., & Chastin, S. (2025). A framework for domain-specific dataset creation and adaptation of large language models. Computers, 14(5), article number 172. doi: 10.3390/ computers14050172 .

[4] Bodaghi, A., Fung, B.C.M., & Schmitt, K.A. (2024). AugmenToxic: Leveraging reinforcement learning to optimize LLM instruction fine-tuning for data augmentation to enhance toxicity detection. ACM Transactions on the Web. doi: 10.1145/3700791 .

[5] Cardó, A.V. (2025). Investigating feedback types in reinforcement learning with human feedback and large language models. (Bachelor’s thesis, Aalto University, Espoo, Finland).

[6] Dombi, J., & Jónás, T. (2022). Weighted aggregation systems and an expectation level-based weighting and scoring procedure. European Journal of Operational Research, 299(2), 580-588. doi: 10.62441/nano-ntp.vi.5447.

[7] Gaikwad, S.V., Agarkar, P., Mohapatra, S., & Bagade, S. (2024). Fine-tuning LLM for sentiment analysis. Nanotechnology Perceptions, 20(6), 4946-4959. doi: 10.62441/nano-ntp.vi.5447 .

[8] Haider, Z., Rahman, H., Devabhaktuni, V., Moeykens, S., & Chakraborty, P. (2025). A framework for mitigating malicious RLHF feedback in LLM training using consensus-based reward. Scientific Reports, 15, article number 9177. doi: 10.1038/s41598-025-92889-7.

[9] Hao, S., & Duan, L. (2025). Online learning from strategic human feedback in LLM fine-tuning. In CASSP 2025 – 2025 IEEE international conference on acoustics, speech and signal processing (pp. 1-5). Hyderabad: IEEE. doi: 10.1109/ICASSP49660.2025.10887891 .

[10] Hilel, A., Shenfeld, I., Andreas, J., & Choshen, L. (2025). LLM hypnosis: Exploiting user feedback for unauthorized knowledge injection to all users. ArXiv. doi: 10.48550/arXiv.2507.02850.

[11] Kamath, U., Keenan, K., Somers, G., & Sorenson, S. (2024). Tuning for LLM alignment. In Large language models: A deep dive (pp. 177-218). Cham: Springer. doi: 10.1007/978-3-031-65647-7_5.

[12] Kampelopoulos, D., Tsanousa, A., Vrochidis, S., & Kompatsiaris, I. (2025). A review of LLMs and their applications in the architecture, engineering and construction industry. Artificial Intelligence Review, 58, article number 250. doi: 10.1007/s10462-025-11241-7.

[13] Karunakaran, S., & Jain, A. (2025). Fine-tuning LLMs for personality preservation in AI assistants. International Journal of Research in Modern Engineering & Emerging Technology, 13(4), 172-191. doi: 10.63345/ijrmeet.org. v13.i4.10.

[14] Lin, X., Wang, W., Li, Y., Yang, S., Feng, F., Wei, Y., & Chua, T. (2024). Data-efficient fine-tuning for LLMbased recommendation. In Proceedings of the 45th international ACM SIGIR conference on research and development in information retrieval (pp. 365-374). New York: Association for Computing Machinery. doi: 10.1145/3626772.3657807 .

[15] Martin, D. (2025). Improving CXR report labeling through LLM fine-tuning and human feedback. Preprints. doi: 10.20944/preprints202504.1668.v1 .

[16] Mazzullo, E., & Bulut, O. (2025). Automated feedback generation for open-ended questions: Insights from fine-tuned LLMs . In Proceedings of large foundation models for educational assessment (pp. 103-120). Vancouver: MLResearchPress.

[17] Next Electronics. (n.d.). Elastic weight consolidation (EWC). Retrieved from https://www.next.gr/ai/deep-learning theory/elastic-weight-consolidation-ewc/.

[18] NinjaTech AI. (n.d.). Ninja LLM suite. Retrieved from https://www.ninjatech.ai/product/ninja-llm.

[19] OpenAssistant conversations dataset. (n.d.). Retrieved from https://huggingface.co/datasets/OpenAssistant/ oasst1 .

[20] Pratap, S., Aranha, A.R., Kumar, D., Malhotra, G., Iyer, A.P.N., & Shylaja, S.S. (2025). The fine art of fine-tuning: A structured review of advanced LLM fine-tuning techniques. Natural Language Processing Journal, 11, article number 100144. doi: 10.1016/j.nlp.2025.100144 .

[21] Punnaivanam, M., & Velvizhy, P. (2024). Contextual fine-tuning of language models with classifier-driven content moderation for text generation. Entropy, 26(12), article number 1114. doi: 10.3390/e26121114.

[22] Rawal, N., Tavva, P., & Selvakumar, P. (2024). Enhancing large language model performance with reinforcement learning from human feedback: A comprehensive study on Q&A, summarization, and classification. In 2024 international conference on electrical, computer and energy technologies (pp. 1-6). Sydney: IEEE. doi: 10.1109/ ICECET61485.2024.10698396 .

[23] Rehan, S., Al-Bander, B., & Al-Said Ahmad, A. (2025). Harnessing large language models for automated software testing: A leap towards scalable test case generation. Electronics, 14(7), article number 1463. doi: 10.3390/ electronics14071463 .

[24] Shi, T., et al. (2024). WildFeedback: Aligning LLMs with in-situ user interactions and feedback. ArXiv. doi: 10.48550/arXiv.2408.15549.

[25] Terras, N., Pereira, F., Silva, A.R., Santos, AA., Lopes, A.M., Silva, A.F.D., Cartal, L.A., Apostolescu, T.C., Badea, F., & Machado, J. (2025). Integration of deep learning vision systems in collaborative robotics for real-time applications. Applied Sciences, 15(3), article number 1336. doi: 10.3390/app15031336 .

[26] Wang, J., Lu, H., Liu, Y., Ma, H., Wang, Y., Gu, Y., Zhang, S., Han, N., Bi, S., Baugher, L., Chi, E.H., & Chen, M. (2024). LLMs for user interest exploration in large-scale recommendation systems. In T. di Noia, P. Lops, T. Joachims. K. Verbert, P. Castells, Z. Dong & B. London (Eds.), Proceedings of the 18th ACM conference on recommender systems (pp. 872-877). New York: Assoviation Computing Machinery. doi: 10.1145/3640457.3688161 .

[27] Wang, K., Lu, Y., Santacroce, M., Gong, Y., Zhang, C., & Shen, Y. (2025). Adapting LLM agents with universal communication feedback. In Findings of the association for computational linguistics (pp. 6105-6122). Albuquerque: Association for Computational Linguistics. doi: 10.18653/v1/2025.findings-naacl.339.

[28] Watson, E., Viana, T., Zhang, S., Sturgeon, B., & Petersson, L. (2024). Towards an end-to-end personal fine-tuning framework for AI value alignment. Electronics, 13(20), article number 4044. doi: 10.3390/electronics13204044 .

[29] Wu, X.-K., et al. (2025). LLM fine-tuning: Concepts, opportunities, and challenges. Big Data and Cognitive Computing, 9(4), article number 87. doi: 10.3390/bdcc9040087 .

[30] Yang, B., Tian, H., Ren, J., Zhang, H., Klein, J., Bissyandé, T.F., Le Goues, C., & Jin, S. (2025). MORepair: Teaching LLMs to repair code via multi-objective fine-tuning. ACM Transactions on Software Engineering and Methodology. doi: 10.1145/3735129 .

[31] Ye, C., Xiong, W., Zhang, Y., Dong, H., Jiang, N., & Zhang, T. (2025). Online iterative reinforcement learning from human feedback with general preference model . In Proceedings of the 38th international conference on neural information processing systems (pp. 81773-81807). Red Hook: Curran Associates Inc.

[32] Zhao, Q., Harper, F.M., Adomavicius, G., & Konstan, J.A. (2018). Explicit or implicit feedback? Engagement or satisfaction? A field experiment on machine-learning-based recommender systems. In Proceedings of the 33rd annual ACM symposium on applied computing (pp. 1331-1340). New York: Association for Computing Machinery. doi: 10.1145/3167132.3167275.