1. Venkata Siva Prasad Bharathula. The Role of Reward Models and Reinforcement Learning in LLM Fine-tuning. Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol. [Internet]. 2025 Mar. 4 [cited 2025 Jul. 12];11(2):471-7. Available from: https://ijsrcseit.com/index.php/home/article/view/CSEIT25112381