Venkata Siva Prasad Bharathula (2025) “The Role of Reward Models and Reinforcement Learning in LLM Fine-tuning”, International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 11(2), pp. 471–477. doi:10.32628/CSEIT25112381.