Venkata Siva Prasad Bharathula. “The Role of Reward Models and Reinforcement Learning in LLM Fine-Tuning”. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, vol. 11, no. 2, Mar. 2025, pp. 471-7, https://doi.org/10.32628/CSEIT25112381.