1.
Venkata Siva Prasad Bharathula. The Role of Reward Models and Reinforcement Learning in LLM Fine-tuning. Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol. 2025;11(2):471-477. doi:10.32628/CSEIT25112381