[1]

Venkata Siva Prasad Bharathula 2025. The Role of Reward Models and Reinforcement Learning in LLM Fine-tuning. International Journal of Scientific Research in Computer Science, Engineering and Information Technology. 11, 2 (Mar. 2025), 471–477. DOI:https://doi.org/10.32628/CSEIT25112381.