Athul Ramkumar. (2024). Enabling On-Device Inference of Large Language Models : Challenges, Techniques, and Applications. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 10(6), 595-604. https://doi.org/10.32628/CSEIT241061100