Athul Ramkumar. “Enabling On-Device Inference of Large Language Models : Challenges, Techniques, and Applications”. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, vol. 10, no. 6, Nov. 2024, pp. 595-04, https://doi.org/10.32628/CSEIT241061100.