Return to Article Details Enabling On-Device Inference of Large Language Models : Challenges, Techniques, and Applications Download Download PDF