[1]
Y. Zhang and P. Huang, “Efficient Deployment of Multimodal LargeModels: A Surveyon Technical Innovations, Industrial Applications, and Challenges of Heterogeneous MoE Architecture, Low-bit Quantization, and Cloud-Edge-End Collaboration (2024-2026)”, Silence, vol. 1, no. 1, pp. 13–39, Feb. 2026, doi: 10.5281/zenodo.18681507.