Zhang, Y. and Huang, P. (2026) “Efficient Deployment of Multimodal LargeModels: A Surveyon Technical Innovations, Industrial Applications, and Challenges of Heterogeneous MoE Architecture, Low-bit Quantization, and Cloud-Edge-End Collaboration (2024-2026)”, Silence, 1(1), pp. 13–39. doi:10.5281/zenodo.18681507.