Zhang, Y., & Huang, P. (2026). Efficient Deployment of Multimodal LargeModels: A Surveyon Technical Innovations, Industrial Applications, and Challenges of Heterogeneous MoE Architecture, Low-bit Quantization, and Cloud-Edge-End Collaboration (2024-2026). Silence, 1(1), 13-39. https://doi.org/10.5281/zenodo.18681507