[1]
Zhang, Y. and Huang, P. 2026. Efficient Deployment of Multimodal LargeModels: A Surveyon Technical Innovations, Industrial Applications, and Challenges of Heterogeneous MoE Architecture, Low-bit Quantization, and Cloud-Edge-End Collaboration (2024-2026). Silence. 1, 1 (Feb. 2026), 13–39. DOI:https://doi.org/10.5281/zenodo.18681507.