ZHANG, Yaolin; HUANG, Pengrong. Efficient Deployment of Multimodal LargeModels: A Surveyon Technical Innovations, Industrial Applications, and Challenges of Heterogeneous MoE Architecture, Low-bit Quantization, and Cloud-Edge-End Collaboration (2024-2026). Silence, Berlin, Germany, v. 1, n. 1, p. 13–39, 2026. DOI: 10.5281/zenodo.18681507. Disponível em: https://test.journals.panorama-sg.com/index.php/Silence/article/view/232.. Acesso em: 7 apr. 2026.