《DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding》论文阅读
论文原文链接:https://arxiv.org/pdf/2412.10302?本文在DeepSeek-VL以及DeepSeek-V2的基础上来写的,可以先回顾一下这两篇论文的内容:《DeepSeek-VL:TowardsReal-WorldVision-LanguageUnderstanding》阅读解析-CSDN博客《DeepSeek-V2:AStrong,Economical,andEffi