Another exciting day for Multimodal AI! The MiniCPM-V repository by is trending on GitHub. Impressive Results:
MiniCPM-Llama3-V 2.5 (8B) surpasses GPT-4V, Gemini Pro, & Claude 3
MiniCPM-V 2.0 (2B) surpasses Yi-VL 34B, CogVLM-Chat 17B, & Qwen-VL-Chat 10B
MiniCPM-V is efficiently deployable on end-side devices Read more: https://github.com/OpenBMB/MiniCPM-V
MiniCPM-V is building with Gradio to showcase framework's flexibility for creating powerful AI Vision apps. Local Gradio demo: https://github.com/OpenBMB/MiniCPM-V?tab=readme-ov-file#webui-demo
@opendatascience