Tag - MLLMs - Ayang's home

4 articles in total

2025

Paper Notes - InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

Paper Notes - MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models

Paper Notes - BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

Paper Notes - BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation