Code & Coffee
  • 首页
  • 归档
  • 分类
  • 标签
  • 关于

共计 7 篇文章


2025

03-10
论文笔记 - InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
03-06
论文笔记 - MiniGPT-4 Enhancing Vision-Language Understanding with Advanced Large Language Models
03-06
论文笔记 - BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
03-06
论文笔记 - BLIP Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

2024

12-04
论文笔记 - An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
12-03
论文笔记 - Improving Language Understanding by Generative Pre-Training
11-29
论文笔记 - BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

搜索

晚安~
总访问量 次 总访客数 人