Code & Coffee
  • 首页
  • 归档
  • 分类
  • 标签
  • 关于

论文笔记 - ComprehendEdit: A Comprehensive Dataset and Evaluation Framework for Multimodal Knowledge Editing

1. Information Title: ComprehendEdit: A Comprehensive Dataset and Evaluation Framework for Multimodal Knowledge Editing Link: ComprehendEdit Paper Source: arxiv Date: 2024.12.17 2. Summary 提出了 Compre
2025-03-11
论文阅读
#深度学习 #知识编辑 #多模态

论文笔记 - MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge

1. Information Title: MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge Link: MMKE-Bench Paper Source: International Conference on Learning Representations (ICLR) Date: 2025 2. S
2025-03-10
论文阅读
#深度学习 #知识编辑 #多模态

论文笔记 - InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning

1. Information Title: InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning Link: InstructBLIP Paper Source: Advances in Neural Information Processing Systems (NeurIPS)
2025-03-10
论文阅读
#深度学习 #Pre-training #Fine-tuning #MLLMs #指令微调

论文笔记 - MiniGPT-4 Enhancing Vision-Language Understanding with Advanced Large Language Models

1. Information Title: MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models Link: MiniGPT-4 Paper Source: arXiv Date: 2023.04.20 2. Summary MiniGPT-4 是一个视觉-语言模型,它通过对齐一
2025-03-06
论文阅读
#深度学习 #Pre-training #Fine-tuning #MLLMs

论文笔记 - BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

1. Information Title: BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models Link: BLIP-2 Paper Source: International Joint Conference on Artificial Int
2025-03-06
论文阅读
#深度学习 #Pre-training #Fine-tuning #MLLMs

论文笔记 - BLIP Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

1. Information Title: BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation Link: BLIP Paper Source: International Conference on Machine Learning (IC
2025-03-06
论文阅读
#深度学习 #Pre-training #Fine-tuning #MLLMs

论文笔记 - Masked Autoencoders Are Scalable Vision Learners

1. Information Title: Masked Autoencoders Are Scalable Vision Learners Link: MAE Paper Source: Conference on Computer Vision and Pattern Recognition (CVPR) Date: 2021.11.11 2. Summary 该论文提出了一种简单但有效的自监
2025-03-02
论文阅读
#深度学习 #CV

论文笔记 - Generative Adversarial Nets

1. Information Title: Generative Adversarial Nets Link: GAN Paper Source: Annual Conference on Neural Information Processing Systems (NeurIPS) Date: 2014.06.10 2. Summary 本文提出了一种新的生成模型训练框架——生成对抗网络(GAN
2025-02-27
论文阅读
#CV #AIGC

论文笔记 - MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing Editing

1. Information Title: MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing Link: Paper Link Source: ACL 2024 Findings Date: 2024.02.28 2. Summary 提出 MIKE 基准测试:该基准用于细粒度多模态实体知识编辑(
2025-02-13
论文阅读
#深度学习 #知识编辑 #多模态

论文笔记 - Plug-and-Play Adaptation for Continuously-updated QA

1. Information Title: Plug-and-Play Adaptation for Continuously-updated QA Link: PPA Paper Source: Annual Meeting of the Association for Computational Linguistics (ACL) Date: 2022.04.27 2. Summary 本文提
2025-01-26
论文阅读
#深度学习 #NLP #知识编辑
1234

搜索

晚安~
总访问量 次 总访客数 人