BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

November 1, 2024

This post is the note of the paper “BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models”. See docs for more info.

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

Lenan Wu