Text-Supervised Learning 3 [Paper Review] BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation 2024/11/15 [Paper Review] Expanding Language-Image Pretrained Models for General Video Recognition 2024/11/01 [Paper Review] Learning Transferable Visual Models From Natural Language Supervision 2024/11/01