MIL-Adapter: Coupling multiple instance learning and vision-language adapters for few-shot slide-level classification
Contrastive language-image pretraining has greatly enhanced visual representation learning and enabled zero-shot classification. Vision-language models (VLMs) have succeeded in few-shot learning by leveraging adaptation modules fine-tuned for specific ... ...