2/10/2025
•
EN
TIL: Masked Language Models Are Surprisingly Capable Zero-Shot Learners
Explores using a masked language model's head for zero-shot tasks, achieving strong results without task-specific heads.