Media Summary: multilingual language models, many-to-many machine translation, multilingual datasets, zero-shot transfer for low-resource ... prompt-based learning, prefix tuning, prompt tuning slides: REALM, k-NN LM, do LMs use retrievals? slides:
Umass Cs685 F21 Advanced Nlp Knowledge Distillation - Detailed Analysis & Overview
multilingual language models, many-to-many machine translation, multilingual datasets, zero-shot transfer for low-resource ... prompt-based learning, prefix tuning, prompt tuning slides: REALM, k-NN LM, do LMs use retrievals? slides: bertology, probe tasks, control probes slides: course schedule: representing images, multimodal transfer learning, visual question answering, image captioning, multimodal pretraining (VilBERT, ... T5, text-to-text pretraining, Common Crawl, decoding algorithms, greedy search, beam search, sampling slides: ...
BERT, fine-tuning, classification tasks, QA tasks, RoBERTa, ELECTRA, ALBERT, TransformerXL notes: ... This lecture (by Sean Welleck) for CMU CS 11-711,