Two recent trends in NLP---the application of deep neural networks and the use of transfer learning---have resulted in many models that achieve high performance on important tasks but whose behavior on those tasks is difficult to interpret. In this seminar, we will look at methods inspired by linguistics and cognitive science for analyzing what large neural language models have in fact learned: diagnostic/probing classifiers, adversarial test sets, and artificial languages, among others. Particular attention will be paid to probing these models' _semantic_ knowledge, which has received comparatively little attention relative to their syntactic knowledge. Students will acquire relevant skills and (in small groups) design and execute a linguistically informed analysis experiment, resulting in a report in the form of a publishable conference paper.

Monday 3:30 - 5:50 PM

Instructor: Shane Steinert-Threlkeld; Wednesday, 3-5 PM Pacific


  • Mathematical background: linear algebra, multivariable calculus
  • LING 570 or 571
  • LING 572 recommended, but not required
  • One other linguistics course (not necessarily at UW)
  • Programming in Python
  • Linux/Unix commands

As this is a project-oriented, student-driven, seminar-style class, active participation---in the classroom or on Zoom, as well as on Canvas---is expected.

All student work will be carried out in small groups. Groups are free to divide up work as they see fit, but will be required to explain the division of labor with their final project. Except under rare circumstances, every member of a group will receive the same grades.


The final grade will be weighted as follows:

  • Final project paper: 50%
  • Project proposal: 10%
  • Special topic presentation: 30%
  • Class participation: 10%


Any questions concerning course content and logistics should be posted on the Canvas discussion board. If a more personal issue arises, you can email me directly; include "LING575" in the subject line. You can expect a response from teaching staff within 24 hours, counting only normal business hours and excluding weekends.

Mar 29
  • Introduction to Transfer Learning in NLP
  • Course Overview
  • NLP's ImageNet Moment Has Arrived
  • NLP's Clever Hans Moment Has Arrived
  • HW1 (group formation) out
Apr 5
  • Language Models
  • Deep contextualized word representations (ELMo paper)
  • Understanding LSTMs
  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  • The Annotated Transformer
  • The Illustrated Transformer
  • HW1 due
Apr 12
  • Analysis Methods
  • Belinkov and Glass, "Analysis Methods in Neural Language Processing: A Survey"
  • NAACL 2019 Tutorial on Transfer Learning in NLP (slides 73-96)
  • Rogers, Kovaleva, Rumshisky, "A Primer in BERTology: What We Know About How BERT Works"
  • Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies (original linguistic task paper)
  • Linguistic Knowledge and Transferability of Contextual Representations (prototypical probing paper)
  • What Does BERT Look at? An Analysis of BERT’s Attention (prototypical attention paper)
  • Proposal guidelines out [slides]
Apr 19
  • Guest lecture: Rachel Rudinger on the Universal Decompositional Semantics Initiative
  • Other datasets
  • The Universal Decompositional Semantics Dataset and Decomp Toolkit
  • Presentation sign-up
Apr 26
  • Technical resources
  • How to write an NLP paper
  • HuggingFace Transformers [paper, web]
  • AllenNLP [paper, web]
  • Using GPUs on the patas cluster
  • Proposal due
May 3
  • Special Topic 1:
  • Special Topic 2:
May 10
  • Special Topic 1:
  • Special Topic 2:
May 17
  • Special Topic 1:
  • Special Topic 2:
May 24
  • Special Topic 1: Final paper and presentation guidelines
  • Special Topic 2:
May 31
  • Memorial Day: no class

Reading List

This is a snapshot of papers on interpretability / analysis of language models, reflecting my knowledge of the state of the field circa December 2019. NB: The field is large and growing very quickly, so this list is by no means exhaustive and has not been updated since then. To find more literature, I recommend:

  • The references in these papers
  • BlackboxNLP proceedings: 2018, 2019, 2020
  • Search terms in Google Scholar / Semantic Scholar: probing, analysis, diagnostic classifiers
