INDEX
Explanations
references to late-night television hosts and their careers
New Auto-Interp
Negative Logits
ansa
-0.19
entai
-0.18
iego
-0.18
chant
-0.17
anzi
-0.15
dök
-0.15
uml
-0.14
anza
-0.14
vore
-0.14
Narr
-0.14
POSITIVE LOGITS
Late
0.45
Fallon
0.40
Late
0.36
Tonight
0.36
Jimmy
0.35
Letter
0.34
Colbert
0.33
Conan
0.31
late
0.30
Letter
0.29
Activations Density 0.044%