INDEX
Explanations
references to Washington or its institutions
New Auto-Interp
Negative Logits
ymax
-0.15
yw
-0.15
ertation
-0.14
ãĥ©ãĥĥãĤ¯
-0.14
ei
-0.14
اÙĨÙĩ
-0.14
дам
-0.14
ãĥ¼ãĤ
-0.14
ilk
-0.14
owell
-0.14
POSITIVE LOGITS
DC
0.31
s
0.26
DC
0.25
Redskins
0.23
dc
0.23
dc
0.22
Irving
0.22
inton
0.21
(dc
0.20
Wash
0.19
Activations Density 0.022%