INDEX
Explanations
mentions of specific organizations or locations mentioned in news articles
references to the New York Times
New Auto-Interp
Negative Logits
advis
-0.74
wagen
-0.73
aquarium
-0.70
debtor
-0.70
purse
-0.69
induct
-0.68
stewards
-0.68
enthusi
-0.68
guiActiveUn
-0.67
imus
-0.66
POSITIVE LOGITS
Va
0.83
O
0.80
J
0.79
Org
0.77
org
0.77
jpg
0.75
CO
0.75
C
0.74
S
0.74
E
0.73
Activations Density 0.097%