INDEX
Explanations
references to the Bush administration
Negative Logits
Token     Logit
Frem      -0.69
Qiao      -0.69
helle     -0.67
semble    -0.66
Hawth     -0.62
photos    -0.61
Yao       -0.61
Adin      -0.59
Sebast    -0.59
âĶĢ       -0.58
Positive Logits
Token            Logit
nell             1.29
ido              1.16
wick             1.06
Sr               1.01
master           0.99
Bush             0.93
Hussein          0.93
Bush             0.93
rod              0.87
Administration   0.86
Activations Density 0.038%