INDEX
Explanations
references to the country "Iran"
mentions of Iran
Negative Logits
LAPD      -0.76
illard    -0.75
Niet      -0.74
inances   -0.73
Trafford  -0.71
gorilla   -0.67
Odin      -0.65
opa       -0.64
DV        -0.63
imore     -0.63
Positive Logits
Revolutionary  0.94
Tehran         0.85
ollah          0.85
nuclear        0.84
Contra         0.83
Rouhani        0.82
Nuclear        0.81
uclear         0.80
ibi            0.80
abad           0.80
Activations Density: 0.031%