INDEX
Explanations
mentions of the country "Iran"
occurrences of the word "Iran."
Negative Logits
Token       Logit
åĭ          -0.83
Trafford    -0.83
VEL         -0.78
rious       -0.74
rompt       -0.72
inances     -0.71
illard      -0.71
roots       -0.71
shaw        -0.70
Jenner      -0.70
Positive Logits
Token       Logit
Iran        0.85
Contra      0.79
ibi         0.78
Arabia      0.75
retali      0.73
iens        0.72
Tehran      0.72
Iranians    0.70
sanctions   0.70
Iran        0.68
Activations Density 0.013%