INDEX
Explanations
references to the country Afghanistan
references to Afghanistan and related entities
Negative Logits
Token       Logit
yss         -0.87
creen       -0.85
vous        -0.83
*/(         -0.77
Hunt        -0.75
constitu    -0.72
Collider    -0.71
ynt         -0.71
nces        -0.67
ometimes    -0.67
Positive Logits
Token         Logit
istan         1.18
ghan          1.11
Afghan        1.04
Afghanistan   1.02
Kabul         1.01
Afgh          0.99
Taliban       0.98
Afghans       0.96
Albania       0.93
istani        0.88
Activations Density: 0.024%
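
For readers unfamiliar with the density figure, here is a minimal sketch of one way such a number could be computed. It assumes that density means the fraction of token positions on which the feature's activation exceeds a threshold (zero here); the source does not state the definition. The function name `activation_density` and the sample data are hypothetical and chosen only to reproduce a value of the same magnitude.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the feature is active.

    Assumption: "active" means the activation is strictly above `threshold`.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 3 of 12,500 tokens.
acts = [0.0] * 12_497 + [1.3, 0.4, 2.1]
density = activation_density(acts)
print(f"{density:.3%}")  # prints "0.024%"
```

Under this reading, a density of 0.024% would mean the feature fires on roughly 1 token in every 4,000, which fits a feature tied to a single narrow topic.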