INDEX
Explanations
references to the country Pakistan
mentions of Pakistan
New Auto-Interp
Negative Logits
Lys
-0.71
LU
-0.69
Bucc
-0.63
Saturn
-0.63
Sacrament
-0.63
llo
-0.62
Merrill
-0.61
umbnails
-0.61
Vader
-0.61
mble
-0.60
POSITIVE LOGITS
istani
1.65
abad
1.12
istan
1.07
awar
1.04
Sharif
0.95
Pakistan
0.94
Taliban
0.90
Karachi
0.88
adesh
0.87
Pakistan
0.82
Activations Density 0.030%