INDEX
Explanations
mentions of the word "Pakistani" in the text
references to Pakistan and its citizens
Negative Logits
umbnails    -0.88
aunder      -0.82
phasis      -0.80
rost        -0.79
paio        -0.76
rob         -0.73
Lys         -0.72
uer         -0.72
uden        -0.72
lopp        -0.71
Positive Logits
istani      1.52
Pakistani   1.21
Pakistan    1.03
Pak         0.96
Pakistan    0.94
Islamabad   0.91
Karachi     0.89
Taliban     0.87
Punjab      0.83
Nadu        0.80
Activations Density 0.006%
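
As a rough illustration of how a density figure like the one above can arise, the sketch below assumes that "activations density" means the fraction of token positions on which the feature fires (activation above zero). That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; none of them come from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" here means the share of tokens on which the
    feature is active; the page itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 6 of 100,000 token positions.
sample = [0.0] * 99_994 + [1.3, 0.7, 2.1, 0.4, 5.0, 0.9]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.006%
```

Under that assumption, a density of 0.006% would correspond to the feature firing on roughly 6 out of every 100,000 tokens, which fits a feature tied to a narrow topic.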