INDEX
Explanations
references to specific locations and political figures
New Auto-Interp
Negative Logits
cove
-0.45
cuchar
-0.44
pot
-0.43
portál
-0.42
stav
-0.41
"
-0.41
oregon
-0.40
tarko
-0.40
Bangalore
-0.40
bumbu
-0.40
POSITIVE LOGITS
lamabad
1.22
########.
1.17
Pakistan
1.09
awtextra
1.07
Pakistani
1.04
Pakistan
1.03
1.02
Islamabad
0.98
Lahore
0.97
pakistan
0.94
Activations Density 0.103%