INDEX
Explanations
mentions of the word "South."
south followed by geography
New Auto-Interp
Negative Logits
gesche
-0.53
abler
-0.50
xpress
-0.48
bagels
-0.48
wią
-0.48
remel
-0.47
verknüp
-0.47
itek
-0.46
müſſen
-0.45
commitment
-0.45
POSITIVE LOGITS
South
2.06
South
1.91
SOUTH
1.74
south
1.51
south
1.49
SOUTH
1.46
North
1.25
Selatan
1.14
North
1.13
南
1.04
Activations Density 0.014%