INDEX
Explanations
references to Western culture or geographical regions
mentions of compass-direction adjectives used to label regions, places, cultures, or institutions.
New Auto-Interp
Negative Logits
iseite
-0.75
makeConstraints
-0.74
akesh
-0.73
gemaakt
-0.72
angeran
-0.68
ofür
-0.65
zocht
-0.65
adple
-0.65
Praze
-0.64
voyance
-0.64
POSITIVE LOGITS
ization
1.24
ized
1.17
Northern
1.08
Northern
1.03
isation
1.00
Southern
0.99
FER
0.97
Southern
0.96
{(\0.96
NORTHERN
0.95
Activations Density 0.043%