INDEX
Explanations
references to cultural identity and its implications on societal behaviors
New Auto-Interp
Negative Logits
createSlice
-0.40
mariana
-0.37
SCRIBE
-0.36
Życiorys
-0.36
uarkan
-0.35
กรรม
-0.34
régle
-0.34
Reputation
-0.33
RefNanny
-0.33
RegisterType
-0.33
POSITIVE LOGITS
Western
1.76
western
1.62
Western
1.58
western
1.42
WESTERN
1.40
occidental
1.39
occident
1.34
wester
1.27
Wester
1.24
WESTERN
1.23
Activations Density 0.555%