INDEX
Explanations
mentions of cultural aspects and values in various contexts
New Auto-Interp
Negative Logits
lain
-0.85
ldon
-0.84
etsk
-0.83
ertodd
-0.80
agher
-0.79
deen
-0.79
upon
-0.78
oidal
-0.77
ishable
-0.77
alos
-0.75
POSITIVE LOGITS
appropriation
1.10
Marxism
1.08
heritage
1.07
significance
0.96
anthropology
0.96
norms
0.94
imperialism
0.93
arte
0.88
literacy
0.87
competence
0.87
Activations Density 0.020%