Explanations
references to cultural aspects or themes
references to "culture" and its significance in various contexts
Negative Logits
ieth           -0.91
issan          -0.79
wered          -0.78
iary           -0.75
arted          -0.74
agher          -0.72
deen           -0.71
istant         -0.71
ishable        -0.71
angler         -0.71

Positive Logits
Culture        0.80
indo           0.79
Diversity      0.78
immersion      0.78
Appropri       0.77
Marxism        0.76
clash          0.74
diversity      0.73
appropriation  0.70
heritage       0.69
Activations Density: 0.024%