INDEX
Explanations
mentions of cultural aspects or references
references to culture
Negative Logits
wered      -0.86
deen       -0.82
ieth       -0.82
issan      -0.78
iary       -0.78
ishable    -0.75
arted      -0.74
angler     -0.73
igans      -0.72
fman       -0.72
Positive Logits
appropriation    0.80
wars             0.80
indo             0.80
clash            0.79
Appropri         0.78
Marxism          0.77
Diversity        0.74
Culture          0.73
diversity        0.69
immersion        0.69
Activations Density: 0.034%