INDEX
Explanations
references to cultural elements and diversity
Negative Logits
culture     -0.68
Culture     -0.65
Culture     -0.62
cultures    -0.61
culture     -0.58
文化        -0.51
cultura     -0.50
kultur      -0.50
Cultural    -0.47
cultured    -0.47
Positive Logits
クリ        0.16
heritage    0.16
religion    0.16
lang        0.15
history     0.15
Heritage    0.15
orum        0.15
values      0.15
Geschichte  0.14
egend       0.14
Activations Density 0.020%