INDEX
Explanations
references to society and societal concepts
New Auto-Interp
Negative Logits
ル
-0.76
mal
-0.73
ر
-0.73
ros
-0.72
cur
-0.72
un
-0.69
ar
-0.68
р
-0.67
f
-0.67
fla
-0.64
POSITIVE LOGITS
Societies
1.74
societies
1.68
SOCIETY
1.64
Society
1.63
Society
1.63
society
1.57
society
1.56
Gesellschaft
1.23
sociedades
1.18
общество
1.18
Activations Density 0.091%