INDEX
Explanations
terms related to ethnic groups and their demographics
New Auto-Interp
Negative Logits
CD
-0.38
し
-0.34
ver
-0.33
mobile
-0.33
τεύ
-0.32
Cien
-0.32
morality
-0.31
simp
-0.31
legal
-0.31
buc
-0.30
POSITIVE LOGITS
principalColumn
0.78
المعيارى
0.73
الرياضيه
0.73
שוליים
0.58
فريبيس
0.58
ority
0.57
pinulongan
0.57
Hentet
0.57
llong
0.56
minorities
0.55
Activations Density 0.709%