INDEX
Explanations
references to social beliefs and demographics within a specific geographic area
New Auto-Interp
Negative Logits
iffies
-0.17
iffin
-0.16
ney
-0.16
gis
-0.15
ancias
-0.15
ÃŃch
-0.15
çĻ
-0.15
пÑĢиÑĩ
-0.15
akte
-0.14
loff
-0.14
POSITIVE LOGITS
nominal
0.17
esser
0.17
Nom
0.16
帯
0.16
Stat
0.16
_nom
0.15
/apis
0.14
اÙĨÙĩ
0.14
ando
0.14
mes
0.14
Activations Density 0.008%