INDEX
Explanations
references to various age groups and relationship statuses
Negative Logits
Token      Logit
ighter     -0.20
ertas      -0.16
utut       -0.16
onta       -0.15
iesen      -0.15
ιά         -0.15
Welfare    -0.15
Ŀ          -0.14
.mods      -0.14
\Column    -0.14
Positive Logits
Token      Logit
middle     0.34
40         0.31
-middle    0.28
mid        0.28
35         0.27
middle     0.27
50         0.27
45         0.26
60         0.26
30         0.24
Activations Density 0.104%