Explanations
references to social issues related to crime and youth behavior
Negative Logits
iro         -0.18
roz         -0.17
requested   -0.15
remar       -0.14
alls        -0.14
cher        -0.14
.request    -0.14
hum         -0.14
acute       -0.14
panic       -0.13
Positive Logits
idle        0.17
drugs       0.16
PushButton  0.16
archy       0.16
²           0.15
peer        0.15
Idle        0.15
Conduct     0.15
Attention   0.15
-peer       0.14
Activations Density: 0.089%