INDEX
Explanations
references to children
references to children in various contexts, particularly related to abuse
New Auto-Interp
Negative Logits
olulu
-0.80
atility
-0.77
aeda
-0.74
CRIP
-0.74
municip
-0.68
reen
-0.67
Bullets
-0.66
henko
-0.65
æ©Ł
-0.63
daq
-0.63
POSITIVE LOGITS
children
1.01
child
0.98
ishly
0.94
child
0.92
hood
0.89
Child
0.83
riages
0.82
Child
0.82
children
0.81
birth
0.78
Activations Density 0.020%