INDEX
Explanations
mentions of legal or criminal activities involving juveniles
terms related to juvenile justice and delinquency
New Auto-Interp
Negative Logits
cross
-0.79
pr
-0.77
lessly
-0.72
ding
-0.70
ãĥ©
-0.68
sts
-0.67
witz
-0.67
prise
-0.66
acle
-0.64
pert
-0.63
POSITIVE LOGITS
delinqu
1.34
delinquent
1.07
Juven
0.86
detention
0.83
Swim
0.79
ishly
0.78
Detention
0.76
egal
0.73
males
0.72
ellar
0.70
Activations Density 0.049%