INDEX
Explanations
numerical references to punishment or sentencing durations in years
long prison sentences
Negative Logits
VIDEOS      -0.96
Collider    -0.89
ople        -0.78
acly        -0.71
achus       -0.66
Spawn       -0.65
inite       -0.65
atech       -0.65
Mom         -0.65
edia        -0.64
Positive Logits
ago             1.23
apiece          1.00
probation       0.92
Ago             0.87
imprisonment    0.84
long            0.78
overdue         0.74
incarceration   0.73
olds            0.72
consecut        0.72
Activations Density: 0.087%