INDEX
Explanations
phrases related to legal actions and admissions of guilt
instances of the term "guilty" related to legal pleas
Negative Logits
Token     Logit
adish     -0.80
edia      -0.76
dos       -0.73
inth      -0.72
Argent    -0.68
nic       -0.68
inas      -0.66
chn       -0.65
clips     -0.64
care      -0.64
Positive Logits
Token          Logit
plea           0.89
guilty         0.80
icts           0.79
Guilty         0.79
verdict        0.69
repeatedly     0.68
manslaughter   0.68
ysis           0.68
voluntarily    0.68
animous        0.67
Activations Density: 0.032%