INDEX
Explanations
the word "guilty" or variations of it
terms related to legal guilt or guilty findings
New Auto-Interp
Negative Logits
Dub
-0.74
edia
-0.69
Secure
-0.67
ffee
-0.65
oldown
-0.64
Security
-0.64
UNCH
-0.63
acco
-0.63
afety
-0.63
psey
-0.63
POSITIVE LOGITS
verdict
1.17
plea
1.03
pleas
0.94
innocence
0.87
guilt
0.87
conscience
0.83
guilty
0.81
ilty
0.78
plead
0.77
alty
0.76
Activations Density 0.040%