Explanations
phrases related to legal actions or court sentences
occurrences of the word "sentenced."
Negative Logits
Token      Logit
loo        -0.72
park       -0.70
walker     -0.69
aucus      -0.69
yip        -0.68
Pegasus    -0.67
BLIC       -0.66
sie        -0.66
otin       -0.64
wer        -0.64
Positive Logits
Token          Logit
icts           1.16
sentencing     1.00
sentenced      0.98
harshly        0.93
sentences      0.90
punishments    0.86
sentence       0.82
punishment     0.82
inmates        0.82
punished       0.81
Activations Density 0.016%
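
The page does not define the density figure. A minimal sketch, assuming it is the fraction of token positions on which the feature's activation is above zero: the function name `activation_density`, the `threshold` parameter, and the sample data below are all hypothetical and chosen only to reproduce a value of 0.016%.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("need at least one activation")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 100,000 token positions, 16 of which activate the feature.
sample = [0.0] * 99_984 + [1.5] * 16
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.016%
```

Under this reading, 0.016% corresponds to roughly 16 firing positions per 100,000 tokens, which would fit a feature tied to a narrow topic such as sentencing and punishment.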