INDEX
Explanations
phrases indicating legal outcomes or emotional responses related to crime and punishment
New Auto-Interp
Negative Logits
4
-1.65
four
-1.49
Four
-1.33
four
-1.29
cuatro
-1.27
FOUR
-1.24
quatro
-1.22
quatre
-1.21
4
-1.21
FOUR
-1.20
POSITIVE LOGITS
Tenth
0.52
Sixth
0.52
Seventh
0.51
Eighth
0.50
tenth
0.49
bezeichneter
0.49
ten
0.49
eighth
0.49
seventh
0.48
Seventh
0.47
Activations Density 0.757%