INDEX
Explanations
phrases related to legal proceedings and injustices
Negative Logits
obstante       -0.74
ſind           -0.70
MMV            -0.69
drawSprites    -0.69
ſelf           -0.68
незавершена    -0.68
lecz           -0.68
Majefty        -0.68
leſs           -0.67
NUMX           -0.66
Positive Logits
[              0.87
actually       0.84
really         0.82
basically      0.75
actually       0.75
maybe          0.74
really         0.70
thing          0.69
maybe          0.69
kind           0.67
Activations Density 0.361%