INDEX
Explanations
legal terminology and references related to court cases
Negative Logits
Efq           -1.68
myſelf        -1.61
himſelf       -1.56
pleaſure      -1.55
Jefus         -1.55
purpoſe       -1.53
themſelves    -1.52
uſed          -1.50
whoſe         -1.49
itſelf        -1.47
Positive Logits
<eos>                 0.97
saites                0.86
...                   0.81
.                     0.79
I                     0.78
'                     0.76
<=",                  0.76
,                     0.73
(no visible token)    0.71
(no visible token)    0.70
Activations Density 0.349%