INDEX
Negative Logits
myſelf
-1.55
Efq
-1.54
himſelf
-1.43
itſelf
-1.41
Monfieur
-1.41
ſtate
-1.37
houſe
-1.36
Houſe
-1.35
Anſ
-1.34
ſever
-1.33
POSITIVE LOGITS
0.84
://
0.83
‘
0.81
.
0.78
(
0.78
“
0.77
:
0.76
’
0.73
↵↵
0.70
'
0.69
Activations Density 0.126%