Negative Logits
itſelf      -1.12
ſelf        -1.05
Jefus       -1.03
purpoſe     -1.02
Efq         -1.02
myſelf      -0.99
chofe       -0.94
prevailed   -0.93
pleaſure    -0.93
raiſ        -0.91
Positive Logits
[           0.57
a           0.56
the         0.55
an          0.55
set         0.54
<eos>       0.52
setting     0.52
,           0.52
both        0.52
events      0.51
Activations Density 0.017%