INDEX
Negative Logits
Theſe
-0.91
myſelf
-0.90
―――――
-0.89
RegressionTest
-0.88
Reſ
-0.88
itſelf
-0.85
pleaſure
-0.84
ſtate
-0.83
theſe
-0.82
themſelves
-0.82
POSITIVE LOGITS
an
0.53
K
0.48
ones
0.48
as
0.47
strix
0.46
is
0.46
National
0.44
0.44
one
0.44
where
0.43
Activations Density 0.053%