INDEX
Negative Logits
myſelf
-1.00
itſelf
-0.99
auffi
-0.98
Efq
-0.95
houſe
-0.92
CDP
-0.90
bogotá
-0.89
SLP
-0.88
ſche
-0.87
ſtate
-0.86
POSITIVE LOGITS
.
0.59
0.58
co
0.54
(
0.52
,
0.50
Co
0.50
[
0.48
or
0.48
er
0.48
a
0.47
Activations Density 0.115%