INDEX
Negative Logits
}")
-0.79
'}),
-0.73
ſche
-0.67
whoſe
-0.66
'},
-0.65
againſt
-0.63
?}",
-0.63
myſelf
-0.63
iſt
-0.61
)"),
-0.61
POSITIVE LOGITS
!
0.73
ophora
0.54
y
0.49
amis
0.47
ylate
0.45
logia
0.45
шке
0.43
Y
0.43
erate
0.43
inary
0.43
Activations Density 0.001%