INDEX
Negative Logits
us
-0.91
itſelf
-0.84
i
-0.75
a
-0.69
e
-0.64
usza
-0.63
myſelf
-0.63
themſelves
-0.63
invokingState
-0.58
RetentionPolicy
-0.58
POSITIVE LOGITS
utafitiHapana
0.52
-------
0.48
antMatchers
0.48
pheric
0.46
aume
0.46
prar
0.45
γη
0.45
тельстве
0.45
Amos
0.45
sī
0.44
Activations Density 0.024%