INDEX
Negative Logits
glorious
-0.92
its
-0.91
fourteen
-0.88
theres
-0.88
improves
-0.87
存于
-0.85
fifteen
-0.83
those
-0.83
through
-0.82
without
-0.82
POSITIVE LOGITS
prepare
0.90
dette
0.90
dessa
0.88
Lecce
0.85
CERTAIN
0.84
MINIMUM
0.82
焓
0.82
'")
0.82
tete
0.82
ऐसे
0.82
Activations Density 0.002%