INDEX
Negative Logits
ahu
-0.58
amaz
-0.55
inth
-0.55
cised
-0.54
hea
-0.54
requisite
-0.53
scar
-0.52
icka
-0.51
marks
-0.51
dylib
-0.50
POSITIVE LOGITS
desperately
1.10
unsuccessfully
1.02
to
0.96
harder
0.93
hard
0.87
valiant
0.87
vain
0.78
frantically
0.76
toget
0.74
hard
0.73
Activations Density 0.054%