INDEX
Negative Logits
itol        -0.77
oling       -0.66
unin        -0.64
pointers    -0.63
etheless    -0.63
oche        -0.61
untarily    -0.59
iqu         -0.59
chev        -0.58
ov          -0.58
Positive Logits
Logged      0.75
Lastly      0.66
Who         0.65
Explain     0.64
What        0.63
Has         0.61
Does        0.59
yss         0.59
Would       0.59
Previous    0.58
Activations Density 10.039%