Negative Logits
hus      -0.72
ensing   -0.71
erers    -0.71
warts    -0.68
theless  -0.66
bery     -0.66
eree     -0.65
serv     -0.64
axter    -0.63
ering    -0.63
Positive Logits
Operation  0.94
Twist      0.83
Operation  0.81
Glad       0.74
Arrow      0.74
olon       0.74
Bravo      0.67
Pillar     0.66
ctica      0.66
III        0.65
Activations Density 5.025%