INDEX
Negative Logits
duct
-0.72
kamp
-0.70
wark
-0.69
dies
-0.69
clud
-0.67
adium
-0.67
igans
-0.67
ulator
-0.66
Ó
-0.66
duction
-0.65
POSITIVE LOGITS
entimes
1.14
importantly
1.09
unsurprisingly
1.07
nown
1.06
surprisingly
1.01
withstanding
0.95
etheless
0.86
ironic
0.83
interestingly
0.81
majorities
0.80
Activations Density 0.138%