INDEX
Explanations
actions and intentions expressed in sentences
New Auto-Interp
Negative Logits
soever
-0.77
transpired
-0.76
pse
-0.75
guiActiveUn
-0.71
precincts
-0.70
ABC
-0.70
permitting
-0.69
Laf
-0.67
formulated
-0.65
ths
-0.65
POSITIVE LOGITS
uate
0.88
someday
0.83
thood
0.82
anything
0.81
ASAP
0.78
ezvous
0.77
dress
0.72
emulate
0.72
unic
0.72
FANTASY
0.70
Activations Density 11.381%