INDEX
Explanations
negations related to an action or event
negative statements concerning the existence or inclusion of certain elements or conditions
New Auto-Interp
Negative Logits
restless
-0.75
unlucky
-0.73
watching
-0.72
Watching
-0.71
glad
-0.68
Mayhem
-0.68
minds
-0.66
hobbies
-0.66
rolled
-0.64
lucky
-0.63
POSITIVE LOGITS
contain
1.67
include
1.48
involve
1.48
encompass
1.40
incorporate
1.37
entail
1.28
Include
1.21
include
1.20
reflect
1.19
depict
1.18
Activations Density 0.201%