INDEX
Explanations
negative descriptions directed at a specific person
negative attributes or criticisms associated with entities or situations
Negative Logits
bounce      -0.87
square      -0.77
drawer      -0.74
hiber       -0.74
drink       -0.73
fright      -0.71
breathing   -0.70
sep         -0.69
context     -0.69
breat       -0.69
Positive Logits
FM          1.38
LP          1.22
derived     1.21
II          1.20
III         1.19
DL          1.16
SN          1.15
RN          1.14
related     1.13
induced     1.12
Activations Density: 0.067%