INDEX
Explanations
keywords denoting a difference or unique characteristic
the word "also."
New Auto-Interp
Negative Logits
enough
-0.71
control
-0.65
recovery
-0.63
savings
-0.63
management
-0.63
close
-0.62
values
-0.60
cap
-0.60
access
-0.59
arm
-0.59
POSITIVE LOGITS
also
3.13
sometimes
1.82
often
1.67
formerly
1.53
actually
1.48
along
1.46
both
1.43
usually
1.42
again
1.37
literally
1.33
Activations Density 0.016%