INDEX
Explanations
phrases indicating advice or recommendation
instances of the word "advised" and related terms indicating recommendations or suggestions
New Auto-Interp
Negative Logits
neath
-0.65
animate
-0.64
spark
-0.61
mismatch
-0.60
streak
-0.60
access
-0.59
attm
-0.59
bang
-0.58
holes
-0.58
tie
-0.58
POSITIVE LOGITS
advised
3.71
advises
2.33
warned
1.92
advise
1.92
recommended
1.90
cautioned
1.84
urged
1.82
advising
1.79
instructed
1.74
advisable
1.69
Activations Density 0.007%