INDEX
Explanations
phrases related to intentions or purposes
phrases indicating intent or purpose
New Auto-Interp
Negative Logits
writ
-0.68
wrote
-0.65
Steps
-0.61
millenn
-0.61
prints
-0.59
standing
-0.59
Printed
-0.59
Binary
-0.58
Offline
-0.57
Debor
-0.57
POSITIVE LOGITS
appease
1.43
alleviate
1.26
lessen
1.22
relieve
1.20
maximize
1.18
satisfy
1.18
avoid
1.16
bolster
1.16
intimidate
1.15
discourage
1.15
Activations Density 0.204%