INDEX
Explanations
phrases related to expressing intentions or decisions
subjects and their actions or intentions related to communication
New Auto-Interp
Negative Logits
obscurity
-0.65
Clever
-0.64
hovah
-0.62
Reporting
-0.62
newcom
-0.60
Translation
-0.59
Citation
-0.58
Pse
-0.58
cial
-0.56
YP
-0.56
POSITIVE LOGITS
intends
1.43
regretted
1.26
intend
1.25
wants
1.23
would
1.18
wanted
1.15
'd
1.14
intention
1.13
expects
1.13
'll
1.12
Activations Density 0.278%