Explanations
phrases related to being content, accepting, or having found something helpful
phrases indicating action or intent related to continuity and effort
Negative Logits
xus            -0.68
arger          -0.67
Difference     -0.65
Þ              -0.64
DAQ            -0.63
Means          -0.63
Changing       -0.62
MpServer       -0.61
larg           -0.61
significant    -0.61
Positive Logits
bask           1.35
stare          1.22
ignore         1.22
shrug          1.22
pretend        1.19
moan           1.17
wander         1.15
grin           1.14
wait           1.12
concentrate    1.11
Activations Density 0.351%
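
The page does not define the density figure. A minimal sketch, assuming it means the fraction of sampled token positions on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the page:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the activation exceeds threshold.

    Assumption: "density" is the share of sampled tokens on which the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 1000 token positions, with the feature firing on 4 of them.
acts = [0.0] * 1000
for i in (3, 250, 611, 987):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.400%
```

Under that reading, a density of 0.351% would correspond to the feature firing on roughly 35 of every 10,000 sampled tokens.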