INDEX
Explanations
phrases indicating a desire or intent to do something
the repeated use of the verb "be" in various contexts
Negative Logits
PsyNetMessage  -0.71
fray           -0.69
suffice        -0.67
iasco          -0.66
pedia          -0.65
crumble        -0.64
converge       -0.64
plings         -0.63
strous         -0.63
spill          -0.63
Positive Logits
able           1.08
friends        0.98
acons          0.88
honest         0.87
getting        0.87
thankful       0.87
AUT            0.85
hemoth         0.84
mo             0.81
orc            0.80
Activations Density: 0.180%