Explanations
questions posed by 'Do we' or 'Do you'
questions about personal desires and collective actions
Negative Logits
Token           Logit
edIn            -0.69
Flavoring       -0.68
itized          -0.67
Vict            -0.65
ãĤ¦ãĤ¹          -0.65
Conversation    -0.64
srfAttach       -0.64
Awakening       -0.64
uces            -0.64
Integrity       -0.64
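The token `ãĤ¦ãĤ¹` in the negative-logits list looks like a byte-level BPE vocabulary entry, where each raw byte is shown as a printable stand-in character. A minimal sketch of turning it back into text follows, assuming the model uses the GPT-2 style byte-to-character mapping (the page itself does not name the tokenizer). The helper names `byte_level_alphabet` and `decode_token` are hypothetical.

```python
def byte_level_alphabet():
    """Map each printable stand-in character back to its raw byte value.

    Assumption: this mirrors the GPT-2 byte-level BPE convention, where
    printable Latin-1 bytes map to themselves and all remaining bytes are
    shifted to code points starting at 256.
    """
    keep = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = keep[:]
    code_points = keep[:]
    shifted = 0
    for b in range(256):
        if b not in keep:
            byte_values.append(b)
            code_points.append(256 + shifted)
            shifted += 1
    return {chr(c): b for c, b in zip(code_points, byte_values)}


def decode_token(token):
    """Convert a vocabulary string to the UTF-8 text it stands for."""
    table = byte_level_alphabet()
    raw = bytes(table[ch] for ch in token)
    # errors="replace" because a single token may hold a partial character.
    return raw.decode("utf-8", errors="replace")


print(decode_token("ãĤ¦ãĤ¹"))  # prints ウス
```

Under that assumption the entry is the katakana pair ウス rather than encoding damage, so it is listed as it appears in the vocabulary.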
Positive Logits
Token           Logit
intend          0.93
think           0.92
deserve         0.90
owe             0.88
presume         0.87
imply           0.86
reckon          0.86
look            0.85
derive          0.85
follow          0.85
Activations Density: 0.085%
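For orientation, activation density is usually read as the fraction of sampled tokens on which the feature fires at all. The sketch below assumes that definition and uses made-up activations chosen only to reproduce the reported figure. The function name `activation_density` and the `threshold` parameter are hypothetical, not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold.

    Assumption: a token counts as active when its activation is strictly
    greater than `threshold` (zero by default).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Illustrative data only: 17 active tokens out of 20,000 sampled.
sample = [0.0] * 20000
for i in range(17):
    sample[i * 1000] = 1.0

density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low means the feature is sparse: it is silent on nearly every token and fires only in a narrow set of contexts, such as the question patterns listed under Explanations.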