INDEX
Explanations
phrases indicating preference or desirability
expressions indicating hypothetical or conditional situations
New Auto-Interp
Negative Logits
anwhile
-0.65
ãĤ¨ãĥ«
-0.63
sometimes
-0.61
culosis
-0.61
rising
-0.60
iop
-0.59
Commando
-0.57
PsyNetMessage
-0.57
idem
-0.56
Kurd
-0.56
POSITIVE LOGITS
unthinkable
1.09
preferable
1.06
impossible
1.03
impractical
0.96
prohib
0.95
folly
0.93
laughable
0.93
foolish
0.92
appreciated
0.91
nice
0.91
Activations Density 0.105%