INDEX
Explanations
phrases indicating uncertainty or speculation, often starting with "Who knows"
expressions of uncertainty or curiosity
New Auto-Interp
Negative Logits
ItemTracker
-0.77
ciating
-0.77
phrine
-0.76
herent
-0.66
inance
-0.66
etsk
-0.64
REM
-0.62
ructose
-0.61
packages
-0.61
charges
-0.61
POSITIVE LOGITS
how
0.70
darn
0.70
fri
0.67
how
0.67
whats
0.65
scen
0.64
srfAttach
0.63
geop
0.61
lege
0.61
ãĤ½
0.60
Activations Density 0.030%