INDEX
Explanations
phrases related to future outcomes or consequences
phrases related to outcomes and consequences
Negative Logits
ĸļ          -0.68
tained      -0.66
hess        -0.59
emp         -0.58
²¾          -0.58
oons        -0.57
aq          -0.56
requently   -0.56
ouf         -0.56
aeper       -0.56
Positive Logits
someday      1.15
sooner       1.06
morrow       1.05
hopefully    0.98
Eventually   0.98
Eventually   0.96
Will         0.95
will         0.94
eventually   0.94
soon         0.94
Activations Density 0.516%