INDEX
Explanations
questions directed at the reader about their situation or actions
New Auto-Interp
Negative Logits
leck
-0.16
ania
-0.16
onus
-0.15
lett
-0.15
aisal
-0.15
ONO
-0.15
chalk
-0.14
\Id
-0.14
Lear
-0.14
ados
-0.13
POSITIVE LOGITS
considering
0.28
ready
0.27
thinking
0.25
Ready
0.24
READY
0.23
Considering
0.22
Considering
0.21
Ready
0.21
looking
0.20
interes
0.20
Activations Density 0.083%