INDEX
Explanations
future tense verbs and phrases indicating intention or capability
Negative Logits
uien        -0.15
themselves  -0.15
缼          -0.15
ιλο         -0.15
ibal        -0.14
eso         -0.14
icus        -0.14
bothers     -0.14
ð           -0.14
bothering   -0.14
Positive Logits
notice      0.36
find        0.29
notice      0.27
Notice      0.27
Notice      0.26
notices     0.25
want        0.25
noticed     0.24
hear        0.23
note        0.22
Activations Density: 0.069%