INDEX
Explanations
words or phrases related to noticing or being aware of something
instances of perception and awareness-related actions
New Auto-Interp
Negative Logits
ioxide
-0.74
ActionCode
-0.70
terness
-0.69
subur
-0.68
Ý
-0.68
pione
-0.67
é¾įå¥
-0.67
£ı
-0.67
Ą
-0.67
-0.67
POSITIVE LOGITS
us
0.99
yours
0.95
me
0.89
your
0.88
you
0.83
theirs
0.82
ours
0.82
hers
0.82
SPONSORED
0.73
my
0.72
Activations Density 0.377%