INDEX
Explanations
emotional and positive expressions related to experiences
New Auto-Interp
Negative Logits
_by
-0.15
bes
-0.15
inae
-0.14
Which
-0.14
inis
-0.14
ByName
-0.14
orsi
-0.13
uby
-0.13
WithContext
-0.13
ãģ«ãĤĪãĤĭ
-0.13
POSITIVE LOGITS
how
0.31
hearing
0.31
to
0.30
knowing
0.27
having
0.26
being
0.25
watching
0.25
that
0.24
seeing
0.23
when
0.21
Activations Density 0.100%