Explanations
personal pronouns and expressions of personal opinion or feelings
Negative Logits
λÏį         -0.14
ajs         -0.14
uzu         -0.13
åħ¹         -0.13
amburger    -0.13
Potential   -0.13
//{{        -0.13
umbs        -0.13
uster       -0.13
tane        -0.13
Positive Logits
feeling        0.27
impression     0.27
experience     0.24
gut            0.24
understanding  0.23
colleague      0.21
experiences    0.21
concern        0.20
observation    0.20
own            0.20
Activations Density 0.156%