INDEX
Explanations
emotional expressions and sentiments related to personal experiences
New Auto-Interp
Negative Logits
esti
-0.16
avou
-0.15
Making
-0.14
elp
-0.14
ickest
-0.14
Creating
-0.14
making
-0.14
Creating
-0.14
creating
-0.13
Building
-0.13
POSITIVE LOGITS
hearing
0.73
seeing
0.57
Hearing
0.52
watching
0.51
reading
0.49
listening
0.48
seeing
0.45
viewing
0.44
learning
0.40
witnessing
0.39
Activations Density 0.863%