INDEX
Explanations
narrative elements related to personal experiences and actions
New Auto-Interp
Negative Logits
ittal
-0.15
plib
-0.14
çĽijåIJ¬é¡µéĿ¢
-0.14
ingham
-0.14
agues
-0.14
beden
-0.14
hasher
-0.14
zych
-0.14
anlı
-0.14
(íģ¬ê¸°
-0.14
POSITIVE LOGITS
holding
0.30
hold
0.29
press
0.27
touch
0.27
touching
0.26
Holding
0.25
pressing
0.25
press
0.24
touched
0.24
push
0.24
Activations Density 0.674%