INDEX
Explanations
actions involving raising or lifting objects or limbs
New Auto-Interp
Negative Logits
ahoo
-0.18
onis
-0.16
prime
-0.15
igure
-0.14
Herr
-0.14
bard
-0.14
nerves
-0.13
brains
-0.13
brain
-0.13
acho
-0.13
POSITIVE LOGITS
.raise
0.22
Raises
0.20
.scalablytyped
0.20
arms
0.20
raised
0.19
Raise
0.19
raise
0.19
arms
0.19
raises
0.19
raised
0.18
Activations Density 0.020%