INDEX
Explanations
instructions and descriptions related to physical positioning and arrangement
New Auto-Interp
Negative Logits
lict
-0.16
Branch
-0.15
obox
-0.15
Blonde
-0.15
am
-0.15
omba
-0.14
352
-0.14
,
-0.14
Kou
-0.14
256
-0.14
POSITIVE LOGITS
orientation
0.38
orientations
0.35
Orientation
0.35
orientation
0.32
Orientation
0.30
orient
0.27
Orient
0.26
_orientation
0.26
orient
0.26
facing
0.25
Activations Density 0.168%