INDEX
Explanations
structural elements and organization in technical descriptions
Negative Logits
heads    -0.15
rise     -0.14
elib     -0.14
],[-     -0.13
ÑĪин     -0.13
ze       -0.13
udu      -0.13
Heads    -0.13
oxid     -0.13
izard    -0.12
Positive Logits
bottom    1.13
Bottom    0.99
bottom    0.96
Bottom    0.93
BOTTOM    0.86
-bottom   0.85
_bottom   0.81
bottoms   0.79
.bottom   0.75
BOTTOM    0.75
Activations Density 0.129%