INDEX
Explanations
mathematical equations and notation
New Auto-Interp
Negative Logits
Base
-0.21
Body
-0.20
(Base
-0.19
Base
-0.19
Brick
-0.19
Beh
-0.18
Branch
-0.18
Bel
-0.18
Body
-0.18
Block
-0.17
POSITIVE LOGITS
-b
0.79
Âłb
0.72
b
0.64
+b
0.64
*b
0.63
=b
0.63
/b
0.62
.b
0.61
,b
0.60
:b
0.60
Activations Density 1.017%