INDEX
Explanations
mathematical expressions involving inequalities and algebraic functions
New Auto-Interp
Negative Logits
?f
-0.18
lesc
-0.17
?-
-0.17
?=
-0.16
plat
-0.16
leet
-0.15
ková
-0.15
bour
-0.14
?
-0.14
ambi
-0.14
POSITIVE LOGITS
)^
0.45
]^
0.32
).^
0.26
|^
0.24
))^
0.24
)?↵
0.21
)?↵↵
0.19
)**
0.19
)?
0.18
)?;↵
0.18
Activations Density 0.114%