INDEX
Explanations
opening parentheses followed by numerical values
New Auto-Interp
Negative Logits
(
-0.44
*
-0.34
$
-0.31
_
-0.30
&
-0.30
@
-0.28
%
-0.26
[
-0.25
<
-0.21
"
-0.20
POSITIVE LOGITS
).__
0.20
...)↵
0.19
âĢŀ
0.18
__)
0.18
â̦)
0.17
âīł
0.15
â̦)↵↵
0.15
/*@
0.15
,)↵
0.15
‘
0.15
Activations Density 0.152%