INDEX
Explanations
comparisons and equality checks in code
New Auto-Interp
Negative Logits
.」
-0.40
ipelago
-0.37
PDC
-0.36
linger
-0.36
DHS
-0.36
D
-0.35
byshire
-0.35
ponemos
-0.34
ukov
-0.34
finder
-0.33
POSITIVE LOGITS
==
1.84
===
1.31
==
1.31
]==
1.28
()==
1.22
)==
1.21
']==
1.15
!=
1.06
==-
1.06
==$
0.98
Activations Density 0.085%