INDEX
Explanations
equality and inequality comparisons
New Auto-Interp
Negative Logits
’
-0.70
er
-0.67
[`
-0.60
es
-0.58
rizio
-0.56
[`
-0.55
Aufg
-0.53
لاثة
-0.53
Pader
-0.52
Rolf
-0.52
POSITIVE LOGITS
==
1.70
]==
1.35
']==
1.26
")==
1.24
)==
1.19
!=
1.12
================
1.09
()==
1.06
===
1.06
==
1.04
Activations Density 0.059%