INDEX
Explanations
mathematical symbols and operations in equations
New Auto-Interp
Negative Logits
muſt
-0.59
Bake
-0.57
Arno
-0.55
}$
-0.54
Früchte
-0.54
dono
-0.54
karna
-0.53
Winslow
-0.53
Misa
-0.53
torino
-0.51
POSITIVE LOGITS
}+
1.07
>+</
1.06
}+\
1.05
)+\
1.03
()+
0.99
&+
0.96
)}+
0.91
)}+\
0.89
+
0.89
|+
0.89
Activations Density 0.653%