INDEX
Explanations
numerical data and comparisons
Negative Logits
trio     -0.15
μβ       -0.14
Vš       -0.14
BOSE     -0.14
none     -0.14
theirs   -0.14
THREE    -0.13
кут      -0.13
ycler    -0.13
``(      -0.13
Positive Logits
Both     0.34
both     0.33
both     0.33
Both     0.32
BOTH     0.31
ambos    0.28
_both    0.27
beide    0.26
_BOTH    0.22
обо      0.22
Activations Density 0.125%