INDEX
Explanations
numerical measurements and specifications
New Auto-Interp
Negative Logits
AZE
-0.16
Rowe
-0.15
rios
-0.15
Weiss
-0.14
achuset
-0.14
>NN
-0.14
rych
-0.14
abbit
-0.14
Dick
-0.13
aze
-0.13
POSITIVE LOGITS
"
0.23
",
0.21
"↵
0.17
VL
0.16
â̳
0.16
&q
0.16
")
0.15
".
0.15
”
0.15
";
0.15
Activations Density 0.018%