INDEX
Negative Logits
becomes
-0.07
receptive
-0.07
921
-0.07
these
-0.07
curso
-0.06
912
-0.06
ζό
-0.06
Captain
-0.06
mlad
-0.06
criticism
-0.06
POSITIVE LOGITS
/router
0.07
Vì
0.06
eget
0.06
اینچ
0.06
}?>↵
0.06
.swing
0.06
hr
0.06
<textarea
0.06
mỹ
0.05
Expr
0.05
Activations Density 0.003%