INDEX
Explanations
terms that indicate measurement or evaluation
New Auto-Interp
Negative Logits
ãĥ¼ãĥijãĥ¼
-0.15
ç§°
-0.15
omu
-0.15
spar
-0.14
Collections
-0.14
.cgi
-0.14
ipse
-0.14
sty
-0.14
Sabbath
-0.14
Sabb
-0.13
POSITIVE LOGITS
.tm
0.16
OKIE
0.16
utex
0.15
assen
0.15
nett
0.15
Ïģιά
0.14
ickle
0.14
_anchor
0.14
大åħ¨
0.14
leccion
0.13
Activations Density 0.048%