INDEX
Explanations
phrases indicating distance or measurements in terms of "at least."
New Auto-Interp
Negative Logits
ite
-0.15
0
-0.14
opt
-0.14
hab
-0.14
bis
-0.14
itz
-0.14
мени
-0.14
47
-0.14
es
-0.14
UEST
-0.14
POSITIVE LOGITS
loor
0.18
utor
0.17
/gin
0.17
inyin
0.16
onomous
0.15
ecz
0.15
eyn
0.14
ominator
0.14
PIO
0.14
#
0.14
Activations Density 0.091%