INDEX
Explanations
phrases indicating a small amount or degree of something
New Auto-Interp
Negative Logits
ed
-0.19
ftware
-0.17
nt
-0.16
hi
-0.15
somewhat
-0.15
hs
-0.15
leo
-0.14
δί
-0.14
hiro
-0.14
ho
-0.14
POSITIVE LOGITS
umen
0.29
/stdc
0.26
.ly
0.26
mapped
0.25
Torrent
0.21
umin
0.20
rary
0.19
tern
0.18
more
0.18
bit
0.18
Activations Density 0.019%