INDEX
Explanations
punctuation marks or periods in text
New Auto-Interp
Negative Logits
edium
-0.16
kola
-0.14
lus
-0.14
inya
-0.14
usch
-0.13
oke
-0.13
efeller
-0.13
abad
-0.13
ARB
-0.13
bourne
-0.13
POSITIVE LOGITS
prec
0.15
Lindsay
0.15
åİļ
0.14
exo
0.14
Prec
0.14
expo
0.14
yu
0.14
iku
0.13
Ivanka
0.13
itir
0.13
Activations Density 0.031%