INDEX
Explanations
numbers with or without special symbols
instances of the letter 'l'
New Auto-Interp
Negative Logits
ãĥ¼ãĥĨãĤ£
-0.69
havoc
-0.64
spirited
-0.63
fund
-0.62
BART
-0.62
Satanic
-0.61
Vald
-0.59
Labor
-0.59
Lit
-0.58
household
-0.57
POSITIVE LOGITS
aptop
1.48
ongevity
1.37
ocated
1.36
ayers
1.35
ifestyle
1.34
anguages
1.34
ateral
1.30
ibraries
1.29
ounge
1.26
ubric
1.26
Activations Density 0.033%