INDEX
Explanations
negative contractions and phrases indicating limitation or disapproval
Contractions and numbers
numbers followed by 's or numbers
New Auto-Interp
Negative Logits
typelib
-0.76
DoubleQuotes
-0.65
:✨
-0.60
sizeCache
-0.59
konomi
-0.58
kháu
-0.58
SIMBAD
-0.57
那个
-0.57
astore
-0.55
الحره
-0.55
POSITIVE LOGITS
ſelf
0.65
fernández
0.60
GIPHY
0.60
ſelves
0.59
―――――
0.58
་་
0.56
managing
0.56
Portale
0.56
DEG
0.55
kiin
0.55
Activations Density 0.347%