INDEX
Explanations
uncommon or unusual occurrences or entities described as "strange"
descriptors related to unusual or bizarre elements
Negative Logits
Token       Logit
($          -0.78
outweigh    -0.72
payer       -0.69
aper        -0.68
provides    -0.68
®           -0.67
Prep        -0.66
Priority    -0.66
buster      -0.65
âĢ          -0.65
Positive Logits
Token           Logit
strange         3.11
weird           2.23
bizarre         2.10
odd             2.05
mysterious      2.01
peculiar        1.99
strangely       1.88
Strange         1.81
unusual         1.80
inexplicable    1.79
Activations Density 0.025%