INDEX
Explanations
names of, or references to, individuals, particularly those in positions of authority or recognition
Negative Logits
Token         Logit
terday        -0.71
Admir         -0.69
chnology      -0.68
IST           -0.65
aboriginal    -0.64
Commodore     -0.63
isance        -0.60
whales        -0.60
TODAY         -0.60
underwater    -0.59
Positive Logits
Token         Logit
bons          1.02
bles          1.00
bled          1.00
endum         0.99
bling         0.98
bage          0.97
bent          0.95
bett          0.93
shaw          0.92
bler          0.90
Activations Density 0.002%