INDEX
Explanations
names of specific geographical locations
names and terms associated with specific historical figures and locations
New Auto-Interp
Negative Logits
uality
-0.82
istant
-0.81
hematically
-0.80
ère
-0.79
hered
-0.77
wo
-0.77
arians
-0.75
agen
-0.75
ory
-0.74
ancy
-0.74
POSITIVE LOGITS
vernment
0.90
Lumpur
0.83
emouth
0.82
Weasley
0.80
bably
0.75
bom
0.73
bang
0.70
redes
0.70
boom
0.67
leaflets
0.67
Activations Density 0.019%