INDEX
Explanations
instances of the word "Phoenicians"
names or terms associated with specific locations or populations
New Auto-Interp
Negative Logits
izoph
-0.80
Ĥª
-0.73
Pwr
-0.71
HAEL
-0.67
Ambro
-0.64
Manila
-0.64
RED
-0.63
ENTS
-0.62
Jet
-0.62
dos
-0.62
POSITIVE LOGITS
vironment
1.48
zyme
1.06
esis
1.02
viron
0.91
vironments
0.88
uve
0.86
isance
0.85
pard
0.85
cedes
0.81
itent
0.81
Activations Density 0.019%