INDEX
Explanations
names or words related to people or places
names and terms related to specific geographic or fictional locations
New Auto-Interp
Negative Logits
rooting
-0.69
stump
-0.68
provision
-0.64
unsett
-0.64
morbid
-0.63
metic
-0.62
CRC
-0.62
departing
-0.62
bailed
-0.60
avez
-0.59
POSITIVE LOGITS
gard
1.27
ner
1.07
gart
1.07
nings
0.96
ners
0.93
geist
0.93
ens
0.90
ener
0.88
ians
0.88
ian
0.87
Activations Density 0.017%