INDEX
Explanations
references to municipalities and religious contexts
New Auto-Interp
Negative Logits
oned
-0.19
ÏĥειÏĤ
-0.15
ored
-0.15
enÃŃ
-0.15
tiết
-0.15
olle
-0.15
amaño
-0.15
kám
-0.14
aned
-0.14
urre
-0.14
POSITIVE LOGITS
io
0.34
ium
0.33
ius
0.32
ios
0.29
ious
0.25
ie
0.25
iu
0.24
ije
0.23
IO
0.23
iev
0.22
Activations Density 0.066%