INDEX
Explanations
references to Portuguese culture and people
New Auto-Interp
Negative Logits
ANNEL
-0.15
ála
-0.14
amat
-0.14
EDA
-0.14
YM
-0.14
anos
-0.13
Ñij
-0.13
ATRIX
-0.13
äd
-0.13
Kir
-0.13
POSITIVE LOGITS
Portuguese
0.18
iku
0.15
ires
0.15
Rua
0.15
Lisbon
0.15
onnen
0.14
imoto
0.14
isease
0.14
eprom
0.14
iu
0.14
Activations Density 0.110%