Explanations
specific references to Italian culture or entities
Negative Logits
titolata       -0.65
queſta         -0.63
termica        -0.58
myſelf         -0.57
coscienza      -0.56
decorazione    -0.55
Monfieur       -0.52
Majefty        -0.52
fotografico    -0.52
ACHUSET        -0.51
Positive Logits
Italian          1.83
Italy            1.63
Italian          1.59
Italians         1.52
Italy            1.45
italian          1.45
イタリア         1.45
意大利           1.37
italienischen    1.34
italien          1.31
Activations Density 0.505%