INDEX
Explanations
references to Italian identity or culture
New Auto-Interp
Negative Logits
titolata
-0.72
isolato
-0.64
coscienza
-0.63
tetto
-0.62
decorazione
-0.60
vostri
-0.57
peccato
-0.56
spagno
-0.54
sonno
-0.54
parete
-0.51
POSITIVE LOGITS
Italian
1.95
italian
1.79
Italian
1.74
Italy
1.73
Italians
1.71
Italia
1.63
Italy
1.62
イタリア
1.59
italian
1.55
意大利
1.52
Activations Density 0.644%