INDEX
Explanations
references to Italy and Italian culture or cuisine
Negative Logits
Token            Logit
cientos          -0.70
xuan             -0.64
mentaux          -0.62
Poppy            -0.60
</blockquote>    -0.60
Schrader         -0.60
Polskiego        -0.59
Evan             -0.59
udu              -0.59
قيقة             -0.58
Positive Logits
Token            Logit
Italy            1.49
Italie           1.37
Italians         1.36
Italy            1.36
italy            1.33
Itali            1.30
Italian          1.29
ITALY            1.24
Italien          1.19
Italie           1.18
Activations Density: 0.084%