INDEX
Explanations
references to Italy and Italian culture
New Auto-Interp
Negative Logits
cientos
-0.61
dersfield
-0.59
croix
-0.58
morgon
-0.56
Hochspringen
-0.56
vaih
-0.56
</blockquote>
-0.56
hombros
-0.55
puol
-0.54
x
-0.54
POSITIVE LOGITS
Italy
1.61
Italians
1.55
Italian
1.47
Italy
1.40
italy
1.40
Itali
1.37
italian
1.32
Italien
1.29
Italie
1.28
Italian
1.26
Activations Density 0.041%