INDEX
Explanations
references to Italian culture, traditions, and cuisine
New Auto-Interp
Negative Logits
Monfieur
-1.06
basicConfig
-1.06
Houſe
-0.96
myſelf
-0.93
Theſe
-0.92
againſt
-0.92
LookAnd
-0.91
AndEndTag
-0.89
يتيمه
-0.89
Efq
-0.89
POSITIVE LOGITS
Italy
0.90
Italien
0.81
Итали
0.81
Italy
0.80
Italian
0.79
italien
0.74
Italie
0.74
Itália
0.73
Itali
0.73
Italia
0.73
Activations Density 0.428%