INDEX
Explanations
mentions of Italian-related terms, including language, people, and locations
repetitions of the word "Italian" and related terms
New Auto-Interp
Negative Logits
\\\\\\\\
-0.88
pir
-0.82
TPPStreamerBot
-0.82
tnc
-0.82
holder
-0.79
ignty
-0.79
tick
-0.78
izons
-0.77
merce
-0.76
bilt
-0.75
POSITIVE LOGITS
Italy
0.89
etta
0.88
otti
0.86
Marino
0.85
zzo
0.83
olini
0.83
Alps
0.83
Sicily
0.83
Giovanni
0.82
Inquisition
0.82
Activations Density 0.016%