INDEX
Explanations
references to Italian institutions or individuals
Negative Logits
apult       -0.15
)(__        -0.14
Drum        -0.14
gere        -0.14
ople        -0.14
velt        -0.14
AMPLE       -0.13
vÄĽt        -0.13
ominator    -0.13
uesday      -0.13
Positive Logits
uler        0.16
icus        0.14
third       0.14
âijł        0.14
lider       0.14
iset        0.14
Äįan        0.13
second      0.13
Sans        0.13
↵           0.13
Activations Density 0.023%