INDEX
Explanations
references to the Spanish language or Spain
repeated mentions of the word "Spanish" and related references to Spanish context
New Auto-Interp
Negative Logits
oscope
-0.92
mble
-0.88
aucus
-0.88
onies
-0.83
olulu
-0.83
lessly
-0.81
imus
-0.81
chens
-0.80
allery
-0.79
ochond
-0.76
POSITIVE LOGITS
Inquisition
1.33
Rican
0.88
pes
0.85
Rico
0.83
Spanish
0.83
corrid
0.81
Spanish
0.81
Nieto
0.80
Fork
0.79
Spain
0.78
Activations Density 0.007%