INDEX
Explanations
names of people
It identifies prominent names associated with specific historical or cultural contexts.
Negative Logits
awar      -0.84
ochond    -0.75
ancies    -0.75
addons    -0.74
orks      -0.73
urst      -0.73
tml       -0.72
umbn      -0.72
ears      -0.71
wich      -0.70
Positive Logits
Enrique     1.14
Gomez       1.11
Martinez    1.09
Garcia      1.07
Antonio     1.06
Ramos       1.02
Rodriguez   1.02
Rico        1.01
Suarez      1.01
Carlos      1.01
Activations Density 0.051%
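
The density figure above is commonly read as the fraction of sampled tokens on which the feature fires. The sketch below shows one way such a number could be computed, assuming that "active" means an activation strictly above a threshold of zero. The function name `activation_density`, the threshold, and the toy data are all hypothetical and are not taken from this page or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold. The page does not state its exact definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token activations, 5 of them nonzero.
sample = [0.0] * 9995 + [0.8, 1.2, 0.3, 2.1, 0.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

With this toy sample, 5 of 10,000 tokens are active, which is close to the order of magnitude reported above.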