INDEX
Explanations
proper nouns or names ending in 'ras'
references to a specific term or identifier related to a group's name or individuals
New Auto-Interp
Negative Logits
Yankee
-0.71
tails
-0.70
atures
-0.66
tains
-0.65
glers
-0.65
ĵ
-0.65
Holmes
-0.64
rica
-0.62
Grimes
-0.58
rican
-0.57
POSITIVE LOGITS
sembly
1.00
vati
0.99
hea
0.97
irez
0.97
xual
0.96
ources
0.96
arin
0.95
bian
0.93
ylum
0.92
hens
0.92
Activations Density 0.017%