INDEX
Explanations
proper nouns related to specific locations, organizations, and individuals
proper nouns related to specific locations and names
Negative Logits
Vulcan      -0.71
estinal     -0.69
tails       -0.66
matic       -0.66
Africans    -0.65
fishes      -0.64
Grail       -0.63
Viking      -0.63
aredevil    -0.62
iodine      -0.62
Positive Logits
Bernardino  0.86
_>          0.84
zai         0.80
REDACTED    0.80
cade        0.78
olid        0.78
olic        0.77
ombies      0.76
icio        0.75
icans       0.75
Activations Density: 0.026%