INDEX
Explanations
names, specifically the name "Najib Razak"
proper nouns, specifically names often related to a particular context or narrative
Negative Logits
sonian     -0.90
gling      -0.79
rome       -0.78
ーティ     -0.75
glers      -0.72
Dull       -0.70
Circle     -0.65
gest       -0.64
stone      -0.64
Pioneer    -0.63
Positive Logits
ibaba      0.95
eed        0.86
idav       0.84
irmed      0.83
orthy      0.82
velength   0.81
irms       0.81
poons      0.79
iband      0.77
ee         0.76
Activations Density: 0.047%