INDEX
Explanations
names of individuals or places
proper nouns and names of individuals or entities
New Auto-Interp
Negative Logits
erella
-0.66
reviewed
-0.63
ateurs
-0.61
glers
-0.60
WT
-0.60
nesday
-0.60
anchester
-0.59
ruary
-0.58
called
-0.57
Egyptians
-0.57
POSITIVE LOGITS
ibrary
0.79
ĪĴ
0.74
Pwr
0.67
adi
0.65
è£ıè
0.65
nih
0.64
intel
0.64
zu
0.64
fen
0.61
schild
0.60
Activations Density 0.886%