INDEX
Explanations
names of individuals and locations
proper nouns, particularly names of people
New Auto-Interp
Negative Logits
querque
-0.74
prostate
-0.65
idated
-0.63
ãĥIJ
-0.61
Magikarp
-0.61
heddar
-0.61
è¦ļéĨĴ
-0.61
ogun
-0.60
emetery
-0.59
newcom
-0.59
POSITIVE LOGITS
herself
1.90
Devi
1.13
she
0.99
eva
0.93
gigg
0.93
her
0.92
miscar
0.92
She
0.90
she
0.90
Kardashian
0.89
Activations Density 0.263%