Explanations
proper nouns and locations
Negative Logits
Token          Logit
Sins           -0.71
Noir           -0.69
Preferred      -0.65
Scarlet        -0.63
Emin           -0.62
Attribution    -0.61
Ivory          -0.59
backer         -0.58
Reson          -0.57
CPC            -0.57
Positive Logits
Token          Logit
prising        1.32
seless         1.25
pperc          1.17
nexpected      1.15
berman         1.10
pees           1.07
mber           1.07
pee            1.05
gly            1.03
pport          1.01
Activations Density: 3.324%