INDEX
Explanations
mentions of specific entities or people
instances of the letter "A" at the beginning of phrases or sentences
Negative Logits
Token         Logit
apes          -0.64
stripes       -0.63
appointments  -0.63
disabilities  -0.63
rebounds      -0.62
noses         -0.62
unconscious   -0.62
lies          -0.61
oids          -0.61
cylinders     -0.60
Positive Logits
Token         Logit
chieve        1.45
ircraft       1.40
uckland       1.38
gency         1.34
ctions        1.27
ctors         1.26
usterity      1.24
erial         1.24
UTH           1.24
ffect         1.21
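
The top positive-logit tokens look like word fragments that become common words once a capital "A" is placed in front of them, which would fit the explanation about the letter "A" at the start of phrases or sentences. The minimal sketch below only illustrates that reading; the helper name `complete_with_a` is hypothetical, and the interpretation is an assumption drawn from the listed tokens, not something the page states.

```python
# Top positive-logit tokens, copied from the listing above.
positive_logit_tokens = [
    "chieve", "ircraft", "uckland", "gency", "ctions",
    "ctors", "usterity", "erial", "UTH", "ffect",
]


def complete_with_a(fragment: str) -> str:
    """Prepend a capital 'A' to a token fragment (hypothetical helper)."""
    return "A" + fragment


# Assumption: the feature fires on a leading "A" and promotes its continuation.
completed = [complete_with_a(token) for token in positive_logit_tokens]

for fragment, word in zip(positive_logit_tokens, completed):
    print(f"{fragment!r:>12} -> {word}")
```

Each fragment resolves to a plausible word or prefix (for example "chieve" to "Achieve" and "uckland" to "Auckland"), which supports the reading but does not confirm it.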
Activations Density: 0.078%