INDEX
Explanations
mentions of a female pronoun 'she'
mentions of the word 'she' in various forms
New Auto-Interp
Negative Logits
constit
-0.70
ument
-0.59
iso
-0.57
Techn
-0.57
firsthand
-0.56
eleven
-0.56
Jr
-0.56
Circuit
-0.55
Yiannopoulos
-0.55
ANCE
-0.55
POSITIVE LOGITS
pher
1.62
ffield
1.51
lled
1.46
ppard
1.44
pherd
1.43
pard
1.42
athed
1.38
ldon
1.30
ikh
1.26
athing
1.25
Activations Density 0.045%