INDEX
Explanations
references to personal stories and reflections on identity and acceptance
New Auto-Interp
Negative Logits
GF
-0.70
aut
-0.69
bishop
-0.66
pac
-0.64
zar
-0.63
ACH
-0.63
BD
-0.60
Ķ
-0.60
Detailed
-0.60
afer
-0.59
POSITIVE LOGITS
it
0.74
he
0.69
they
0.69
pandemonium
0.66
however
0.65
she
0.65
untled
0.64
chances
0.64
citing
0.62
enment
0.62
Activations Density 0.165%