Explanations
the pronoun "she" in various contexts
Negative Logits
natureconservancy    -0.87
[undecodable bytes]  -0.69
EGIN                 -0.64
[undecodable bytes]  -0.64
actionDate           -0.64
Brenda               -0.62
udeb                 -0.62
OUND                 -0.62
[undecodable bytes]  -0.62
chromosome           -0.61
Positive Logits
clears               0.77
flies                0.76
rises                0.74
adds                 0.74
climbs               0.74
smokes               0.72
raises               0.72
carries              0.72
identifies           0.71
travels              0.70
Activations Density: 0.133%