INDEX
Explanations
personal pronouns indicating groups of people
pronouns indicating groups of people and their experiences
New Auto-Interp
Negative Logits
Contents
-0.73
appellant
-0.72
âĢ
-0.69
³³³³
-0.67
�
-0.63
!!
-0.62
~~~~
-0.62
..
-0.59
OME
-0.58
Letter
-0.58
POSITIVE LOGITS
'll
1.79
're
1.64
'd
1.55
've
1.52
'm
1.31
ain
1.07
's
1.03
hasn
1.01
't
0.99
shouldn
0.92
Activations Density 0.643%