INDEX
Explanations
pronouns indicating unidentified entities or situations
the pronoun "it"
New Auto-Interp
Negative Logits
DAQ
-0.76
ROR
-0.64
ZA
-0.62
IVE
-0.58
IONS
-0.56
pmwiki
-0.55
natureconservancy
-0.54
EMBER
-0.54
UG
-0.54
ALE
-0.53
POSITIVE LOGITS
it
2.01
it
1.04
It
1.04
It
1.00
its
0.95
there
0.86
they
0.85
this
0.84
everything
0.80
you
0.78
Activations Density 0.319%