INDEX
Explanations
the pronoun "it" referring to a previously mentioned subject
the word "it" in various contexts
Negative Logits
Priv         -0.61
evil         -0.59
hips         -0.59
Highest      -0.58
quist        -0.58
=>           -0.58
Buck         -0.58
TED          -0.58
Unlimited    -0.58
ears         -0.57
Positive Logits
unes         1.08
asca         1.03
chy          1.00
ueller       0.96
self         0.95
alian        0.93
zbollah      0.91
zik          0.87
chwitz       0.86
ÃĥÃĤ         0.85
Activations Density: 0.249%