INDEX
Explanations
mentions of the pronoun "It" at the beginning of the text
instances of the word "It"
Negative Logits
Token          Logit
Eighth         -0.67
Dayton         -0.61
Annotations    -0.60
NX             -0.60
hips           -0.59
Tur            -0.59
PUBLIC         -0.59
itatively      -0.55
Jub            -0.55
Institution    -0.54
Positive Logits
Token          Logit
zbollah        1.32
unes           1.15
self           1.13
asca           1.07
iner           1.07
chwitz         1.07
achi           0.96
amar           0.94
anyahu         0.94
chy            0.92
Activations Density: 0.215%