Explanations
proper nouns
occurrences of the word "It" indicating a shift to a new subject or point in the text
Negative Logits
hips         -0.68
Polk         -0.66
Dayton       -0.61
Tur          -0.61
ded          -0.59
Pearce       -0.58
ears         -0.58
Uni          -0.57
household    -0.57
gift         -0.57
Positive Logits
self         1.24
zbollah      1.08
unes         1.08
ueller       1.06
chwitz       1.01
chy          0.97
zik          0.95
iner         0.93
anyahu       0.91
alm          0.91
Activations Density: 0.265%