INDEX
Explanations
personal pronouns preceding verbs, particularly the pronoun "it" before verbs
the pronoun "it."
Negative Logits
Orn         -0.68
hips        -0.68
Dayton      -0.66
Priv        -0.66
Lar         -0.65
Anat        -0.64
Breast      -0.64
Superior    -0.64
Friend      -0.63
Cour        -0.63
Positive Logits
alian       1.16
self        0.99
ueller      0.99
unes        0.97
asca        0.92
chy         0.84
hers        0.80
zbollah     0.80
heit        0.79
geist       0.77
Activations Density 0.419%