INDEX
Explanations
pronouns 'it' or 'It' followed by either a verb or a possessive pronoun
Negative Logits
Token        Logit
Dayton       -0.65
Priv         -0.64
hips         -0.61
Eighth       -0.60
Polk         -0.57
Unlimited    -0.57
mission      -0.55
Tur          -0.55
Friend       -0.54
package      -0.54
POSITIVE LOGITS
Token        Logit
unes         1.23
alian        1.19
chy          1.17
seems        1.15
zbollah      1.09
iner         1.04
self         1.03
asca         1.01
ain          0.99
'll          0.99
Activations Density 1.932%