INDEX
Explanations
pronouns followed by the verb "say" or "regard"
New Auto-Interp
Negative Logits
ancial
-0.59
Screw
-0.57
ammy
-0.57
surprises
-0.56
refresh
-0.56
concentration
-0.56
hang
-0.55
Supply
-0.55
YES
-0.55
compatibility
-0.54
POSITIVE LOGITS
termed
1.49
dubbed
1.16
euphem
1.10
deems
1.10
called
1.09
dub
1.09
called
1.08
deem
1.06
deemed
1.03
perceive
1.00
Activations Density 0.121%