INDEX
Explanations
phrases containing the word "you"
references to the pronoun "you."
New Auto-Interp
Negative Logits
ipal
-0.71
ĨĴ
-0.70
Course
-0.66
aughed
-0.65
acca
-0.62
majority
-0.61
tein
-0.61
asia
-0.60
Agriculture
-0.60
ħ
-0.60
POSITIVE LOGITS
tub
1.24
guys
1.19
're
1.13
RS
1.03
hei
0.85
've
0.83
Tube
0.83
'll
0.79
sir
0.75
ldon
0.75
Activations Density 0.119%