INDEX
Explanations
personal pronouns followed by verbs
the pronoun "you" and its variations, indicating direct address or engagement with the audience
New Auto-Interp
Negative Logits
¿½
-0.76
ĸļ
-0.76
ipal
-0.73
ĨĴ
-0.69
advertisement
-0.69
unic
-0.68
ĺħ
-0.67
=~
-0.65
luck
-0.63
Ĥª
-0.61
POSITIVE LOGITS
guys
1.45
're
1.23
yourselves
1.11
've
1.05
gentlemen
1.00
tub
0.91
mention
0.91
mentioned
0.89
sir
0.87
imply
0.85
Activations Density 0.184%