Explanations
the word "you" with a strong match
references to the word "you."
Negative Logits
WATCHED      -0.63
ulous        -0.61
)]           -0.58
ãĤ´ãĥ³       -0.55
ccording     -0.54
stad         -0.54
ãģ®å®        -0.54
Government   -0.53
âĢİ          -0.52
ischer       -0.51
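Several of the tokens above (for example ãĤ´ãĥ³ and âĢİ) look like the printable stand-ins that byte-level BPE vocabularies use for raw bytes. The following is a minimal sketch for turning them back into text, assuming the vocabulary follows the GPT-2 byte-to-character convention; the dashboard itself does not say which tokenizer produced these tokens, and the helper names `bytes_to_unicode` and `decode_bpe_token` are illustrative only.

```python
def bytes_to_unicode():
    """Byte -> printable character table in the GPT-2 byte-level BPE style.

    Assumption: the tokens on this page use this convention.
    """
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted to code points 256 and above.
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return dict(zip(byte_values, map(chr, code_points)))


# Inverse table: stand-in character -> original byte value.
_CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_bpe_token(token: str) -> str:
    """Map each stand-in character back to its byte, then decode as UTF-8.

    Tokens that end in the middle of a multi-byte character cannot be fully
    decoded; the incomplete tail is replaced with U+FFFD.
    """
    raw = bytes(_CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĤ´ãĥ³", "ãģ®å®", "âĢİ", "WATCHED"]:
        print(repr(tok), "->", repr(decode_bpe_token(tok)))
```

Under that assumption, ãĤ´ãĥ³ decodes to the katakana ゴン and âĢİ to the invisible left-to-right mark (U+200E). ãģ®å® decodes to の followed by an incomplete byte sequence, which is expected when a token boundary falls inside a multi-byte character.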
Positive Logits
you          2.68
you          2.17
YOU          1.94
You          1.69
ya           1.68
You          1.67
your         1.63
YOU          1.53
yours        1.45
yourself     1.37
Activations Density 0.291%
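As a rough illustration of what a density figure like this can mean, the sketch below treats activation density as the fraction of sampled tokens on which the feature has a nonzero activation. That definition is an assumption, since the page does not state how the number was computed, and the function name `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: density means "share of tokens where the feature fires";
    the dashboard does not define it explicitly.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


if __name__ == "__main__":
    # Hypothetical sample: 291 firing tokens out of 100,000.
    sample = [0.0] * 99_709 + [1.0] * 291
    print(f"{activation_density(sample):.3%}")
```

With this made-up sample, the feature fires on 291 of 100,000 tokens, which corresponds to the 0.291% shown above.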