INDEX
Explanations
specific instances of the word "you" in text
references to the word "you."
New Auto-Interp
Negative Logits
images
-0.72
ice
-0.70
Associated
-0.69
math
-0.69
Dimension
-0.67
Gibbs
-0.67
rss
-0.66
asia
-0.65
Katie
-0.64
Frank
-0.64
POSITIVE LOGITS
're
1.44
've
1.32
'll
1.14
tub
1.11
guys
1.05
'd
1.03
want
0.94
yourselves
0.94
know
0.92
intend
0.91
Activations Density 0.198%