INDEX
Explanations
references to the second person, specifically the use of "you."
New Auto-Interp
Negative Logits
المعيارى
-0.58
:✨
-0.50
ppincott
-0.49
gypte
-0.49
}}}
-0.48
}}$}
-0.47
aarrggbb
-0.47
Италијани
-0.47
aternion
-0.47
ksanakan
-0.47
POSITIVE LOGITS
You
0.90
You
0.81
There
0.49
It
0.48
Your
0.48
It
0.47
If
0.47
YOU
0.47
There
0.46
If
0.46
Activations Density 0.022%