Explanations
personal commands or instructions
the second-person pronoun "you" and its variants
Negative Logits
ンジ            -0.68
Dimension     -0.65
Submission    -0.63
Ambro         -0.62
burgh         -0.60
interstitial  -0.58
Gamb          -0.58
ilib          -0.58
ogenesis      -0.57
Rosenthal     -0.57
Positive Logits
're           1.28
wanna         1.01
want          0.99
've           0.97
guessed       0.94
guys          0.90
wish          0.87
subscribed    0.86
weren         0.84
subscribe     0.84
Activations Density 0.094%