INDEX
Explanations
phrases of advice or observations
the word "you" in various contexts and its associated phrases
New Auto-Interp
Negative Logits
è£ıè
-0.63
ãĥ³ãĤ¸
-0.63
pedia
-0.63
ENDED
-0.61
temp
-0.61
icum
-0.61
Verge
-0.60
aiden
-0.60
Joined
-0.60
IME
-0.59
POSITIVE LOGITS
're
1.41
gotta
1.38
know
1.19
've
1.18
wanna
1.13
guys
1.11
realise
1.08
realize
1.02
cannot
1.01
want
0.98
Activations Density 0.120%