INDEX
Explanations
instances of the pronoun "you"
New Auto-Interp
Negative Logits
Uncomment
-0.16
too
-0.15
ynet
-0.15
dehy
-0.15
浩
-0.14
debit
-0.14
SYNC
-0.14
ÑĸблÑĸ
-0.14
.Undef
-0.14
ó
-0.14
POSITIVE LOGITS
sh
0.21
seem
0.20
mileage
0.20
said
0.18
seemed
0.18
seems
0.18
stated
0.17
_MIX
0.17
sir
0.17
forgot
0.17
Activations Density 0.091%