INDEX
Explanations
sentences that include the word "you" and related variations
New Auto-Interp
Negative Logits
ÑĢÑĸÑĩ
-0.15
çĴ
-0.15
itness
-0.14
ît
-0.14
æł
-0.13
atır
-0.13
uzz
-0.13
dux
-0.13
جاÙĨ
-0.13
ikler
-0.13
POSITIVE LOGITS
need
0.26
can
0.25
must
0.25
should
0.24
could
0.21
may
0.21
cannot
0.20
basically
0.20
needs
0.20
need
0.19
Activations Density 0.101%