INDEX
Explanations
words and phrases related to offering guidance or advice
New Auto-Interp
Negative Logits
èĢħçļĦ
-0.18
YOUR
-0.17
-your
-0.16
YOUR
-0.16
Your
-0.16
Ihre
-0.16
481
-0.15
hus
-0.15
pite
-0.15
å®¶çļĦ
-0.15
POSITIVE LOGITS
us
0.50
him
0.37
them
0.34
me
0.33
lui
0.23
you
0.22
емÑĥ
0.22
ihm
0.20
ihn
0.20
йомÑĥ
0.20
Activations Density 0.295%