INDEX
Explanations
phrases related to success and achievement
Negative Logits
_inside   -0.18
inside    -0.17
Inside    -0.15
aring     -0.15
inside    -0.15
داخل      -0.14
_within   -0.14
dentro    -0.14
Inside    -0.14
Dans      -0.14
Positive Logits
ในการ       0.34
towards     0.29
regarding   0.29
toward      0.27
in          0.27
when        0.21
concerning  0.20
Towards     0.19
Towards     0.19
khi         0.18
Activations Density 0.338%