INDEX
Explanations
words related to shoving or pushing actions
New Auto-Interp
Negative Logits
zek
-0.17
minority
-0.16
Vend
-0.15
aeda
-0.14
ynchronously
-0.14
Ã¥l
-0.14
Minority
-0.14
738
-0.14
itu
-0.14
359
-0.14
POSITIVE LOGITS
sh
0.21
vier
0.17
enan
0.16
oulder
0.16
alink
0.15
zung
0.14
sh
0.14
ãĥ«ãĥķ
0.14
(sh
0.14
adv
0.14
Activations Density 0.023%