INDEX
Negative Logits
éĻĦ
-0.30
Attached
-0.28
umper
-0.27
thêm
-0.26
append
-0.25
ä¾Ŀ
-0.25
ç²ĺ
-0.25
Append
-0.25
Explicit
-0.25
æľīæĦı
-0.25
POSITIVE LOGITS
recipes
0.27
mine
0.25
often
0.25
cul
0.24
就说
0.24
Recipes
0.23
considered
0.23
cater
0.23
questions
0.23
most
0.23
Activations Density 0.002%