INDEX
Explanations
expressions of uncertainty or potential actions in conversational contexts
New Auto-Interp
Negative Logits
owi
-0.16
çĦ¡æĸĻ
-0.16
ymi
-0.15
rz
-0.14
arently
-0.14
ORE
-0.14
à¤Ŀ
-0.14
зÑĮ
-0.14
usu
-0.14
rir
-0.14
POSITIVE LOGITS
even
0.20
åIJ§
0.17
algún
0.16
slightly
0.16
bol
0.15
sogar
0.15
qualche
0.15
ought
0.15
maybe
0.15
indeed
0.15
Activations Density 0.054%