INDEX
Explanations
phrases related to the concept of 'doing nothing' or the lack of action
New Auto-Interp
Negative Logits
rok
-0.17
Ľ°
-0.16
umann
-0.15
gaard
-0.14
ities
-0.14
åŀ
-0.14
pedia
-0.14
Ùĥار
-0.14
ud
-0.13
ãĥ¼ãĥ
-0.13
POSITIVE LOGITS
directly
0.22
irect
0.17
DIRECT
0.17
diret
0.17
Direct
0.17
.direct
0.16
Direct
0.16
direct
0.16
_DIRECT
0.15
енз
0.15
Activations Density 0.009%