INDEX
Explanations
verbs related to capability and potential
Negative Logits
nez         -0.07
ίÏĥ         -0.07
ucch        -0.06
ÑģÑĤвоÑĢ     -0.06
oure        -0.06
chio        -0.06
oston       -0.06
esco        -0.06
odos        -0.06
WW          -0.06
Positive Logits
leftright   0.07
aren        0.07
caler       0.07
ãĥĨãĥ«       0.07
aben        0.06
ãĥ¼ãĥĵ       0.06
etched      0.06
677         0.06
زÙħ         0.06
anvas       0.06
Activations Density 0.003%