INDEX
Explanations
verbs and phrases related to taking action or making attempts
New Auto-Interp
Negative Logits
ish
-0.16
ald
-0.16
æł·çļĦ
-0.15
اÙĨÙĩ
-0.14
ening
-0.14
root
-0.14
away
-0.14
enny
-0.14
head
-0.13
cock
-0.13
POSITIVE LOGITS
mente
0.22
ately
0.17
emente
0.17
itarian
0.16
uppe
0.16
memberOf
0.16
iveness
0.16
mith
0.15
mastur
0.14
ÑģÑĮ
0.14
Activations Density 0.620%