INDEX
Explanations
verbs indicating performance or effectiveness in various contexts
New Auto-Interp
Negative Logits
acro
-0.16
indeb
-0.15
ption
-0.15
ise
-0.15
ei
-0.14
ipro
-0.14
squash
-0.14
apes
-0.14
raw
-0.14
squ
-0.14
POSITIVE LOGITS
.cum
0.15
ylko
0.14
ERSIST
0.14
edd
0.14
ARGET
0.14
rani
0.14
izik
0.14
|#
0.14
.oper
0.14
AMA
0.14
Activations Density 0.081%