INDEX
Explanations
verbs indicating the creation or production of objects or systems
NEGATIVE LOGITS
esin                -0.14
033                 -0.14
ever                -0.14
adel                -0.14
ad                  -0.14
935                 -0.14
[token not shown]   -0.13
asi                 -0.13
yourselves          -0.13
ersist              -0.13
POSITIVE LOGITS
ForRow              0.17
itore               0.16
by                  0.16
otts                0.16
mods                0.15
Ctrls               0.15
SA                  0.15
ILES                0.14
ή                   0.14
zte                 0.14
Activations Density: 0.267%