INDEX
Explanations
phrases indicating passive voice
New Auto-Interp
Negative Logits
disappe
-0.16
Hüs
-0.16
osloven
-0.16
EditingStyle
-0.16
quil
-0.15
AFX
-0.15
поба
-0.15
ilet
-0.15
ControlItem
-0.15
stanov
-0.14
POSITIVE LOGITS
virtue
0.38
means
0.33
dint
0.26
gone
0.24
-products
0.24
the
0.23
a
0.23
default
0.23
ron
0.22
products
0.22
Activations Density 0.178%