INDEX
    Explanations

    phrases indicating passive voice

    New Auto-Interp
    Negative Logits
     disappe
    -0.16
     Hüs
    -0.16
    osloven
    -0.16
    EditingStyle
    -0.16
    quil
    -0.15
    AFX
    -0.15
     поба
    -0.15
     ilet
    -0.15
    ControlItem
    -0.15
    stanov
    -0.14
    POSITIVE LOGITS
     virtue
    0.38
     means
    0.33
     dint
    0.26
    gone
    0.24
    -products
    0.24
     the
    0.23
     a
    0.23
     default
    0.23
    ron
    0.22
    products
    0.22
    Act Density 0.178%

    No Known Activations