INDEX
    Explanations

    expressions of negation or avoidance

    Negative Logits
    "ViewInit"          -0.81
    "InjectAttribute"   -0.70
    " &___"             -0.69
    "CppMethod"         -0.67
    " kasarigan"        -0.65
    " résulte"          -0.64
    " ProtoMessage"     -0.64
    "quelize"           -0.63
    "ClientSize"        -0.63
    " Normdatei"        -0.63
    Positive Logits
    "试试"              0.69
    " chande"           0.67
    " kans"             0.60
    "ArgsConstructor"   0.59
    " Conley"           0.59
    " Pues"             0.58
    " favore"           0.58
    "干脆"              0.58
    "atalos"            0.58
    "gettes"            0.57
    Activation Density: 0.053%

    No Known Activations