INDEX
    Explanations
    Negative Logits
    Token           Logit
    " undertake"    -0.06
    "$scope"        -0.06
    " Cancel"       -0.06
    " Occupy"       -0.06
    "/new"          -0.06
    " departing"    -0.06
    " nemoc"        -0.06
    "Submit"        -0.06
    "Geo"           -0.06
    " başına"       -0.06
    Positive Logits
    Token           Logit
    " is"           0.23
    " are"          0.17
    " was"          0.17
    " Is"           0.14
    " isn"          0.13
    "—is"           0.13
    "was"           0.13
    " were"         0.13
    " wasn"         0.13
    ",is"           0.12
    Activation Density: 1.900%

    No Known Activations