INDEX
    Explanations

    phrases that indicate maintaining or keeping something

    New Auto-Interp
    Negative Logits
    -0.70
    ioca
    -0.61
     Tages
    -0.60
    ValueStyle
    -0.59
    znych
    -0.57
     Reisedaten
    -0.57
     secondaires
    -0.56
    NullCheck
    -0.56
     sẻ
    -0.56
     Hej
    -0.55
    POSITIVE LOGITS
     Keeping
    1.75
     kept
    1.72
    Keeping
    1.72
     keep
    1.71
    keeping
    1.69
     keeping
    1.65
    keep
    1.63
    KEEP
    1.63
    kept
    1.60
     Kept
    1.59
    Act Density 0.156%

    No Known Activations