INDEX
    Explanations

    references to maintaining or preserving something

    New Auto-Interp
    Negative Logits
    -0.98
    ValueStyle
    -0.80
    OGND
    -0.78
    digm
    -0.75
    Personensuche
    -0.74
     Dorsey
    -0.73
     băr
    -0.70
     Adamson
    -0.70
    Portale
    -0.69
     cascades
    -0.68
    POSITIVE LOGITS
     keep
    1.67
    Keeps
    1.62
     KEEP
    1.59
    keep
    1.58
    KEEP
    1.58
     kept
    1.56
     Keep
    1.53
     keeps
    1.50
     Keeping
    1.48
    Keep
    1.48
    Act Density 0.047%

    No Known Activations