INDEX
    Explanations

    phrases related to maintenance or continuity

    New Auto-Interp
    Negative Logits
    -1.06
    ValueStyle
    -0.95
    Portale
    -0.85
    OGND
    -0.81
    Personensuche
    -0.80
     geldt
    -0.76
     Adamson
    -0.75
     Dorsey
    -0.74
     băr
    -0.71
     ModelRenderer
    -0.71
    POSITIVE LOGITS
     keep
    1.42
     KEEP
    1.39
     kept
    1.34
    keep
    1.34
    Keeps
    1.32
    KEEP
    1.32
     Keep
    1.30
    Keep
    1.24
     keeps
    1.24
    keeps
    1.18
    Act Density 0.045%

    No Known Activations