INDEX
    Explanations

    phrases indicating frequency or habitual actions

    New Auto-Interp
    Negative Logits
    IntoConstraints
    -0.59
    deinit
    -0.53
     Schild
    -0.50
    IContainer
    -0.49
    shiro
    -0.45
    -0.44
    PreExecute
    -0.43
     diatas
    -0.43
    ladesh
    -0.42
    autonomie
    -0.42
    POSITIVE LOGITS
     often
    1.13
    often
    1.05
    Often
    1.02
     Often
    1.01
     frequently
    0.91
    frequently
    0.88
     oftentimes
    0.87
     spesso
    0.85
     kerap
    0.84
     souvent
    0.82
    Act Density 0.014%

    No Known Activations