    Negative Logits
    Token        Logit
    " being"     -0.79
    " was"       -0.74
    "being"      -0.68
    "Being"      -0.61
    "."          -0.59
    "k"          -0.59
    "("          -0.57
    "te"         -0.56
    " having"    -0.56
    " BEING"     -0.56
    Positive Logits
    Token                  Logit
    " كومونز"              0.94
    (token not rendered)   0.94
    "ScopeManager"         0.85
    "IVEREF"               0.83
    "SharedDtor"           0.83
    " ModelExpression"     0.80
    "esterday"             0.79
    "ьаж"                  0.74
    "ConstraintMaker"      0.73
    "abestanden"           0.73
    Activation Density: 0.083%

    No Known Activations