INDEX
    Explanations

    references to leadership roles and influential positions

    New Auto-Interp
    Negative Logits
    featureID
    -0.56
    ConstraintMaker
    -0.53
    ьаж
    -0.49
     snippetHide
    -0.48
     insuffisamment
    -0.47
     للاسماء
    -0.47
    #
    -0.45
    Erreferentziak
    -0.45
    awaiter
    -0.43
    IsMutable
    -0.43
    POSITIVE LOGITS
     Vorg
    0.42
    clearance
    0.42
     agregado
    0.41
    readObject
    0.40
    返回值
    0.39
     bakom
    0.38
    Longer
    0.38
    ValueStyle
    0.38
    Ghosts
    0.38
     Longer
    0.38
    Act Density 0.201%

    No Known Activations