INDEX
    Explanations

    terms related to independence and transparency in various contexts

    Negative Logits
    IntoConstraints  -0.69
    +#+#             -0.68
    niająca          -0.63
    ftagPool         -0.60
     насељу          -0.58
    loger            -0.57
    AddTagHelper     -0.57
    HomeAsUpEnabled  -0.57
    كويكب            -0.55
    клопе            -0.55
    Positive Logits
    ly     1.78
    tly    1.42
    tely   1.36
    LY     1.33
    ily    1.30
    ingly  1.29
    mente  1.28
    ally   1.28
    lly    1.27
    ably   1.24
    Act Density 0.613%

    No Known Activations