INDEX
    Explanations

    proper nouns, specifically names and abbreviations related to individuals and organizations

    New Auto-Interp
    Negative Logits
    ^(@)
    -0.71
     pleaſure
    -0.71
    __':
    
    -0.70
    SupportActionBar
    -0.69
    وأضاف
    -0.69
    <?
    -0.68
     Preferencias
    -0.68
    findpost
    -0.68
    󠁧
    -0.67
    CrossRef
    -0.66
    POSITIVE LOGITS
     leads
    0.51
     Italijanski
    0.50
    Bibliograf
    0.49
    mobileqq
    0.49
     atve
    0.48
     det
    0.46
     Werk
    0.45
    водства
    0.43
    leads
    0.43
    ftagPool
    0.41
    Act Density 0.762%

    No Known Activations