INDEX
    Explanations

    proper nouns related to names and places

    New Auto-Interp
    Negative Logits
     faits
    -0.51
    esis
    -0.50
    pt
    -0.46
     Faça
    -0.45
    רג
    -0.42
    iry
    -0.42
     패
    -0.42
     oph
    -0.41
    прав
    -0.41
     nao
    -0.41
    POSITIVE LOGITS
     Paglinawan
    1.14
     surla
    0.82
     Савезне
    0.78
    AsUp
    0.75
     >=",
    0.74
     ویکی‌پدیای
    0.73
    
    0.73
     Signalez
    0.71
    SharedDtor
    0.71
     createState
    0.69
    Act Density 0.950%

    No Known Activations