INDEX
    Explanations

    proper nouns, specifically names of people and organizations

    authors and researchers

    New Auto-Interp
    Negative Logits
     inmig
    -0.38
    Географи
    -0.36
    SerializedName
    -0.35
    Билгалдахарш
    -0.34
    })));
    -0.34
    Controllo
    -0.34
     Sedangkan
    -0.34
     Manusia
    -0.34
    ortunadamente
    -0.33
     useRef
    -0.33
    POSITIVE LOGITS
     queſto
    0.55
     оригіналу
    0.54
    expandindo
    0.52
    0.52
    0.50
    Насе
    0.50
    Ӕ
    0.50
    raulic
    0.49
     Pauli
    0.49
    tius
    0.49
    Act Density 0.061%

    No Known Activations