INDEX
    Explanations

    proper nouns, particularly names of people and places

    New Auto-Interp
    Negative Logits
     Gallimard
    -0.69
     Cæsar
    -0.66
    :]:
    -0.65
     hvem
    -0.65
     revanche
    -0.65
    appé
    -0.64
    andaag
    -0.64
     zaś
    -0.62
    tersebut
    -0.62
    epam
    -0.62
    POSITIVE LOGITS
     تضيفلها
    0.93
     McN
    0.77
    CopyWith
    0.69
     Familienname
    0.64
    leh
    0.63
     Савезне
    0.62
     szóci
    0.61
    PreferredItem
    0.60
    SharedDtor
    0.59
    oul
    0.58
    Act Density 0.837%

    No Known Activations