    Explanations

    proper nouns, particularly names of people

    Negative Logits
     autorytatywna    -0.59
    Diweddarwch       -0.53
    (;;)              -0.50
    __':\n            -0.48
    ISupport          -0.48
    :]:               -0.47
    "}\               -0.47
     @"/              -0.46
     चीज़ों             -0.46
    __':              -0.45
    Positive Logits
     producteurs      0.51
     człowiek         0.51
     collaborateurs   0.50
    MethodManager     0.48
     Syrie            0.48
     Romains          0.48
     Inscrivez        0.47
    parsedMessage     0.47
     kimdir           0.47
     consommateurs    0.46
    Activation Density 0.346%

    No Known Activations