INDEX
    Explanations

    Japanese pronouns

    Negative Logits
    Token           Logit
    " nhanh"        -0.07
    "рою"           -0.06
    "_Pos"          -0.06
    " Range"        -0.06
    " pregnant"     -0.06
    " amazed"       -0.06
    " Ст"           -0.06
    " charitable"   -0.06
    "Links"         -0.06
    " (++"          -0.06

    Positive Logits
    Token           Logit
    " männ"         0.08
    "nahme"         0.07
    "mpz"           0.07
    " scarc"        0.06
    "CHANNEL"       0.06
    "//------------------------------------------------"  0.06
    "agne"          0.06
    " zend"         0.06
    "\tadmin"       0.06
    " інші"         0.06

    Activation Density: 0.044%

    No Known Activations