INDEX
    Explanations

    numeric references and bibliographic citations

    New Auto-Interp
    Negative Logits
    ailer
    -0.17
    arov
    -0.16
    inkel
    -0.16
    oidal
    -0.16
    acker
    -0.16
    atra
    -0.15
    arken
    -0.15
    anger
    -0.15
    ellow
    -0.15
    dál
    -0.15
    POSITIVE LOGITS
    éļĨ
    0.14
     Pere
    0.14
    orz
    0.14
     Silver
    0.14
    زاÙħ
    0.14
     दब
    0.14
    etat
    0.13
    ابت
    0.13
    berapa
    0.13
     Mug
    0.13
    Act Density 0.013%

    No Known Activations