INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     mengumumkan
    -1.01
     quelconque
    -1.00
    ñores
    -0.97
    Андрей
    -0.94
    apk
    -0.90
     съм
    -0.86
    Откры
    -0.86
    Ито
    -0.86
    Глава
    -0.86
    Що
    -0.85
    POSITIVE LOGITS
     June
    1.07
     December
    1.04
     January
    1.04
     July
    1.02
     October
    1.01
     September
    1.00
     March
    0.98
     and
    0.96
    ieu
    0.95
    /*",
    0.94
    Act Density 0.010%

    No Known Activations