INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    in
    0.89
    im
    0.71
    on
    0.68
    as
    0.68
    ata
    0.65
    abouts
    0.63
    cations
    0.63
    ó
    0.61
     to
    0.61
    iv
    0.59
    POSITIVE LOGITS
     mencegah
    0.59
     colaboradores
    0.56
     jangan
    0.55
     menyatakan
    0.54
     rabbi
    0.54
     dati
    0.54
     menyebut
    0.53
    0.53
     domenica
    0.53
     rispondere
    0.53
    Act Density 0.234%

    No Known Activations