    NEGATIVE LOGITS
    Token              Value
    "n"                0.77
    "na"               0.74
    "nie"              0.68
    "nth"              0.68
    "no"               0.66
    " satisfaction"    0.66
    "true"             0.65
    (blank)            0.65
    "nor"              0.64
    "ן"                0.64
    POSITIVE LOGITS
    Token              Value
    (blank)            1.00
    " Crime"           1.00
    (blank)            0.99
    "Соцмережа"        0.98
    " diálogo"         0.97
    "መር"               0.97
    " anunciar"        0.96
    " ሽፋ"              0.96
    " Ispol"           0.95
    " მიმოწერა"        0.95
    Act Density 0.000%

    No Known Activations