INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     dame
    -0.08
     ഉള്ള
    -0.08
     unanim
    -0.08
     fades
    -0.08
    Campo
    -0.08
     Ugly
    -0.08
     Bianca
    -0.08
     chứa
    -0.08
     getroffen
    -0.08
     icy
    -0.07
    POSITIVE LOGITS
    成本
    0.14
     incurred
    0.14
     costos
    0.13
     coûts
    0.12
     expenditure
    0.12
     ખર્ચ
    0.12
     खर्च
    0.11
    -cost
    0.11
     비용
    0.11
    _cost
    0.11
    Act Density 0.027%

    No Known Activations