INDEX
    Explanations

    potential actions and abilities

    New Auto-Interp
    Negative Logits
    このように
    0.82
     ಒಂದು
    0.81
    0.77
     ඔහුගේ
    0.77
     било
    0.77
     ഒരു
    0.76
     което
    0.75
    സിന്റെ
    0.75
     അതിന്റെ
    0.73
     grafico
    0.73
    POSITIVE LOGITS
     themselves
    1.33
     whom
    1.14
    whom
    0.94
     willing
    0.92
     who
    0.92
     reputations
    0.92
     sympathize
    0.84
     जिनके
    0.84
     quienes
    0.83
     salaries
    0.83
    Act Density 0.045%

    No Known Activations