INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     تضيفلها
    -0.88
     незавершена
    -0.81
    Rujuakan
    -0.77
    Билгалдахарш
    -0.76
    featureID
    -0.73
    addCriterion
    -0.73
    Geplaatst
    -0.73
     الحره
    -0.72
     kasarigan
    -0.71
     propOrder
    -0.70
    POSITIVE LOGITS
     kindly
    0.52
     foil
    0.51
    оле
    0.50
     species
    0.49
     (
    0.48
    mazioni
    0.45
     prior
    0.45
    0.45
     approximately
    0.45
     mainly
    0.45
    Act Density 0.014%

    No Known Activations