INDEX
    Explanations

    phrases indicating comparisons and contrasts

    New Auto-Interp
    Negative Logits
    لس
    -0.47
     hun
    -0.47
     nour
    -0.47
     גב
    -0.44
    ANS
    -0.43
    ueses
    -0.43
    -0.43
    eps
    -0.43
     Class
    -0.42
    sika
    -0.42
    POSITIVE LOGITS
     disambiguazione
    0.86
     مرئيه
    0.77
    MLLoader
    0.71
     تضيفلها
    0.71
     Савезне
    0.71
    SBATCH
    0.69
     compared
    0.66
    endregion
    0.66
    GIVEREF
    0.65
     كومونز
    0.65
    Act Density 0.264%

    No Known Activations