INDEX
    Explanations

    expressions of comparison or similarity

    New Auto-Interp
    Negative Logits
     οποία
    -0.91
    hip
    -0.72
    ES
    -0.72
     ISR
    -0.72
     Ancona
    -0.71
    cini
    -0.71
    المراجع
    -0.70
    es
    -0.69
     PopupWindow
    -0.68
     UnityEditor
    -0.66
    POSITIVE LOGITS
     LIKE
    1.67
     Like
    1.54
     like
    1.50
    Like
    1.48
    LIKE
    1.47
    like
    1.33
     Likes
    1.13
    likes
    1.09
    Likes
    1.08
     likes
    1.06
    Act Density 0.136%

    No Known Activations