INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     faker
    -0.08
     hashtags
    -0.08
    -0.07
     prisma
    -0.07
    í
    -0.07
    -0.07
     chefs
    -0.07
     bakery
    -0.07
    -0.07
    hashtags
    -0.07
    POSITIVE LOGITS
     القديمة
    0.17
     äldre
    0.17
     eski
    0.17
     outdated
    0.17
     older
    0.16
    0.16
     oudere
    0.16
     पुराने
    0.16
    -era
    0.16
     പഴ
    0.16
    Act Density 0.086%

    No Known Activations