INDEX
    Explanations

    references to the social media platform Instagram

    New Auto-Interp
    Negative Logits
    peč
    -0.74
     PFC
    -0.69
    onias
    -0.69
    vindo
    -0.68
    Produkte
    -0.68
     Fuerza
    -0.68
     Lázaro
    -0.68
    TTE
    -0.67
     médical
    -0.66
     Heavens
    -0.66
    POSITIVE LOGITS
     Instagram
    1.28
     instagram
    1.21
    Instagram
    1.15
    INSTAGRAM
    1.00
    instagram
    0.99
     INSTAGRAM
    0.99
     IG
    0.94
    Insta
    0.88
     Insta
    0.82
     insta
    0.79
    Act Density 0.065%

    No Known Activations