INDEX
    Explanations

    references to social media platforms, particularly Instagram and Twitter

    New Auto-Interp
    Negative Logits
    AndEndTag
    -0.69
     éché
    -0.66
    __':
    -0.63
    __":
    -0.63
    __":
    
    -0.62
    vois
    -0.61
    TTE
    -0.60
     gobiernos
    -0.60
    ]=$
    -0.60
    ">'.$
    -0.59
    POSITIVE LOGITS
     Instagram
    3.17
    Instagram
    2.90
     instagram
    2.75
     INSTAGRAM
    2.25
    instagram
    2.22
    INSTAGRAM
    1.96
     Insta
    1.64
    Insta
    1.51
     insta
    1.42
     Twitter
    1.38
    Act Density 0.066%

    No Known Activations