INDEX
    Explanations

    references to social media activity, particularly on Instagram

    New Auto-Interp
    Negative Logits
     sentado
    -0.35
     occupa
    -0.35
    løs
    -0.35
    าง
    -0.34
     เบ
    -0.33
     lopp
    -0.33
     Headache
    -0.32
     Nagy
    -0.32
    UseVisualStyle
    -0.32
    เบ
    -0.31
    POSITIVE LOGITS
     Instagram
    1.17
    Instagram
    1.07
     instagram
    1.02
    INSTAGRAM
    0.88
    instagram
    0.87
     INSTAGRAM
    0.86
     اینستاگرام
    0.71
     Insta
    0.67
    :+:
    0.65
    RTEE
    0.61
    Act Density 0.122%

    No Known Activations