INDEX
    Explanations

    mentions of social media and online interactions

    New Auto-Interp
    Negative Logits
     مؤرشف
    -0.61
     XNUMX
    -0.60
     Jefus
    -0.59
    ">—
    -0.56
    олові
    -0.55
     kaynağından
    -0.52
     }}$}
    -0.51
    ".
    
    -0.51
    ()['
    -0.51
    -0.50
    POSITIVE LOGITS
     @
    3.39
    @
    3.04
    (@
    2.14
    ,@
    2.07
     (@
    2.07
     "@
    1.91
    /@
    1.90
    =@
    1.89
    .@
    1.87
    @_
    1.87
    Act Density 0.408%

    No Known Activations