    Explanations

    Twitter handles

    Twitter handles and user mentions in the text

    Negative Logits
     Association       -0.92
     Parish            -0.87
     Evaluation        -0.82
     Reconstruction    -0.81
     Act               -0.80
     CSI               -0.80
     Penal             -0.79
     Transparency      -0.78
     Rend              -0.78
     Advisory          -0.78
    Positive Logits
    john       1.33
    mma        1.33
    mad        1.32
    ngth       1.32
    christ     1.31
    phil       1.30
    brown      1.30
    wild       1.29
    podcast    1.29
    kid        1.28
    Activation Density: 0.178%

    No Known Activations