INDEX
    Explanations

    social media references and hashtags

    New Auto-Interp
    Head Attr Weights
    0:0.04
    1:0.03
    2:0.04
    3:0.11
    4:0.07
    5:0.05
    6:0.04
    7:0.05
    8:0.06
    9:0.05
    10:0.08
    11:0.33
    Negative Logits
    flag
    -1.74
    andise
    -1.73
     jam
    -1.63
     anthem
    -1.60
     flag
    -1.57
    76561
    -1.54
     Flag
    -1.52
    endars
    -1.51
    -1.50
    AAF
    -1.50
    POSITIVE LOGITS
     unemploy
    1.75
     confinement
    1.68
    inav
    1.59
     malnutrition
    1.56
    levels
    1.51
     Gord
    1.50
    eger
    1.49
     Processing
    1.47
     Classification
    1.45
     Gupta
    1.44
    Act Density 0.038%

    No Known Activations