INDEX
    Explanations

    names of individuals involved in a news story or scandal

    mentions of specific individuals, particularly those involved in a scandal

    Negative Logits
    inth       -0.77
    ãĤ©        -0.73
    dinand     -0.72
    ALLY       -0.71
    ãĥ¤        -0.70
    д          -0.69
    */(        -0.69
    eer        -0.69
    ICS        -0.68
    nesota     -0.68
    Positive Logits
     Duffy     0.79
     Chow      0.65
    igger      0.64
    dry        0.64
    enhagen    0.64
     wig       0.64
     Races     0.64
    cake       0.62
     itching   0.62
    orks       0.61
    Act Density 0.014%
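
    For context on the density figure, here is a minimal sketch of how such a number could be computed, assuming "Act Density" means the fraction of token positions on which the feature fires (activation above zero). That reading is an assumption, as are the helper name activation_density and the sample data; neither comes from this page.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Hypothetical helper: assumes density = (active positions) / (all positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up sample: 100,000 token positions, 14 of which activate.
sample = [0.0] * 100_000
for i in range(14):
    sample[i * 7_000] = 1.5  # arbitrary positive activation value

density = activation_density(sample)
print(f"Act Density {density:.3%}")
```

    Under that assumption, a density of 0.014% corresponds to roughly 14 active positions per 100,000 tokens.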

    No Known Activations