INDEX
    Explanations

    references to the media platform "BuzzFeed"

    references to the "Buzzfeed" media outlet

    New Auto-Interp
    Negative Logits
    esthetic
    -0.81
     discipl
    -0.81
    xual
    -0.75
     dedication
    -0.73
    izations
    -0.72
    semble
    -0.71
     restitution
    -0.70
     nonviolent
    -0.70
    egal
    -0.70
     compassionate
    -0.68
    POSITIVE LOGITS
     Buzz
    1.23
    feed
    1.17
    Buzz
    0.96
    ards
    0.95
    Feed
    0.87
    arded
    0.86
    arro
    0.86
    oola
    0.83
     buzz
    0.82
    vine
    0.80
    Act Density 0.003%

    No Known Activations