INDEX
Explanations
references to the media platform "BuzzFeed"
references to the "Buzzfeed" media outlet
New Auto-Interp
Negative Logits
esthetic
-0.81
discipl
-0.81
xual
-0.75
dedication
-0.73
izations
-0.72
semble
-0.71
restitution
-0.70
nonviolent
-0.70
egal
-0.70
compassionate
-0.68
POSITIVE LOGITS
Buzz
1.23
feed
1.17
Buzz
0.96
ards
0.95
Feed
0.87
arded
0.86
arro
0.86
oola
0.83
buzz
0.82
vine
0.80
Activations Density 0.003%