INDEX
Explanations
mentions of celebrities
references to celebrities
New Auto-Interp
Negative Logits
OTS
-0.78
unda
-0.76
¸
-0.76
¾
-0.75
anus
-0.74
¼
-0.73
arten
-0.72
etheless
-0.72
choes
-0.71
YA
-0.71
POSITIVE LOGITS
endors
1.23
rities
1.10
gossip
1.09
endorsements
1.06
chef
1.00
nude
0.92
chefs
0.90
TMZ
0.88
Celeb
0.85
entertain
0.84
Activations Density 0.086%