INDEX
Explanations
mentions of famous individuals or blockbuster movies
references to high-profile individuals and entertainment entities
Negative Logits
burn     -0.80
swick    -0.79
ns       -0.78
roe      -0.76
yg       -0.74
reb      -0.74
atur     -0.73
spe      -0.73
reen     -0.73
chron    -0.72
Positive Logits
superstar        0.79
unemploy         0.71
franchises       0.67
hips             0.65
markets          0.62
ashtra           0.62
ervatives        0.61
Epstein          0.61
risome           0.61
exponentially    0.60
Activations Density: 0.019%